Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025
This study proposes a Bayesian method for diagnostic classification models (DCMs) with a partially known Q-matrix, a setting that lies between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have prior knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…
Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods
Lawrence Scahill; Luc Lecavalier; Michael C. Edwards; Megan L. Wenzell; Leah M. Barto; Arielle Mulligan; Auscia T. Williams; Opal Ousley; Cynthia B. Sinha; Christopher A. Taylor; Soo Youn Kim; Laura M. Johnson; Scott E. Gillespie; Cynthia R. Johnson – Autism: The International Journal of Research and Practice, 2024
This report presents a new parent-rated outcome measure of insomnia for children with autism spectrum disorder. Parents of 1185 children with autism spectrum disorder (aged 3-12; 80.3% male) completed the first draft of the measure online. Factor and item response theory analyses reduced the set of 40 items to the final 21-item Pediatric Insomnia…
Descriptors: Autism Spectrum Disorders, Children, Sleep, Test Construction
Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…
Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis
Ute Knoch; Jason Fan – Language Testing, 2024
While several test concordance tables have been published, the research underpinning such tables has rarely been examined in detail. This study aimed to survey the publicly available studies or documentation underpinning the test concordance tables of the providers of four major international language tests, all accepted by the Australian…
Descriptors: Language Tests, English, Test Validity, Item Analysis
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025
This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…
Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis
Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024
Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…
Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales
Alexandru Cernat; Joseph Sakshaug; Pablo Christmann; Tobias Gummer – Sociological Methods & Research, 2024
Mixed-mode surveys are popular as they can save costs and maintain (or improve) response rates relative to single-mode surveys. Nevertheless, it is not yet clear how design decisions like survey mode or questionnaire length impact measurement quality. In this study, we compare measurement quality in an experiment of three distinct survey designs…
Descriptors: Surveys, Questionnaires, Item Analysis, Attitude Measures
Sijia Huang; Dubravka Svetina Valdivia – Educational and Psychological Measurement, 2024
Identifying items with differential item functioning (DIF) in an assessment is a crucial step for achieving equitable measurement. One critical issue that has not been fully addressed with existing studies is how DIF items can be detected when data are multilevel. In the present study, we introduced a Lord's Wald X² test-based…
Descriptors: Item Analysis, Item Response Theory, Algorithms, Accuracy
Zsuzsa Bakk – Structural Equation Modeling: A Multidisciplinary Journal, 2024
A standard assumption of latent class (LC) analysis is conditional independence, that is, the items are independent of the covariates given the LC membership. Several approaches have been proposed for identifying violations of this assumption. The recently proposed likelihood ratio approach is compared to residual statistics (bivariate residuals…
Descriptors: Goodness of Fit, Error of Measurement, Comparative Analysis, Models
Hauke Hermann; Annemieke Witte; Gloria Kempelmann; Brian F. Barrett; Sandra Zaal; Jolanda Vonk; Filip Morisse; Anna Pöhlmann; Paula S. Sterkenburg; Tanja Sappok – Journal of Applied Research in Intellectual Disabilities, 2024
Background: Valid and reliable instruments for measuring emotional development are critical for a proper diagnostic assignment in individuals with intellectual disabilities. This exploratory study examined the psychometric properties of the items on the Scale of Emotional Development--Short (SED-S). Method: The sample included 612 adults with…
Descriptors: Measures (Individuals), Emotional Development, Intellectual Disability, Psychometrics
Hongwen Guo; Matthew S. Johnson; Daniel F. McCaffrey; Lixong Gu – ETS Research Report Series, 2024
The multistage testing (MST) design has been gaining attention and popularity in educational assessments. For testing programs that have small test-taker samples, it is challenging to calibrate new items to replenish the item pool. In the current research, we used the item pools from an operational MST program to illustrate how research studies…
Descriptors: Test Items, Test Construction, Sample Size, Scaling
Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023
Introduction: Item response theory (IRT) has received much attention in the validation of assessment instruments because it allows the estimation of students' ability from any set of items. Item response theory allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…
Descriptors: Item Response Theory, Models, Test Items, Difficulty Level
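The item difficulty and discrimination parameters this entry refers to appear in the standard two-parameter logistic (2PL) IRT model. A minimal sketch of that model (function and parameter names are illustrative, not taken from the article):

```python
import math

def p_correct(theta, a, b):
    """2PL IRT item response function: probability of a correct
    response given ability theta, discrimination a, and difficulty b."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

# When ability equals difficulty (theta == b), the probability is 0.5
# regardless of discrimination.
print(p_correct(0.0, 1.2, 0.0))  # → 0.5
```

A larger discrimination `a` makes the curve steeper around `theta == b`, which is what lets the item separate examinees near that ability level.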
Zhang, Susu; Li, Anqi; Wang, Shiyu – Educational Measurement: Issues and Practice, 2023
In computer-based tests allowing revision and reviews, examinees' sequence of visits and answer changes to questions can be recorded. The variable-length revision log data introduce new complexities to the collected data but, at the same time, provide additional information on examinees' test-taking behavior, which can inform test development and…
Descriptors: Computer Assisted Testing, Test Construction, Test Wiseness, Test Items
Sweeney, Sandra M.; Sinharay, Sandip; Johnson, Matthew S.; Steinhauer, Eric W. – Educational Measurement: Issues and Practice, 2022
The focus of this paper is on the empirical relationship between item difficulty and item discrimination. Two studies--an empirical investigation and a simulation study--were conducted to examine the association between item difficulty and item discrimination under classical test theory and item response theory (IRT), and the effects of the…
Descriptors: Correlation, Item Response Theory, Item Analysis, Difficulty Level
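The classical-test-theory side of the difficulty/discrimination relationship examined in this entry can be sketched as follows: difficulty as the proportion correct, and discrimination as the item–total correlation. This is a generic illustration under those standard definitions, not the paper's code:

```python
def item_stats(responses):
    """Classical test theory item statistics for 0/1-scored items.
    responses: list of per-examinee lists of item scores.
    Returns, per item, (difficulty p, item-total correlation r)."""
    n_items = len(responses[0])
    totals = [sum(r) for r in responses]
    mean_t = sum(totals) / len(totals)
    stats = []
    for j in range(n_items):
        scores = [r[j] for r in responses]
        p = sum(scores) / len(scores)  # difficulty: proportion correct
        cov = sum((s - p) * (t - mean_t) for s, t in zip(scores, totals))
        var_s = sum((s - p) ** 2 for s in scores)
        var_t = sum((t - mean_t) ** 2 for t in totals)
        r = cov / (var_s * var_t) ** 0.5 if var_s and var_t else 0.0
        stats.append((p, r))
    return stats

stats = item_stats([[1, 1], [1, 0], [0, 0], [1, 1]])
print(stats[0])  # first item: difficulty 0.75, positive discrimination
```

Note that the item–total correlation computed this way includes the item itself in the total, which inflates it slightly; operational analyses often use a corrected (item-excluded) total.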
Stephanie M. Bell; R. Philip Chalmers; David B. Flora – Educational and Psychological Measurement, 2024
Coefficient omega indices are model-based composite reliability estimates that have become increasingly popular. A coefficient omega index estimates how reliably an observed composite score measures a target construct as represented by a factor in a factor-analysis model; as such, the accuracy of omega estimates is likely to depend on correct…
Descriptors: Influences, Models, Measurement Techniques, Reliability
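For a one-factor model, the coefficient omega index discussed in this entry has a simple closed form: the squared sum of loadings over the squared sum of loadings plus the summed residual variances. A minimal sketch under that assumption (names are illustrative):

```python
def coefficient_omega(loadings, residual_variances):
    """Coefficient omega for a one-factor model:
    omega = (sum lambda)^2 / ((sum lambda)^2 + sum theta)."""
    s = sum(loadings)
    return s * s / (s * s + sum(residual_variances))

# Four items, standardized loadings 0.7, residual variances 1 - 0.7^2.
omega = coefficient_omega([0.7] * 4, [0.51] * 4)
print(round(omega, 3))
```

Because the estimate is built from the fitted factor model's loadings and residual variances, a misspecified model (the article's concern) propagates directly into the omega estimate.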