ERIC - Search Results

Publication Date

In 2026	0
Since 2025	3
Since 2022 (last 5 years)	12
Since 2017 (last 10 years)	24
Since 2007 (last 20 years)	61

Descriptor

Comparative Analysis	99
Test Length	99
Item Response Theory	42
Test Items	40
Sample Size	31
Computer Assisted Testing	28
Simulation	27
Adaptive Testing	20
Test Format	20
Error of Measurement	17
Scores	17
Statistical Analysis	16
Test Reliability	16
Item Analysis	14
Models	14
Correlation	13
Monte Carlo Methods	13
Test Validity	13
Accuracy	12
Difficulty Level	12
Higher Education	12
Computation	11
Mathematical Models	11
Maximum Likelihood Statistics	11
Classification	10
More ▼

Publication Type

Reports - Research	66
Journal Articles	56
Speeches/Meeting Papers	20
Reports - Evaluative	19
Dissertations/Theses -…	12
Numerical/Quantitative Data	2
Tests/Questionnaires	2
Information Analyses	1
Reports - Descriptive	1

Education Level

Higher Education	8
Postsecondary Education	6
Elementary Secondary Education	3
Secondary Education	3
Elementary Education	2
High Schools	2
Grade 6	1
Grade 7	1
Intermediate Grades	1
Middle Schools	1

Audience

Researchers

Location

Turkey	4
Asia	1
Canada	1
China	1
Michigan	1
Netherlands	1
Singapore	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	3
Wechsler Adult Intelligence…	3
Kaufman Brief Intelligence…	2
Minnesota Multiphasic…	2
ACTFL Oral Proficiency…	1
Advanced Placement…	1
Center for Epidemiologic…	1
Law School Admission Test	1
Marlowe Crowne Social…	1
NEO Five Factor Inventory	1
Program for International…	1
SAT (College Admission Test)	1
School and College Ability…	1
Sensation Seeking Scale	1
Trends in International…	1
Wechsler Individual…	1
Wechsler Intelligence Scale…	1
More ▼

What Works Clearinghouse Rating

Comparative Analysis X

Showing 16 to 30 of 99 results Save | Export

A Comparison of Score Aggregation Methods for Unidimensional Tests on Different Dimensions. Research Report. ETS RR-18-01

Peer reviewed
PDF on ERIC

Download full text

Fu, Jianbin; Feng, Yuling – ETS Research Report Series, 2018

In this study, we propose aggregating test scores with unidimensional within-test structure and multidimensional across-test structure based on a 2-level, 1-factor model. In particular, we compare 6 score aggregation methods: average of standardized test raw scores (M1), regression factor score estimate of the 1-factor model based on the…

Descriptors: Comparative Analysis, Scores, Correlation, Standardized Tests

The Impact of Q-Matrix Designs on Diagnostic Classification Accuracy in the Presence of Attribute Hierarchies

Peer reviewed

Direct link

Liu, Ren; Huggins-Manley, Anne Corinne; Bradshaw, Laine – Educational and Psychological Measurement, 2017

There is an increasing demand for assessments that can provide more fine-grained information about examinees. In response to the demand, diagnostic measurement provides students with feedback on their strengths and weaknesses on specific skills by classifying them into mastery or nonmastery attribute categories. These attributes often form a…

Descriptors: Matrices, Classification, Accuracy, Diagnostic Tests

Assessing the Performance of Classical Test Theory Item Discrimination Estimators in Monte Carlo Simulations

Peer reviewed

Direct link

Bazaldua, Diego A. Luna; Lee, Young-Sun; Keller, Bryan; Fellers, Lauren – Asia Pacific Education Review, 2017

The performance of various classical test theory (CTT) item discrimination estimators has been compared in the literature using both empirical and simulated data, resulting in mixed results regarding the preference of some discrimination estimators over others. This study analyzes the performance of various item discrimination estimators in CTT:…

Descriptors: Test Items, Monte Carlo Methods, Item Response Theory, Correlation

Comparative Analyses of MIRT Models and Software (BMIRT and flexMIRT)

Peer reviewed

Direct link

Yavuz, Guler; Hambleton, Ronald K. – Educational and Psychological Measurement, 2017

Application of MIRT modeling procedures is dependent on the quality of parameter estimates provided by the estimation software and techniques used. This study investigated model parameter recovery of two popular MIRT packages, BMIRT and flexMIRT, under some common measurement conditions. These packages were specifically selected to investigate the…

Descriptors: Item Response Theory, Models, Comparative Analysis, Computer Software

Are the Nonparametric Person-Fit Statistics More Powerful than Their Parametric Counterparts? Revisiting the Simulations in Karabatsos (2003)

Peer reviewed

Direct link

Sinharay, Sandip – Applied Measurement in Education, 2017

Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristics curves and found the "H[superscript T]" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…

Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis

A Fair Comparison of the Performance of Computerized Adaptive Testing and Multistage Adaptive Testing

Direct link

Wang, Keyin – ProQuest LLC, 2017

The comparison of item-level computerized adaptive testing (CAT) and multistage adaptive testing (MST) has been researched extensively (e.g., Kim & Plake, 1993; Luecht et al., 1996; Patsula, 1999; Jodoin, 2003; Hambleton & Xing, 2006; Keng, 2008; Zheng, 2012). Various CAT and MST designs have been investigated and compared under the same…

Descriptors: Comparative Analysis, Computer Assisted Testing, Adaptive Testing, Test Items

ACTFL Oral Proficiency Interview -- Computer (OPIc)

Peer reviewed

Direct link

Isbell, Dan; Winke, Paula – Language Testing, 2019

The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…

Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning

Comparison of Two Test Methods for VIS: Paper-Pencil Test and CAT

Peer reviewed

Direct link

Senel, Selma; Kutlu, Ömer – European Journal of Special Needs Education, 2018

This paper examines listening comprehension skills of visually impaired students (VIS) using computerised adaptive testing (CAT) and reader-assisted paper-pencil testing (raPPT) and student views about them. Explanatory mixed method design was used in this study. Sample is comprised of 51 VIS, in 7th and 8th grades. 9 of these students were…

Descriptors: Computer Assisted Testing, Adaptive Testing, Visual Impairments, Student Attitudes

On the Issue of Item Selection in Computerized Adaptive Testing with Response Times

Peer reviewed

Direct link

Veldkamp, Bernard P. – Journal of Educational Measurement, 2016

Many standardized tests are now administered via computer rather than paper-and-pencil format. The computer-based delivery mode brings with it certain advantages. One advantage is the ability to adapt the difficulty level of the test to the ability level of the test taker in what has been termed computerized adaptive testing (CAT). A second…

Descriptors: Computer Assisted Testing, Reaction Time, Standardized Tests, Difficulty Level

Effect of Differential Item Functioning on Test Equating

Peer reviewed
PDF on ERIC

Download full text

Kabasakal, Kübra Atalay; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2015

This study examines the effect of differential item functioning (DIF) items on test equating through multilevel item response models (MIRMs) and traditional IRMs. The performances of three different equating models were investigated under 24 different simulation conditions, and the variables whose effects were examined included sample size, test…

Descriptors: Test Bias, Equated Scores, Item Response Theory, Simulation

In Search of the Optimal Number of Response Categories in a Rating Scale

Peer reviewed

Direct link

Lee, Jihyun; Paek, Insu – Journal of Psychoeducational Assessment, 2014

Likert-type rating scales are still the most widely used method when measuring psychoeducational constructs. The present study investigates a long-standing issue of identifying the optimal number of response categories. A special emphasis is given to categorical data, which were generated by the Item Response Theory (IRT) Graded-Response Modeling…

Descriptors: Likert Scales, Responses, Item Response Theory, Classification

The Effects of Extended Time on Writing Performance

Peer reviewed
PDF on ERIC

Download full text

Goegan, Lauren D.; Harrison, Gina L. – Learning Disabilities: A Contemporary Journal, 2017

The effects of extended time on the writing performance of university students with learning disabilities (LD) was examined. Thirty-eight students (19 LD; 19 non-LD) completed a collection of cognitive, linguistic, and literacy measures, and wrote essays under regular and extended time conditions. Limited evidence was found to support the…

Descriptors: Foreign Countries, Undergraduate Students, Testing Accommodations, Learning Disabilities

Comparing Three Estimation Methods for the Three-Parameter Logistic IRT Model

Direct link

Lamsal, Sunil – ProQuest LLC, 2015

Different estimation procedures have been developed for the unidimensional three-parameter item response theory (IRT) model. These techniques include the marginal maximum likelihood estimation, the fully Bayesian estimation using Markov chain Monte Carlo simulation techniques, and the Metropolis-Hastings Robbin-Monro estimation. With each…

Descriptors: Item Response Theory, Monte Carlo Methods, Maximum Likelihood Statistics, Markov Processes

Accuracy and Variability of Item Parameter Estimates from Marginal Maximum a Posteriori Estimation and Bayesian Inference via Gibbs Samplers

Direct link

Wu, Yi-Fang – ProQuest LLC, 2015

Item response theory (IRT) uses a family of statistical models for estimating stable characteristics of items and examinees and defining how these characteristics interact in describing item and test performance. With a focus on the three-parameter logistic IRT (Birnbaum, 1968; Lord, 1980) model, the current study examines the accuracy and…

Descriptors: Item Response Theory, Test Items, Accuracy, Computation

Comparing Performances (Type I Error and Power) of IRT Likelihood Ratio SIBTEST and Mantel-Haenszel Methods in the Determination of Differential Item Functioning

Peer reviewed
PDF on ERIC

Download full text

Atalay Kabasakal, Kübra; Arsan, Nihan; Gök, Bilge; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2014

This simulation study compared the performances (Type I error and power) of Mantel-Haenszel (MH), SIBTEST, and item response theory-likelihood ratio (IRT-LR) methods under certain conditions. Manipulated factors were sample size, ability differences between groups, test length, the percentage of differential item functioning (DIF), and underlying…

Descriptors: Comparative Analysis, Item Response Theory, Statistical Analysis, Test Bias

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

ProQuest LLC	12
Educational and Psychological…	9
Applied Psychological…	8
ETS Research Report Series	5
Psychological Assessment	4
ACT Education Corp.	3
Applied Measurement in…	3
Educational Sciences: Theory…	2
International Journal of…	2
Journal of Educational…	2
Psychometrika	2
Asia Pacific Education Review	1
College Entrance Examination…	1
Education and Information…	1
Educational Research and…	1
European Journal of Special…	1
Grantee Submission	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Psychoeducational…	1
Language Testing	1
Learning Disabilities: A…	1
Measurement in Physical…	1
Measurement:…	1
More ▼

Hambleton, Ronald K.	3
Dogan, Nuri	2
Drasgow, Fritz	2
Eggen, Theo J. H. M.	2
Frick, Theodore W.	2
Gessaroli, Marc E.	2
Kelecioglu, Hülya	2
Kim, Seock-Ho	2
Lee, Yi-Hsuan	2
Paek, Insu	2
Reckase, Mark D.	2
Schumacker, Randall E.	2
Weiss, David J.	2
Zhang, Jinming	2
Allan S. Cohen	1
Allen, Nancy L.	1
Allspach, Jill R.	1
Ann Arthur	1
Arsan, Nihan	1
Atalay Kabasakal, Kübra	1
Bazaldua, Diego A. Luna	1
Bejar, Isaac I.	1
Benton, Tom	1
Bergstrom, Betty A.	1
More ▼