Showing 46 to 60 of 99 results
Sunnassee, Devdass – ProQuest LLC, 2011
Small-sample equating remains a largely unexplored area of research. This study attempts to fill some of the gaps via a large-scale, IRT-based simulation study that evaluates the performance of seven small-sample equating methods under various test-characteristic and sampling conditions. The equating methods considered are typically…
Descriptors: Test Length, Test Format, Sample Size, Simulation
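The truncated abstract does not describe the data-generation step, but IRT-based simulation studies of this kind typically start by drawing dichotomous responses from a known generating model. A minimal Python sketch under a 3PL model; all parameter distributions and sample sizes below are hypothetical illustrations, not the dissertation's actual conditions:

```python
import numpy as np

rng = np.random.default_rng(0)

def simulate_3pl(theta, a, b, c):
    """Draw 0/1 item responses from a 3PL IRT model.

    theta   : (n_examinees,) ability values
    a, b, c : (n_items,) discrimination, difficulty, guessing
    """
    # P(correct) = c + (1 - c) / (1 + exp(-a * (theta - b)))
    z = a[None, :] * (theta[:, None] - b[None, :])
    p = c[None, :] + (1.0 - c[None, :]) / (1.0 + np.exp(-z))
    return (rng.random(p.shape) < p).astype(int)

# Hypothetical small-sample condition: 50 examinees, 40 items.
theta = rng.normal(0.0, 1.0, size=50)
a = rng.lognormal(0.0, 0.3, size=40)
b = rng.normal(0.0, 1.0, size=40)
c = np.full(40, 0.2)
responses = simulate_3pl(theta, a, b, c)
print(responses.shape, responses.mean())
```

Repeating such draws across replications and sampling conditions is what lets a study of this kind compare equating methods against the known generating model.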
Peer reviewed
Thalmayer, Amber Gayle; Saucier, Gerard; Eigenhuis, Annemarie – Psychological Assessment, 2011
A general consensus on the Big Five model of personality attributes has been highly generative for the field of personality psychology. Many important psychological and life outcome correlates with Big Five trait dimensions have been established. But researchers must choose between multiple Big Five inventories when conducting a study and are…
Descriptors: Test Validity, Personality Measures, Test Length, Undergraduate Students
Peer reviewed
Lee, Yi-Hsuan; Zhang, Jinming – ETS Research Report Series, 2010
This report examines the consequences of differential item functioning (DIF) using simulated data. Its impact on total score, item response theory (IRT) ability estimate, and test reliability was evaluated in various testing scenarios created by manipulating the following four factors: test length, percentage of DIF items per form, sample sizes of…
Descriptors: Test Bias, Item Response Theory, Test Items, Scores
Deng, Nina – ProQuest LLC, 2011
Three decision consistency and accuracy (DC/DA) methods, the Livingston and Lewis (LL) method, the Lee method, and the Hambleton and Han (HH) method, were evaluated. The purposes of the study were: (1) to evaluate the accuracy and robustness of these methods, especially when their assumptions were not well satisfied, (2) to investigate the "true"…
Descriptors: Item Response Theory, Test Theory, Computation, Classification
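The three named methods (LL, Lee, HH) each estimate decision consistency from a single administration, which is beyond a short sketch. As a point of reference, the quantity they approximate can be computed directly in a simulation by administering two parallel forms to the same simulated examinees; a hedged Python illustration under a 2PL model with made-up parameters:

```python
import numpy as np

rng = np.random.default_rng(1)

def decision_consistency(theta, a, b, cut):
    """Simulate two parallel administrations from the same abilities
    (2PL model) and return the raw pass/fail agreement rate."""
    p = 1.0 / (1.0 + np.exp(-a * (theta[:, None] - b[None, :])))
    score_1 = (rng.random(p.shape) < p).sum(axis=1)
    score_2 = (rng.random(p.shape) < p).sum(axis=1)
    return float(np.mean((score_1 >= cut) == (score_2 >= cut)))

# Hypothetical condition: 1,000 examinees, 30 items, cut score of 18.
theta = rng.normal(0.0, 1.0, size=1000)
a = rng.lognormal(0.0, 0.3, size=30)
b = rng.normal(0.0, 1.0, size=30)
print(decision_consistency(theta, a, b, cut=18))
```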
Evans, Josiah Jeremiah – ProQuest LLC, 2010
In measurement research, data simulations are a commonly used analytical technique. While simulation designs have many benefits, it is unclear if these artificially generated datasets are able to accurately capture real examinee item response behaviors. This potential lack of comparability may have important implications for administration of…
Descriptors: Computer Assisted Testing, Adaptive Testing, Educational Testing, Admission (School)
Peer reviewed
Guo, Jing; Tay, Louis; Drasgow, Fritz – International Journal of Testing, 2009
Test compromise is a concern in cognitive ability testing because such tests are widely used in employee selection and administered on a continuous basis. This study evaluated how well cognitive tests deployed in different test systems resist small-scale cheating conspiracies, in terms of the accuracy of ability estimation.…
Descriptors: Cheating, Cognitive Tests, Adaptive Testing, Computer Assisted Testing
Kim, Jiseon – ProQuest LLC, 2010
Classification testing has been widely used to make categorical decisions by determining whether an examinee has a certain degree of ability required by established standards. As computer technologies have developed, classification testing has become more computerized. Several approaches have been proposed and investigated in the context of…
Descriptors: Test Length, Computer Assisted Testing, Classification, Probability
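One widely studied approach in computerized classification testing (though the truncated abstract does not confirm which approaches this dissertation compares) is Wald's sequential probability ratio test (SPRT), which keeps administering items until the evidence favors one side of the cut region. A sketch in Python, assuming a 2PL model and hypothetical parameters:

```python
import math

def sprt_classify(responses, a, b, theta_lo, theta_hi, alpha=0.05, beta=0.05):
    """Wald's SPRT for pass/fail classification: accumulate the
    log-likelihood ratio of two abilities bracketing the cut score,
    stopping once a decision boundary is crossed."""
    upper = math.log((1 - beta) / alpha)   # decide "pass" (theta_hi)
    lower = math.log(beta / (1 - alpha))   # decide "fail" (theta_lo)
    llr = 0.0
    for x, ai, bi in zip(responses, a, b):
        p_hi = 1.0 / (1.0 + math.exp(-ai * (theta_hi - bi)))
        p_lo = 1.0 / (1.0 + math.exp(-ai * (theta_lo - bi)))
        llr += math.log(p_hi / p_lo) if x else math.log((1 - p_hi) / (1 - p_lo))
        if llr >= upper:
            return "pass"
        if llr <= lower:
            return "fail"
    return "undecided"  # item pool or test length exhausted

# Hypothetical usage: four 2PL items, cut region theta in [-0.2, 0.2].
print(sprt_classify([1, 1, 0, 1],
                    a=[1.0, 1.2, 0.8, 1.1],
                    b=[-0.5, 0.0, 0.3, 0.8],
                    theta_lo=-0.2, theta_hi=0.2))
```

The trade-off this literature studies is the one visible here: tighter error rates (smaller alpha and beta) widen the decision boundaries and lengthen the test.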
Seo, Dong Gi – ProQuest LLC, 2011
Most computerized adaptive tests (CAT) have been studied under the framework of unidimensional item response theory. However, many psychological variables are multidimensional and might benefit from using a multidimensional approach to CAT. In addition, a number of psychological variables (e.g., quality of life, depression) can be conceptualized…
Descriptors: Test Length, Quality of Life, Item Analysis, Geometric Concepts
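The multidimensional machinery the dissertation studies is involved, but the core CAT loop it generalizes is simple: re-estimate ability after each response and administer the unused item that is most informative there. A unidimensional 2PL sketch in Python (illustrative only; multidimensional CAT replaces this scalar information with a matrix-based criterion):

```python
import numpy as np

def item_information(theta, a, b):
    """Fisher information of 2PL items at ability theta."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    return a**2 * p * (1.0 - p)

def next_item(theta_hat, a, b, administered):
    """Standard maximum-information CAT rule: pick the unadministered
    item with the largest information at the current ability estimate."""
    info = item_information(theta_hat, a, b)
    info[list(administered)] = -np.inf
    return int(np.argmax(info))

# Hypothetical pool of four items; item 1 already administered.
a = np.array([1.0, 1.5, 0.7, 1.2])
b = np.array([-1.0, 0.0, 0.5, 1.5])
print(next_item(0.3, a, b, administered={1}))
```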
Peer reviewed
DeMars, Christine E. – Journal of Educational and Behavioral Statistics, 2009
The Mantel-Haenszel (MH) and logistic regression (LR) differential item functioning (DIF) procedures have inflated Type I error rates when there are large mean group differences, short tests, and large sample sizes. When there are large group differences in mean score, groups matched on the observed number-correct score differ on true score,…
Descriptors: Regression (Statistics), Test Bias, Error of Measurement, True Scores
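For readers unfamiliar with the MH procedure the abstract refers to, its core statistic is a common odds ratio pooled over score strata, which is exactly where matching on an observed (rather than true) score enters. A minimal Python sketch of that statistic, not DeMars's code:

```python
import numpy as np

def mantel_haenszel_odds_ratio(item, group, total_score):
    """MH common odds ratio for one studied item, stratifying examinees
    by observed number-correct score.

    item        : 0/1 responses to the studied item
    group       : 0 = reference group, 1 = focal group
    total_score : matching variable (number-correct score)
    """
    num = den = 0.0
    for s in np.unique(total_score):
        stratum = total_score == s
        ref_right = np.sum(stratum & (group == 0) & (item == 1))
        ref_wrong = np.sum(stratum & (group == 0) & (item == 0))
        foc_right = np.sum(stratum & (group == 1) & (item == 1))
        foc_wrong = np.sum(stratum & (group == 1) & (item == 0))
        n = ref_right + ref_wrong + foc_right + foc_wrong
        if n == 0:
            continue
        num += ref_right * foc_wrong / n
        den += ref_wrong * foc_right / n
    return num / den if den > 0 else float("nan")
```

Values far from 1.0 flag DIF; the Type I error inflation the article describes arises when DIF-free items are flagged simply because the matched groups differ in true score.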
Peer reviewed
Willse, John T.; Goodman, Joshua T. – Educational and Psychological Measurement, 2008
This research provides a direct comparison of effect size estimates based on structural equation modeling (SEM), item response theory (IRT), and raw scores. Differences between the SEM, IRT, and raw score approaches are examined under a variety of data conditions (IRT models underlying the data, test lengths, magnitude of group differences, and…
Descriptors: Test Length, Structural Equation Models, Effect Size, Raw Scores
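Of the three approaches compared, the raw-score one is the simplest to state concretely; a short Python sketch of the pooled standardized mean difference (Cohen's d), included here only to anchor what the SEM- and IRT-based estimates are being compared against:

```python
import numpy as np

def cohens_d(x, y):
    """Raw-score standardized mean difference with pooled SD."""
    nx, ny = len(x), len(y)
    pooled_var = ((nx - 1) * np.var(x, ddof=1) +
                  (ny - 1) * np.var(y, ddof=1)) / (nx + ny - 2)
    return (np.mean(x) - np.mean(y)) / np.sqrt(pooled_var)
```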
Peer reviewed
Cui, Zhongmin; Kolen, Michael J. – Applied Psychological Measurement, 2008
This article considers two methods of estimating standard errors of equipercentile equating: the parametric bootstrap method and the nonparametric bootstrap method. Using a simulation study, these two methods are compared under three sample sizes (300, 1,000, and 3,000), for two test content areas (the Iowa Tests of Basic Skills Maps and Diagrams…
Descriptors: Test Length, Test Content, Simulation, Computation
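The nonparametric variant of the two methods compared is easy to sketch: resample examinees with replacement from each form's sample, re-equate, and take the standard deviation of the replicated equated scores. A hedged Python illustration using a bare-bones, unsmoothed equipercentile function (the parametric variant would instead resample from a fitted score distribution):

```python
import numpy as np

rng = np.random.default_rng(3)

def equipercentile_equate(x_scores, y_scores, x_point):
    """Map a Form X score to the Form Y scale by matching percentile
    ranks (no presmoothing or continuization; illustrative only)."""
    p = np.mean(x_scores <= x_point)
    return np.quantile(y_scores, p)

def nonparametric_bootstrap_se(x_scores, y_scores, x_point, n_boot=500):
    """SE of the equated score via resampling examinees with replacement."""
    estimates = []
    for _ in range(n_boot):
        xb = rng.choice(x_scores, size=len(x_scores), replace=True)
        yb = rng.choice(y_scores, size=len(y_scores), replace=True)
        estimates.append(equipercentile_equate(xb, yb, x_point))
    return float(np.std(estimates, ddof=1))
```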
Peer reviewed
Lee, Yi-Hsuan; Zhang, Jinming – ETS Research Report Series, 2008
The method of maximum likelihood is typically applied to item response theory (IRT) models when the ability parameter is estimated while conditioning on the true item parameters. In practice, the item parameters are unknown and need to be estimated first from a calibration sample. Lewis (1985) and Zhang and Lu (2007) proposed the expected response…
Descriptors: Item Response Theory, Comparative Analysis, Computation, Ability
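The baseline procedure the report starts from, maximum-likelihood estimation of ability with item parameters treated as known, can be sketched in a few lines. A Python illustration under a 2PL model (the report's expected-response-function approach, which accounts for item-parameter uncertainty, is not shown here):

```python
import numpy as np
from scipy.optimize import minimize_scalar

def mle_theta(responses, a, b):
    """ML ability estimate for a 2PL model, conditioning on fixed
    (assumed-true) item parameters a and b."""
    def neg_loglik(theta):
        p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
        p = np.clip(p, 1e-10, 1.0 - 1e-10)  # guard the log
        return -np.sum(responses * np.log(p) +
                       (1 - responses) * np.log(1 - p))
    return minimize_scalar(neg_loglik, bounds=(-4, 4), method="bounded").x

# Hypothetical usage: four items, mixed response pattern.
a = np.array([1.0, 1.2, 0.8, 1.1])
b = np.array([-0.5, 0.0, 0.5, 1.0])
print(mle_theta(np.array([1, 1, 0, 1]), a, b))
```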
Wu, Margaret – OECD Publishing (NJ1), 2010
This paper makes an in-depth comparison of the PISA (OECD) and TIMSS (IEA) mathematics assessments conducted in 2003. First, a comparison of survey methodologies is presented, followed by an examination of the mathematics frameworks in the two studies. The methodologies and the frameworks in the two studies form the basis for providing…
Descriptors: Mathematics Achievement, Foreign Countries, Gender Differences, Comparative Analysis
Peer reviewed
Ricker, Kathryn L.; von Davier, Alina A. – ETS Research Report Series, 2007
This study explored the effects of external anchor test length on final equating results for several equating methods, including equipercentile (frequency estimation), chained equipercentile, kernel equating (KE) poststratification equating (PSE) with optimal bandwidths, and linear KE PSE (large bandwidths), when using the nonequivalent groups anchor test…
Descriptors: Equated Scores, Test Items, Statistical Analysis, Test Length
Peer reviewed
Woods, Carol M. – Applied Psychological Measurement, 2007
Ramsay curve item response theory (RC-IRT) was recently developed to detect and correct for nonnormal latent variables when unidimensional IRT models are fitted to data using maximum marginal likelihood estimation. The purpose of this research is to evaluate the performance of RC-IRT for Likert-type item responses with varying test lengths, sample…
Descriptors: Test Length, Item Response Theory, Sample Size, Comparative Analysis