Publication Date
| Date range | Count |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 20 |
| Since 2007 (last 20 years) | 44 |
Descriptor
| Descriptor | Count |
| --- | --- |
| Statistical Analysis | 71 |
| Test Length | 71 |
| Item Response Theory | 30 |
| Test Items | 29 |
| Sample Size | 28 |
| Comparative Analysis | 16 |
| Test Reliability | 16 |
| Correlation | 15 |
| Error of Measurement | 13 |
| Scores | 13 |
| Computation | 12 |
Author
| Author | Count |
| --- | --- |
| Bulut, Okan | 2 |
| Cohen, Allan S. | 2 |
| Huggins-Manley, Anne Corinne | 2 |
| Paek, Insu | 2 |
| Svetina, Dubravka | 2 |
| Tay, Louis | 2 |
| Wang, Wen-Chung | 2 |
| Weiss, David J. | 2 |
| Yormaz, Seha | 2 |
| de Jong, John H. A. L. | 2 |
| Abad, Francisco J. | 1 |
Publication Type
| Publication type | Count |
| --- | --- |
| Reports - Research | 55 |
| Journal Articles | 47 |
| Reports - Evaluative | 9 |
| Speeches/Meeting Papers | 5 |
| Dissertations/Theses -… | 3 |
| Tests/Questionnaires | 2 |
| Information Analyses | 1 |
| Numerical/Quantitative Data | 1 |
Education Level
| Education level | Count |
| --- | --- |
| Higher Education | 5 |
| Postsecondary Education | 4 |
| Secondary Education | 3 |
| Elementary Education | 1 |
| Elementary Secondary Education | 1 |
| Grade 3 | 1 |
| High Schools | 1 |
Audience
| Audience | Count |
| --- | --- |
| Researchers | 1 |
Assessments and Surveys
| Assessment | Count |
| --- | --- |
| SAT (College Admission Test) | 2 |
| California Psychological… | 1 |
| Program for International… | 1 |
| Stanford Binet Intelligence… | 1 |
| Test of English as a Foreign… | 1 |
| Wechsler Adult Intelligence… | 1 |
| Wechsler Individual… | 1 |
Huang, Hung-Yu – Educational and Psychological Measurement, 2017
Mixture item response theory (IRT) models have been suggested as an efficient method of detecting the different response patterns derived from latent classes when developing a test. In testing situations, multiple latent traits measured by a battery of tests can exhibit a higher-order structure, and mixtures of latent classes may occur on…
Descriptors: Item Response Theory, Models, Bayesian Statistics, Computation
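For orientation, a mixture IRT model assumes the examinee population is a mixture of G latent classes, each with its own item parameters. A standard mixture Rasch formulation (not necessarily the exact higher-order specification studied here) is:

```latex
P(X_{ij} = 1) \;=\; \sum_{g=1}^{G} \pi_g \,
  \frac{\exp(\theta_{jg} - b_{ig})}{1 + \exp(\theta_{jg} - b_{ig})},
\qquad \sum_{g=1}^{G} \pi_g = 1,
```

where $\pi_g$ is the proportion of examinees in latent class $g$, $\theta_{jg}$ is examinee $j$'s ability within class $g$, and $b_{ig}$ is the class-specific difficulty of item $i$.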
Svetina, Dubravka; Levy, Roy – Journal of Experimental Education, 2016
This study investigated the effect of complex structure on dimensionality assessment in compensatory multidimensional item response models using DETECT- and NOHARM-based methods. The performance was evaluated via the accuracy of identifying the correct number of dimensions and the ability to accurately recover item groupings using a simple…
Descriptors: Item Response Theory, Accuracy, Correlation, Sample Size
Paek, Insu – Educational and Psychological Measurement, 2016
The effect of guessing on the point estimate of coefficient alpha has been studied in the literature, but the impact of guessing and its interactions with other test characteristics on the interval estimators for coefficient alpha has not been fully investigated. This study examined the impact of guessing and its interactions with other test…
Descriptors: Guessing (Tests), Computation, Statistical Analysis, Test Length
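For reference, the point estimate of coefficient alpha that guessing perturbs is a direct computation from an examinee-by-item score matrix; a minimal sketch (variable names are illustrative, not from the paper):

```python
import numpy as np

def coefficient_alpha(scores: np.ndarray) -> float:
    """Cronbach's coefficient alpha for an examinee-by-item score matrix."""
    k = scores.shape[1]                         # test length (number of items)
    item_vars = scores.var(axis=0, ddof=1)      # per-item score variances
    total_var = scores.sum(axis=1).var(ddof=1)  # variance of total scores
    return (k / (k - 1)) * (1.0 - item_vars.sum() / total_var)

# Toy usage: 500 examinees, 20 dichotomous items.
rng = np.random.default_rng(0)
responses = rng.integers(0, 2, size=(500, 20))
print(coefficient_alpha(responses))
```

Random guessing on multiple-choice items injects noise into the item scores, which attenuates this point estimate; the study's question is how that noise, together with test length and other characteristics, also distorts the interval estimators built around it.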
Lee, HyeSun; Geisinger, Kurt F. – Educational and Psychological Measurement, 2016
The current study investigated the impact of matching criterion purification on the accuracy of differential item functioning (DIF) detection in large-scale assessments. The three matching approaches for DIF analyses (block-level matching, pooled booklet matching, and equated pooled booklet matching) were employed with the Mantel-Haenszel…
Descriptors: Test Bias, Measurement, Accuracy, Statistical Analysis
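All three matching approaches ultimately feed strata into the same Mantel-Haenszel machinery; a hedged sketch of the core statistic, with the matching criterion left abstract:

```python
import numpy as np

def mh_common_odds_ratio(item, group, match):
    """Mantel-Haenszel common odds ratio for one studied item.

    item  : 0/1 responses to the studied item
    group : 0 = reference group, 1 = focal group
    match : matching criterion (e.g., total score); one stratum per value
    """
    num = den = 0.0
    for s in np.unique(match):
        idx = match == s
        n_s = idx.sum()
        a = np.sum(item[idx & (group == 0)] == 1)  # reference, correct
        b = np.sum(item[idx & (group == 0)] == 0)  # reference, incorrect
        c = np.sum(item[idx & (group == 1)] == 1)  # focal, correct
        d = np.sum(item[idx & (group == 1)] == 0)  # focal, incorrect
        num += a * d / n_s
        den += b * c / n_s
    return num / den  # values near 1.0 indicate no DIF
```

Purification then removes items flagged as DIF from the matching criterion and re-runs the procedure; the effect of that step on detection accuracy is what the study isolates.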
Cao, Mengyang; Tay, Louis; Liu, Yaowu – Educational and Psychological Measurement, 2017
This study examined the performance of a proposed iterative Wald approach for detecting differential item functioning (DIF) between two groups when preknowledge of anchor items is absent. The iterative approach uses the Wald-2 test to identify anchor items and then iteratively tests the remaining items for DIF with the Wald-1 test. Monte Carlo…
Descriptors: Monte Carlo Methods, Test Items, Test Bias, Error of Measurement
Tay, Louis; Huang, Qiming; Vermunt, Jeroen K. – Educational and Psychological Measurement, 2016
In large-scale testing, the use of multigroup approaches is limited for assessing differential item functioning (DIF) across multiple variables as DIF is examined for each variable separately. In contrast, the item response theory with covariate (IRT-C) procedure can be used to examine DIF across multiple variables (covariates) simultaneously. To…
Descriptors: Item Response Theory, Test Bias, Simulation, College Entrance Examinations
Li, Feifei – ETS Research Report Series, 2017
An information-correction method for testlet-based tests is introduced. This method takes advantage of both generalizability theory (GT) and item response theory (IRT). The measurement error for the examinee proficiency parameter is often underestimated when a unidimensional conditional-independence IRT model is specified for a testlet dataset. By…
Descriptors: Item Response Theory, Generalizability Theory, Tests, Error of Measurement
Lu, Ying – ETS Research Report Series, 2017
For standard- or criterion-based assessments, the use of cut scores to indicate mastery, nonmastery, or different levels of skill mastery is very common. As part of performance summary, it is of interest to examine the percentage of examinees at or above the cut scores (PAC) and how PAC evolves across administrations. This paper shows that…
Descriptors: Cutting Scores, Evaluation Methods, Mastery Learning, Performance Based Assessment
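PAC itself is a one-line computation once the cut scores are fixed; a minimal sketch:

```python
import numpy as np

def pac(scores, cut):
    """Percentage of examinees at or above a cut score (PAC)."""
    return 100.0 * np.mean(np.asarray(scores) >= cut)

# Hypothetical administration with a cut score of 250.
print(pac([210, 250, 265, 280, 300], cut=250))  # 80.0
```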
Gelfand, Jessica T.; Christie, Robert E.; Gelfand, Stanley A. – Journal of Speech, Language, and Hearing Research, 2014
Purpose: Speech recognition may be analyzed in terms of recognition probabilities for perceptual wholes (e.g., words) and parts (e.g., phonemes), where j or the j-factor reveals the number of independent perceptual units required for recognition of the whole (Boothroyd, 1968b; Boothroyd & Nittrouer, 1988; Nittrouer & Boothroyd, 1990). For…
Descriptors: Phonemes, Word Recognition, Vowels, Syllables
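The j-factor logic referenced here is compact enough to state: if recognizing a whole requires recognizing j independent parts, each recognized with probability $P_p$, then

```latex
P_w = P_p^{\,j}
\quad\Longrightarrow\quad
j = \frac{\log P_w}{\log P_p},
```

where $P_w$ and $P_p$ are the recognition probabilities for wholes (e.g., words) and parts (e.g., phonemes). A j close to the number of parts indicates the parts are perceived independently; smaller values indicate that context, such as lexical knowledge, reduces the number of effectively independent perceptual units.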
Anthony, Christopher James; DiPerna, James Clyde – School Psychology Quarterly, 2017
The Academic Competence Evaluation Scales-Teacher Form (ACES-TF; DiPerna & Elliott, 2000) was developed to measure student academic skills and enablers (interpersonal skills, engagement, motivation, and study skills). Although ACES-TF scores have demonstrated psychometric adequacy, the length of the measure may be prohibitive for certain…
Descriptors: Test Items, Efficiency, Item Response Theory, Test Length
Goegan, Lauren D.; Harrison, Gina L. – Learning Disabilities: A Contemporary Journal, 2017
The effects of extended time on the writing performance of university students with learning disabilities (LD) were examined. Thirty-eight students (19 LD; 19 non-LD) completed a collection of cognitive, linguistic, and literacy measures and wrote essays under regular and extended time conditions. Limited evidence was found to support the…
Descriptors: Foreign Countries, Undergraduate Students, Testing Accommodations, Learning Disabilities
Atalay Kabasakal, Kübra; Arsan, Nihan; Gök, Bilge; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2014
This simulation study compared the performances (Type I error and power) of Mantel-Haenszel (MH), SIBTEST, and item response theory-likelihood ratio (IRT-LR) methods under certain conditions. Manipulated factors were sample size, ability differences between groups, test length, the percentage of differential item functioning (DIF), and underlying…
Descriptors: Comparative Analysis, Item Response Theory, Statistical Analysis, Test Bias
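In simulation comparisons like this one, Type I error is the rejection rate on items simulated without DIF and power is the rejection rate on items simulated with DIF; a minimal tallying sketch (names are illustrative):

```python
import numpy as np

def rejection_rates(p_values, dif_simulated, alpha=0.05):
    """Empirical Type I error and power from a DIF simulation study.

    p_values      : p-value for each item-by-replication test
    dif_simulated : True where DIF was actually generated
    """
    reject = np.asarray(p_values) < alpha
    dif = np.asarray(dif_simulated, dtype=bool)
    type_i_error = reject[~dif].mean()  # false positives on DIF-free items
    power = reject[dif].mean()          # true positives on DIF items
    return type_i_error, power
```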
Svetina, Dubravka – Educational and Psychological Measurement, 2013
The purpose of this study was to investigate the effect of complex structure on dimensionality assessment in noncompensatory multidimensional item response models using dimensionality assessment procedures based on DETECT (dimensionality evaluation to enumerate contributing traits) and NOHARM (normal ogive harmonic analysis robust method). Five…
Descriptors: Item Response Theory, Statistical Analysis, Computation, Test Length
Socha, Alan; DeMars, Christine E. – Educational and Psychological Measurement, 2013
Modeling multidimensional test data with a unidimensional model can result in serious statistical errors, such as bias in item parameter estimates. Many methods exist for assessing the dimensionality of a test. The current study focused on DIMTEST. Using simulated data, the effects of sample size splitting for use with the ATFIND procedure for…
Descriptors: Sample Size, Test Length, Correlation, Test Format
Liu, Qian – ProQuest LLC, 2011
For this dissertation, four item purification procedures were implemented with the generalized linear mixed model for differential item functioning (DIF) analysis, and their performance was investigated through a series of simulations. Among the four procedures, forward and generalized linear mixed model (GLMM)…
Descriptors: Test Bias, Test Items, Statistical Analysis, Models
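As a rough illustration of what iterative item purification does (using a logistic-regression DIF test as a simple stand-in for the GLMM, so this is a sketch of the general idea rather than Liu's exact procedures):

```python
import numpy as np
import statsmodels.api as sm

def purified_dif_flags(X, group, alpha=0.05, max_iter=10):
    """Iteratively purify the matching score while screening items for DIF.

    X     : examinee-by-item matrix of 0/1 responses
    group : 0/1 group membership for each examinee
    """
    n_items = X.shape[1]
    flagged = np.zeros(n_items, dtype=bool)
    for _ in range(max_iter):
        # Matching score built only from items not currently flagged as DIF.
        match = X[:, ~flagged].sum(axis=1)
        new_flags = np.zeros(n_items, dtype=bool)
        for i in range(n_items):
            design = sm.add_constant(np.column_stack([match, group]))
            fit = sm.Logit(X[:, i], design).fit(disp=0)
            new_flags[i] = fit.pvalues[2] < alpha  # significant group effect
        if np.array_equal(new_flags, flagged):     # purification has converged
            break
        flagged = new_flags
    return flagged
```

The design choice the dissertation compares is precisely *how* the flagged set is updated (e.g., forward selection versus all-at-once re-flagging), which this sketch collapses into a single re-flagging step.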