ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	3

Descriptor

Comparative Analysis	46
Mathematical Models	46
Test Items	46
Statistical Analysis	16
Item Analysis	15
Item Response Theory	15
Test Construction	14
Difficulty Level	10
Equations (Mathematics)	10
Estimation (Mathematics)	10
Factor Analysis	10
Goodness of Fit	9
Latent Trait Theory	9
Simulation	9
Test Reliability	8
Achievement Tests	7
Computer Simulation	7
Item Bias	7
Sample Size	7
Adaptive Testing	6
Error of Measurement	6
Monte Carlo Methods	6
High Schools	5
Maximum Likelihood Statistics	5
Chi Square	4
More ▼

Source

Applied Psychological…	3
Educational and Psychological…	2
Journal of Educational…	2
Applied Measurement in…	1
Educational Measurement:…	1
Journal of Educational and…	1
Measurement:…	1
ProQuest LLC	1
Psychometrika	1

Publication Type

Reports - Research	26
Speeches/Meeting Papers	19
Reports - Evaluative	16
Journal Articles	12
Guides - Non-Classroom	2
Reports - Descriptive	2
Reports - General	2
Books	1
Dissertations/Theses -…	1

Education Level

Higher Education

Audience

Researchers

Location

Ohio	1
South Carolina	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	3
Iowa Tests of Educational…	2
Comprehensive Tests of Basic…	1
School and College Ability…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 46 results Save | Export

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Direct link

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

Comparison of R Packages for Automated Test Assembly with Mixed-Integer Linear Programming

Peer reviewed

Direct link

Peabody, Michael R. – Measurement: Interdisciplinary Research and Perspectives, 2023

Many organizations utilize some form of automation in the test assembly process; either fully algorithmic or heuristically constructed. However, one issue with heuristic models is that when the test assembly problem changes the entire model may need to be re-conceptualized and recoded. In contrast, mixed-integer programming (MIP) is a mathematical…

Descriptors: Programming Languages, Algorithms, Heuristics, Mathematical Models

Advanced Quantitative Measurement Methodology in Physics Education Research

Direct link

Wang, Jing – ProQuest LLC, 2009

The ultimate goal of physics education research (PER) is to develop a theoretical framework to understand and improve the learning process. In this journey of discovery, assessment serves as our headlamp and alpenstock. It sometimes detects signals in student mental structures, and sometimes presents the difference between expert understanding and…

Descriptors: Test Items, Mathematical Models, Educational Testing, Physics

Comparison of Two Logistic Multidimensional Item Response Theory Models. Research Report ONR90-8.

Download full text

Spray, Judith A.; And Others – 1990

Test data generated according to two different multidimensional item response theory (IRT) models were compared at both the item response level and the test score level to determine whether measurable differences between the models could be detected when the data sets were constrained to be equivalent in terms of item "p"-values. The…

Descriptors: Ability, Comparative Analysis, Item Response Theory, Mathematical Models

DIF Detection and Description: Mantel-Haenszel and Standardization.

Download full text

Dorans, Neil J.; Holland, Paul W. – 1992

At the Educational Testing Service, the Mantel-Haenszel procedure is used for differential item functioning (DIF) detection, and the standardization procedure is used to describe DIF. This report describes these procedures. First, an important distinction is made between DIF and impact, pointing to the need to compare the comparable. Then, these…

Descriptors: Comparative Analysis, Distractors (Tests), Identification, Item Bias

A Comparison of the One-, the Modified Three-, and the Three-Parameter Item Response Theory Models in the Test Development Item Selection Process.

Eignor, Daniel R.; Douglass, James B. – 1982

This paper attempts to provide some initial information about the use of a variety of item response theory (IRT) models in the item selection process; its purpose is to compare the information curves derived from the selection of items characterized by several different IRT models and their associated parameter estimation programs. These…

Descriptors: Comparative Analysis, Latent Trait Theory, Mathematical Models, Multiple Choice Tests

Comparing Item Characteristic Curves.

Peer reviewed

Rosenbaum, Paul R. – Psychometrika, 1987

This paper develops and applies three nonparametric comparisons of the shapes of two item characteristic surfaces: (1) proportional latent odds; (2) uniform relative difficulty; and (3) item sensitivity. A method is presented for comparing the relative shapes of two item characteristic curves in two examinee populations who were administered an…

Descriptors: Comparative Analysis, Computer Simulation, Difficulty Level, Item Analysis

An NCME Instructional Module on Comparison of Classical Test Theory and Item Response Theory and Their Applications to Test Development.

Peer reviewed

Hambleton, Ronald K.; Jones, Russell W. – Educational Measurement: Issues and Practice, 1993

This National Council on Measurement in Education (NCME) instructional module compares classical test theory and item response theory and describes their applications in test development. Related concepts, models, and methods are explored; and advantages and disadvantages of each framework are reviewed. (SLD)

Descriptors: Comparative Analysis, Educational Assessment, Graphs, Item Response Theory

Validity of Using Two Numerical Analysis Techniques To Estimate Item and Ability Parameters via MMLE: Gauss-Hermite Quadrature Formula and Mislevy's Histogram Solution.

Download full text

Seong, Tae-Je – 1990

The similarity of item and ability parameter estimations was investigated using two numerical analysis techniques via marginal maximum likelihood estimation (MMLE) with a large simulated data set (n=1,000 examinees) and changing the number of quadrature points. MMLE estimation uses a numerical analysis technique to integrate examinees' abilities…

Descriptors: Comparative Analysis, Equations (Mathematics), Estimation (Mathematics), Mathematical Models

Accuracy of Two Procedures for Estimating Reliability of Mastery Tests.

Peer reviewed

Huynh, Huynh; Saunders, Joseph C. – Journal of Educational Measurement, 1980

Single administration (beta-binomial) estimates for the raw agreement index p and the corrected-for-chance kappa index in mastery testing are compared with those based on two test administrations in terms of estimation bias and sampling variability. Bias is about 2.5 percent for p and 10 percent for kappa. (Author/RL)

Descriptors: Comparative Analysis, Error of Measurement, Mastery Tests, Mathematical Models

Rasch-Based Factor Analysis of Dichotomously-Scored Item Response Data.

Download full text

Schumacker, Randall E.; Fluke, Rickey – 1991

Three methods of factor analyzing dichotomously scored item performance data were compared using two raw score data sets of 20-item tests, one reflecting normally distributed latent traits and the other reflecting uniformly distributed latent traits. This comparison was accomplished by using phi and tetrachoric correlations among dichotomous data…

Descriptors: Comparative Analysis, Equations (Mathematics), Estimation (Mathematics), Factor Analysis

A Brief Overview of Three Classes of Methods for Detecting Item Bias.

Fisk, Yvette Hester – 1991

The reasons for recent endeavors to evaluate item bias are discussed, and item bias is defined. Some of the literature regarding the most promising methods of detecting item bias is reviewed. Three classes of methods for detecting item bias are discussed using concrete examples and illustrations. These methods are: (1) latent trait; (2)…

Descriptors: Chi Square, Comparative Analysis, Difficulty Level, Item Bias

A Comparison of Item- and Person-Fit Methods of Assessing Model-Data Fit in IRT.

Peer reviewed

Reise, Steven P. – Applied Psychological Measurement, 1990

To demonstrate that some methods used to assess item fit can be applied to assess person fit and vice versa, performance of a chi-squared item-fit statistic was compared with that of a likelihood-based person-fit statistic for examinees and items under Monte Carlo conditions. (SLD)

Descriptors: Chi Square, Comparative Analysis, Goodness of Fit, Item Response Theory

Revising Answers to Items in Computerized Adaptive Tests: A Comparison of Three Models.

Download full text

Stocking, Martha L. – 1996

The interest in the application of large-scale computerized adaptive testing has served to focus attention on issues that arise when theoretical advances are made operational. Some of these issues stem less from changes in testing conditions and more from changes in testing paradigms. One such issue is that of the order in which questions are…

Descriptors: Adaptive Testing, Cognitive Processes, Comparative Analysis, Computer Assisted Testing

A Comparison of the Partial Credit and Graded Response Models in Computerized Adaptive Testing.

Download full text

De Ayala, R. J.; And Others – 1990

Computerized adaptive testing procedures (CATPs) based on the graded response method (GRM) of F. Samejima (1969) and the partial credit model (PCM) of G. Masters (1982) were developed and compared. Both programs used maximum likelihood estimation of ability, and item selection was conducted on the basis of information. Two simulated data sets, one…

Descriptors: Ability Identification, Adaptive Testing, Comparative Analysis, Computer Assisted Testing

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Reckase, Mark D.	5
McKinley, Robert L.	3
Benson, Jeri	2
Cohen, Allan S.	2
Douglass, James B.	2
Kim, Seock-Ho	2
Kromrey, Jeffrey D.	2
Ackerman, Terry A.	1
Albanese, Mark A.	1
Allan S. Cohen	1
Bacon, Tina P.	1
Berger, Martijn P. F.	1
Camilli, Gregory	1
Convey, John J.	1
De Ayala, R. J.	1
Dodd, Barbara G.	1
Dorans, Neil J.	1
Downey, Ronald G.	1
Eignor, Daniel R.	1
Engelen, Ronald J. H.	1
Evans, John A.	1
Fisk, Yvette Hester	1
Fluke, Rickey	1
Forsyth, Robert A.	1
More ▼