ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	4
Since 2017 (last 10 years)	4
Since 2007 (last 20 years)	5

Descriptor

Item Analysis	8
Reliability	8
Simulation	8
Test Items	6
Sample Size	4
Mathematical Models	3
Sampling	3
Comparative Analysis	2
Correlation	2
Difficulty Level	2
Error of Measurement	2
Evaluation Methods	2
Latent Trait Theory	2
Measurement Techniques	2
Reaction Time	2
Academic Achievement	1
Algorithms	1
Bias	1
Case Studies	1
Computation	1
Computer Assisted Testing	1
Computer Software	1
Content Analysis	1
Context Effect	1
Criticism	1
More ▼

Source

Journal of Educational Data…	1
Journal of Educational and…	1
Measurement:…	1
Multivariate Behavioral…	1
Studies in Second Language…	1

Author

Allan S. Cohen	1
Bronson Hui	1
Effenberger, Tomáš	1
Hartig, Johannes	1
Holzel, Britta	1
Jensen, Harald E.	1
Jordan M. Wheeler	1
Kolen, Michael J.	1
Kukucka, Adam	1
Moosbrugger, Helfried	1
Novak, Josip	1
Pelánek, Radek	1
Rebernjak, Blaž	1
Reckase, Mark D.	1
Ree, Malcom James	1
Shiyu Wang	1
Whitney, Douglas R.	1
Zhiyi Wu	1
More ▼

Publication Type

Reports - Research	7
Journal Articles	5
Reports - Evaluative	1

Education Level

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Estimating Reliability for Response-Time Difference Measures: Toward a Standardized, Model-Based Approach

Peer reviewed

Direct link

Bronson Hui; Zhiyi Wu – Studies in Second Language Acquisition, 2024

A slowdown or a speedup in response times across experimental conditions can be taken as evidence of online deployment of knowledge. However, response-time difference measures are rarely evaluated on their reliability, and there is no standard practice to estimate it. In this article, we used three open data sets to explore an approach to…

Descriptors: Reliability, Reaction Time, Psychometrics, Criticism

There Are Many Greater Lower Bounds than Cronbach's [alpha]: A Monte Carlo Simulation Study

Peer reviewed

Direct link

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Direct link

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

Towards Design-Loop Adaptivity: Identifying Items for Revision

Peer reviewed
PDF on ERIC

Download full text

Pelánek, Radek; Effenberger, Tomáš; Kukucka, Adam – Journal of Educational Data Mining, 2022

We study the automatic identification of educational items worthy of content authors' attention. Based on the results of such analysis, content authors can revise and improve the content of learning environments. We provide an overview of item properties relevant to this task, including difficulty and complexity measures, item discrimination, and…

Descriptors: Item Analysis, Identification, Difficulty Level, Case Studies

A Confirmatory Analysis of Item Reliability Trends (CAIRT): Differentiating True Score and Error Variance in the Analysis of Item Context Effects

Peer reviewed

Direct link

Hartig, Johannes; Holzel, Britta; Moosbrugger, Helfried – Multivariate Behavioral Research, 2007

Numerous studies have shown increasing item reliabilities as an effect of the item position in personality scales. Traditionally, these context effects are analyzed based on item-total correlations. This approach neglects that trends in item reliabilities can be caused either by an increase in true score variance or by a decrease in error…

Descriptors: True Scores, Error of Measurement, Structural Equation Models, Simulation

Accuracy of Estimating Two Parameter Logistic Latent Trait Parameters and Implications for Classroom Tests.

Download full text

Kolen, Michael J.; Whitney, Douglas R. – 1978

The application of latent trait theory to classroom tests necessitates the use of small sample sizes for parameter estimation. Computer generated data were used to assess the accuracy of estimation of the slope and location parameters in the two parameter logistic model with fixed abilities and varying small sample sizes. The maximum likelihood…

Descriptors: Difficulty Level, Item Analysis, Latent Trait Theory, Mathematical Models

A Comparison of the One- and Three-Parameter Logistic Models for Item Calibration.

Download full text

Reckase, Mark D. – 1978

Five comparisons were made relative to the quality of estimates of ability parameters and item calibrations obtained from the one-parameter and three-parameter logistic models. The results indicate: (1) The three-parameter model fit the test data better in all cases than did the one-parameter model. For simulation data sets, multi-factor data were…

Descriptors: Comparative Analysis, Goodness of Fit, Item Analysis, Mathematical Models

Item Characteristic Curve Parameters: Effects of Sample Size on Linear Equating.

Download full text

Ree, Malcom James; Jensen, Harald E. – 1980

By means of computer simulation of test responses, the reliability of item analysis data and the accuracy of equating were examined for hypothetical samples of 250, 500, 1000, and 2000 subjects for two tests with 20 equating items plus 60 additional items on the same scale. Birnbaum's three-parameter logistic model was used for the simulation. The…

Descriptors: Computer Assisted Testing, Equated Scores, Error of Measurement, Item Analysis