Showing all 4 results
Peer reviewed
Stoevenbelt, Andrea H.; Wicherts, Jelte M.; Flore, Paulette C.; Phillips, Lorraine A. T.; Pietschnig, Jakob; Verschuere, Bruno; Voracek, Martin; Schwabe, Inga – Educational and Psychological Measurement, 2023
When cognitive and educational tests are administered under time limits, they may become speeded, which can affect the reliability and validity of the resulting test scores. Prior research has shown that time limits may create or enlarge gender gaps in cognitive and academic testing. On average, women complete fewer items than men when a test…
Descriptors: Timed Tests, Gender Differences, Item Response Theory, Correlation
Peer reviewed
Huang, Hung-Yu; Wang, Wen-Chung – Educational and Psychological Measurement, 2014
In the social sciences, latent traits often have a hierarchical structure, and data can be sampled from multiple levels. Both hierarchical latent traits and multilevel data can occur simultaneously. In this study, we developed a general class of item response theory models to accommodate both hierarchical latent traits and multilevel data. The…
Descriptors: Item Response Theory, Hierarchical Linear Modeling, Computation, Test Reliability
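To make concrete what "hierarchical latent traits plus multilevel data" can look like, the following is a minimal simulation sketch; the structure and all parameter values are illustrative assumptions, not the authors' model. A general second-order trait drives two first-order traits, examinees are nested within groups that contribute a random effect, and item responses follow a Rasch-type model.

```python
# Minimal simulation sketch: hierarchical latent traits with multilevel data.
# Structure and parameter values are illustrative assumptions, not the
# authors' model: a general trait drives two first-order traits, and
# examinees are nested within groups that add a random effect.
import numpy as np

rng = np.random.default_rng(0)
n_groups, n_per_group, n_items = 20, 30, 10
item_difficulty = rng.normal(0.0, 1.0, size=n_items)

rows = []
for g in range(n_groups):
    group_effect = rng.normal(0.0, 0.5)              # level-2 (group) effect
    for _ in range(n_per_group):
        general = rng.normal(group_effect, 1.0)      # second-order trait
        # Two first-order traits loading on the general trait.
        traits = 0.8 * general + rng.normal(0.0, 0.6, size=2)
        # Rasch-type responses: first 5 items measure trait 1, the rest trait 2.
        theta = np.repeat(traits, n_items // 2)
        p = 1.0 / (1.0 + np.exp(-(theta - item_difficulty)))
        rows.append(rng.binomial(1, p))

data = np.array(rows)                                # examinees x items
print(data.shape, data.mean().round(3))
```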
Peer reviewed
Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010
This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…
Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores
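For readers unfamiliar with the classical-test-theory side of the subscore question, here is a minimal sketch of Cronbach's alpha computed for a single objective's items. The item-level 0/1 data are hypothetical, and this does not reproduce the scaling techniques the article evaluates; it only illustrates the kind of subscore reliability estimate at issue.

```python
# Minimal sketch: Cronbach's alpha for one subscore from item-level 0/1 data.
# Data are hypothetical; this illustrates only the classical reliability side
# of the subscore question discussed in the abstract.
from statistics import pvariance

def cronbach_alpha(item_matrix):
    """item_matrix: list of examinee rows, each a list of item scores."""
    k = len(item_matrix[0])                 # number of items in the subscore
    items = list(zip(*item_matrix))         # transpose to per-item columns
    item_vars = sum(pvariance(col) for col in items)
    total_var = pvariance([sum(row) for row in item_matrix])
    return (k / (k - 1)) * (1 - item_vars / total_var)

# Five examinees, four items belonging to one objective (subscore).
subscore_items = [
    [1, 1, 0, 1],
    [0, 1, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 1],
    [1, 0, 1, 1],
]
print(round(cronbach_alpha(subscore_items), 3))
```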
Kane, Michael T.; Brennan, Robert L. – 1977
A large number of seemingly diverse coefficients have been proposed as indices of dependability, or reliability, for domain-referenced and/or mastery tests. In this paper, it is shown that most of these indices are special cases of two generalized indices of agreement: one that is corrected for chance, and one that is not. The special cases of…
Descriptors: Bayesian Statistics, Correlation, Criterion Referenced Tests, Cutting Scores
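As an illustration of the distinction the abstract draws, the sketch below computes a raw agreement index and a chance-corrected one (Cohen's kappa is used here purely as a familiar example) for mastery/nonmastery classifications across two test administrations. The cut score and score vectors are hypothetical, and these are not necessarily the specific indices treated in the paper.

```python
# Minimal sketch: raw vs. chance-corrected agreement for mastery classifications.
# The cut score and scores below are hypothetical; Cohen's kappa is used only to
# illustrate the general "corrected for chance" idea, not the paper's indices.

def classify(scores, cut):
    """Return 1 (master) if score >= cut, else 0 (nonmaster)."""
    return [1 if s >= cut else 0 for s in scores]

def agreement_indices(form_a, form_b, cut):
    a = classify(form_a, cut)
    b = classify(form_b, cut)
    n = len(a)

    # Observed proportion of consistent classifications across the two forms.
    p_o = sum(x == y for x, y in zip(a, b)) / n

    # Agreement expected by chance, from the marginal mastery rates.
    pa, pb = sum(a) / n, sum(b) / n
    p_c = pa * pb + (1 - pa) * (1 - pb)

    # Chance-corrected agreement.
    kappa = (p_o - p_c) / (1 - p_c)
    return p_o, kappa

form_a = [12, 18, 25, 30, 8, 22, 27, 15]   # hypothetical scores, form A
form_b = [14, 16, 28, 29, 10, 19, 26, 13]  # hypothetical scores, form B
print(agreement_indices(form_a, form_b, cut=20))
```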