ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	3
Since 2007 (last 20 years)	5

Descriptor

Bayesian Statistics	7
Scores	7
Test Reliability	7
Error of Measurement	4
Comparative Analysis	3
Test Validity	3
Accuracy	2
Correlation	2
Equations (Mathematics)	2
High School Students	2
Item Response Theory	2
Test Construction	2
Test Items	2
Ability	1
Achievement Tests	1
Analysis of Variance	1
Aptitude Tests	1
Behavioral Objectives	1
College Entrance Examinations	1
Computer Assisted Testing	1
Criterion Referenced Tests	1
Cutting Scores	1
Educational Objectives	1
Estimation (Mathematics)	1
Foreign Countries	1

Source

Educational and Psychological…	2
ETS Research Report Series	1
Education and Information…	1
ProQuest LLC	1
Psychometrika	1

Publication Type

Journal Articles	5
Reports - Research	4
Reports - Evaluative	2
Dissertations/Theses -…	1

Education Level

High Schools	2
Higher Education	2
Postsecondary Education	2
Secondary Education	2

Location

Netherlands

Assessments and Surveys

National Merit Scholarship…	1
Preliminary Scholastic…	1

Showing all 7 results

Are Speeded Tests Unfair? Modeling the Impact of Time Limits on the Gender Gap in Mathematics

Peer reviewed

Stoevenbelt, Andrea H.; Wicherts, Jelte M.; Flore, Paulette C.; Phillips, Lorraine A. T.; Pietschnig, Jakob; Verschuere, Bruno; Voracek, Martin; Schwabe, Inga – Educational and Psychological Measurement, 2023

When cognitive and educational tests are administered under time limits, tests may become speeded and this may affect the reliability and validity of the resulting test scores. Prior research has shown that time limits may create or enlarge gender gaps in cognitive and academic testing. On average, women complete fewer items than men when a test…

Descriptors: Timed Tests, Gender Differences, Item Response Theory, Correlation

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Bayesian Approaches to Test Score Measurement Errors in Student Growth Prediction Models

Pei-Hsuan Chiu – ProQuest LLC, 2018

Evidence of student growth is a primary outcome of interest for educational accountability systems. When three or more years of student test data are available, questions around how students grow and what their predicted growth is can be answered. Given that test scores contain measurement error, this error should be considered in growth and…

Descriptors: Bayesian Statistics, Scores, Error of Measurement, Growth Models

A Comparison of Approaches for Improving the Reliability of Objective Level Scores

Peer reviewed

Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010

This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…

Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores

Improved Reliability Estimates for Small Samples Using Empirical Bayes Techniques. Research Report. ETS RR-09-46

Peer reviewed

Oh, Hyeonjoo J.; Guo, Hongwen; Walker, Michael E. – ETS Research Report Series, 2009

Issues of equity and fairness across subgroups of the population (e.g., gender or ethnicity) must be seriously considered in any standardized testing program. For this reason, many testing programs require some means for assessing test characteristics, such as reliability, for subgroups of the population. However, often only small sample sizes are…

Descriptors: Standardized Tests, Test Reliability, Sample Size, Bayesian Statistics

Ability Estimation for Conventional Tests.

Peer reviewed

Kim, Jwa K.; Nicewander, W. Alan – Psychometrika, 1993

Bias, standard error, and reliability of five ability estimators were evaluated using Monte Carlo estimates of the unknown conditional means and variances of the estimators. Results indicate that estimates based on Bayesian modal, expected a posteriori, and weighted likelihood estimators were reasonably unbiased with relatively small standard…

Descriptors: Ability, Bayesian Statistics, Equations (Mathematics), Error of Measurement

Criterion-Referenced Measurement.

Millman, Jason – 1974

This chapter should not only acquaint the reader with the present state of the art on Criterion-Referenced (CR) measurement but also suggest possible directions for further inquiry. The goal of the first part of this chapter is to deal with the definitional dilemma of CR measurement by proceeding from the more traditional view of CR measurement to…

Descriptors: Analysis of Variance, Bayesian Statistics, Behavioral Objectives, Comparative Analysis

Author

Carvajal, Jorge	1
Flore, Paulette C.	1
Gelbal, Selahattin	1
Guo, Hongwen	1
Kim, Jwa K.	1
Millman, Jason	1
Nicewander, W. Alan	1
Oh, Hyeonjoo J.	1
Ozdemir, Burhanettin	1
Pei-Hsuan Chiu	1
Phillips, Lorraine A. T.	1
Pietschnig, Jakob	1
Schwabe, Inga	1
Skorupski, William P.	1
Stoevenbelt, Andrea H.	1
Verschuere, Bruno	1
Voracek, Martin	1
Walker, Michael E.	1
Wicherts, Jelte M.	1