Showing 1 to 15 of 24 results
Peer reviewed
Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025
To mitigate the potentially damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…
Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory
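For readers unfamiliar with effort-moderated scoring, the following is a minimal sketch of the general idea: responses whose response times fall below a per-item threshold are flagged as rapid guesses and dropped from the scoring likelihood, so only effortful responses inform the ability estimate. The function name, the 2PL grid-search estimator, and all numbers are illustrative assumptions, not details taken from the study above.

```python
import numpy as np

def em_score(responses, resp_times, rt_thresholds, a, b,
             theta_grid=np.linspace(-4, 4, 161)):
    """Effort-moderated ML scoring sketch under a 2PL model.

    Responses faster than the item's response-time threshold are treated
    as rapid guesses and excluded from the likelihood.
    """
    effortful = resp_times >= rt_thresholds
    x, a_eff, b_eff = responses[effortful], a[effortful], b[effortful]

    # 2PL probability of a correct response at each theta grid point
    p = 1.0 / (1.0 + np.exp(-a_eff * (theta_grid[:, None] - b_eff)))
    loglik = (x * np.log(p) + (1 - x) * np.log(1 - p)).sum(axis=1)
    return theta_grid[np.argmax(loglik)]

# Toy usage: item 2 is answered in 1.1 s, below its 3 s threshold,
# so it is ignored when estimating theta.
resp = np.array([1, 1, 0, 1])
rts = np.array([12.0, 1.1, 20.0, 15.0])
thr = np.array([3.0, 3.0, 3.0, 3.0])
a = np.array([1.0, 1.2, 0.8, 1.1])
b = np.array([-0.5, 0.0, 0.5, 1.0])
print(em_score(resp, rts, thr, a, b))
```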
Peer reviewed
Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023
This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…
Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation
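As context for the scoring methods compared above, here is a brief sketch contrasting number-correct scoring with IRT theta (EAP) scoring under a 2PL model. The item parameters, prior, and responses are made-up illustrations and do not reflect the study's design.

```python
import numpy as np

def number_correct(responses):
    """Number-correct raw score: the count of correct answers."""
    return int(np.sum(responses))

def eap_theta(responses, a, b, grid=np.linspace(-4, 4, 161)):
    """EAP theta under a 2PL model with a standard normal prior."""
    p = 1.0 / (1.0 + np.exp(-a * (grid[:, None] - b)))
    lik = np.prod(np.where(responses == 1, p, 1 - p), axis=1)
    post = lik * np.exp(-0.5 * grid**2)          # unnormalized posterior
    return float(np.sum(grid * post) / np.sum(post))

resp = np.array([1, 0, 1, 1, 0])
a = np.array([1.0, 1.3, 0.7, 1.1, 0.9])
b = np.array([-1.0, -0.3, 0.2, 0.8, 1.5])
print(number_correct(resp), round(eap_theta(resp, a, b), 3))
```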
Peer reviewed
Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational Measurement, 2018
In typical differential item functioning (DIF) assessments, an item's DIF status is not influenced by its status in previous test administrations. An item that has shown DIF at multiple administrations may be treated the same way as an item that has shown DIF in only the most recent administration. Therefore, much useful information about the…
Descriptors: Test Bias, Testing, Test Items, Bayesian Statistics
Peer reviewed
Foster, Colin – International Journal of Science and Mathematics Education, 2022
Confidence assessment (CA) involves students stating alongside each of their answers a confidence rating (e.g. 0 low to 10 high) to express how certain they are that their answer is correct. Each student's score is calculated as the sum of the confidence ratings on the items that they answered correctly, minus the sum of the confidence ratings on…
Descriptors: Mathematics Tests, Mathematics Education, Secondary School Students, Meta Analysis
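The confidence-assessment scoring rule described above lends itself to a one-line implementation. The abstract is truncated, so the subtraction term below, applied to incorrectly answered items, is an assumption based on the common formulation of CA scoring; the function name and numbers are illustrative.

```python
def ca_score(confidences, correct):
    """Confidence-assessment score sketch.

    Adds the confidence rating (0 low to 10 high) for each correctly
    answered item and subtracts it for each incorrect one (the
    subtraction term is an assumption, since the abstract is truncated).
    """
    return sum(c if ok else -c for c, ok in zip(confidences, correct))

# A student who is sure of a wrong answer loses more than one who hedges.
print(ca_score([9, 4, 7], [True, False, True]))   # 9 - 4 + 7 = 12
```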
Peer reviewed
Magliano, Joseph P.; Lampi, Jodi P.; Ray, Melissa; Chan, Greta – Grantee Submission, 2020
Coherent mental models for successful comprehension require inferences that establish semantic "bridges" between discourse constituents and "elaborations" that incorporate relevant background knowledge. While it is established that individual differences in the extent to which postsecondary students engage in these processes…
Descriptors: Reading Comprehension, Reading Strategies, Inferences, Reading Tests
Peer reviewed
Liu, Yang; Yang, Ji Seung – Journal of Educational and Behavioral Statistics, 2018
The uncertainty arising from item parameter estimation is often not negligible and must be accounted for when calculating latent variable (LV) scores in item response theory (IRT). This is particularly so when the calibration sample size is limited and/or the calibration IRT model is complex. In the current work, we treat two-stage IRT scoring as a…
Descriptors: Intervals, Scores, Item Response Theory, Bayesian Statistics
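One common way to account for calibration uncertainty in two-stage scoring, sketched below, is to re-score an examinee under many plausible item-parameter draws and pool the within- and between-draw variability. This is only an illustration of the general idea; the normal draws, standard errors, and variance decomposition are assumptions and are not the specific method of the paper above.

```python
import numpy as np

rng = np.random.default_rng(0)

def eap(responses, a, b, grid=np.linspace(-4, 4, 161)):
    """EAP mean and variance under a 2PL model with a N(0,1) prior."""
    p = 1.0 / (1.0 + np.exp(-a * (grid[:, None] - b)))
    lik = np.prod(np.where(responses == 1, p, 1 - p), axis=1)
    post = lik * np.exp(-0.5 * grid**2)
    post /= post.sum()
    mean = np.sum(grid * post)
    var = np.sum((grid - mean) ** 2 * post)
    return mean, var

# Calibration point estimates and (assumed) standard errors per item.
a_hat, a_se = np.array([1.0, 1.2, 0.8]), np.array([0.15, 0.20, 0.10])
b_hat, b_se = np.array([-0.5, 0.2, 0.9]), np.array([0.10, 0.12, 0.15])
resp = np.array([1, 0, 1])

# Re-score under many plausible item-parameter draws, then combine
# within-draw and between-draw variance into one interval width.
means, vars_ = [], []
for _ in range(500):
    m, v = eap(resp, rng.normal(a_hat, a_se), rng.normal(b_hat, b_se))
    means.append(m)
    vars_.append(v)

theta = np.mean(means)
total_sd = np.sqrt(np.mean(vars_) + np.var(means))
print(round(theta, 3), round(total_sd, 3))
```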
Peer reviewed
Cui, Yang; Chu, Man-Wai; Chen, Fu – Journal of Educational Data Mining, 2019
Digital game-based assessments generate student process data that are much more difficult to analyze than data from traditional assessments. The formative nature of game-based assessments permits students, through applying and practicing the targeted knowledge and skills during gameplay, to gain experiences, receive immediate feedback, and as a result,…
Descriptors: Educational Games, Student Evaluation, Data Analysis, Bayesian Statistics
Peer reviewed
Wu, Mike; Davis, Richard L.; Domingue, Benjamin W.; Piech, Chris; Goodman, Noah – International Educational Data Mining Society, 2020
Item Response Theory (IRT) is a ubiquitous model for understanding humans based on their responses to questions, used in fields as diverse as education, medicine and psychology. Large modern datasets offer opportunities to capture more nuances in human behavior, potentially improving test scoring and better informing public policy. Yet larger…
Descriptors: Item Response Theory, Accuracy, Data Analysis, Public Policy
Peer reviewed
Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015
The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…
Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement
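The 1-3 multistage design described above routes each examinee from a single Stage 1 module to one of three Stage 2 modules. A minimal routing sketch follows; routing on proportion correct and the particular cut points are illustrative assumptions, not the rules used in the report.

```python
def route(stage1_score, n_stage1_items, cuts=(0.4, 0.7)):
    """Route to an easy/medium/hard Stage 2 module based on the Stage 1
    proportion correct (cut points here are purely illustrative)."""
    prop = stage1_score / n_stage1_items
    if prop < cuts[0]:
        return "easy"
    elif prop < cuts[1]:
        return "medium"
    return "hard"

print(route(stage1_score=3, n_stage1_items=10))   # -> easy
print(route(stage1_score=8, n_stage1_items=10))   # -> hard
```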
Nicole B. Kersting; Bruce L. Sherin; James W. Stigler – Educational and Psychological Measurement, 2014
In this study, we explored the potential for machine scoring of short written responses to the Classroom-Video-Analysis (CVA) assessment, which is designed to measure teachers' usable mathematics teaching knowledge. We created naïve Bayes classifiers for CVA scales assessing three different topic areas and compared computer-generated scores to…
Descriptors: Scoring, Automation, Video Technology, Teacher Evaluation
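To make the machine-scoring setup above concrete, here is a small sketch of a bag-of-words naïve Bayes classifier trained on human-scored short responses and evaluated against human ratings. The texts, score scale, and agreement statistic are invented for illustration; the actual CVA scoring pipeline may differ.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline
from sklearn.metrics import cohen_kappa_score

# Tiny illustrative training set: short written responses with human scores.
train_texts = [
    "the teacher connects the fraction model to the algorithm",
    "students just memorize steps without understanding",
    "the response restates the problem",
    "links student thinking to the underlying concept",
]
train_scores = [2, 1, 0, 2]

clf = make_pipeline(CountVectorizer(), MultinomialNB())
clf.fit(train_texts, train_scores)

test_texts = ["memorizing the steps", "connects the model to the concept"]
human = [1, 2]
machine = clf.predict(test_texts)

# Compare computer-generated scores with human scores.
print(machine, cohen_kappa_score(human, machine))
```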
Peer reviewed
McDermott, Paul A.; Watkins, Marley W.; Rhoad, Anna M.; Chao, Jessica L.; Worrell, Frank C.; Hall, Tracey E. – International Journal of School & Educational Psychology, 2015
Given relevant cultural distinctions across nations, it is important to determine the dimensional structure and normative characteristics of psychological assessment devices in each focal population. This article examines the national standardization and validation of the Adjustment Scales for Children and Adolescents (ASCA) with a nationally…
Descriptors: Foreign Countries, Children, Adolescents, Adjustment (to Environment)
Peer reviewed
He, Wei; Wolfe, Edward W. – Educational and Psychological Measurement, 2012
In administration of individually administered intelligence tests, items are commonly presented in a sequence of increasing difficulty, and test administration is terminated after a predetermined number of incorrect answers. This practice produces stochastically censored data, a form of nonignorable missing data. By manipulating four factors…
Descriptors: Individual Testing, Intelligence Tests, Test Items, Test Length
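The stochastic censoring described above arises from a discontinue rule: administration stops after a set number of incorrect answers, so later (harder) items are missing in a way that depends on ability. The simulation below illustrates that mechanism; the Rasch-type item model, the consecutive-incorrect rule, and all parameter values are assumptions for illustration only.

```python
import numpy as np

rng = np.random.default_rng(1)

def administer(theta, b, stop_after=3):
    """Administer items in increasing difficulty and stop after
    `stop_after` consecutive incorrect answers; unadministered items are
    coded np.nan, yielding stochastically censored data."""
    order = np.argsort(b)                    # easiest first
    responses = np.full(len(b), np.nan)
    wrong_streak = 0
    for i in order:
        p = 1.0 / (1.0 + np.exp(-(theta - b[i])))   # Rasch-type item
        x = rng.random() < p
        responses[i] = float(x)
        wrong_streak = 0 if x else wrong_streak + 1
        if wrong_streak >= stop_after:
            break
    return responses

b = np.linspace(-2, 2, 12)                   # 12 items, easy to hard
print(administer(theta=-0.5, b=b))
```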
Iseli, Markus R.; Koenig, Alan D.; Lee, John J.; Wainess, Richard – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2010
Assessment of complex task performance is crucial to evaluating personnel in critical job functions such as Navy damage control operations aboard ships. Games and simulations can be instrumental in this process, as they can present a broad range of complex scenarios without involving harm to people or property. However, "automatic"…
Descriptors: Performance Tests, Performance Based Assessment, Decision Making Skills, Military Training
Peer reviewed
Rock, Donald A. – ETS Research Report Series, 2012
This paper provides a history of ETS's role in developing assessment instruments and psychometric procedures for measuring change in large-scale national assessments funded by the Longitudinal Studies branch of the National Center for Education Statistics. It documents the innovations developed during more than 30 years of working with…
Descriptors: Models, Educational Change, Longitudinal Studies, Educational Development
Peer reviewed
Wainer, Howard; Wang, X. A.; Skorupski, William P.; Bradlow, Eric T. – Journal of Educational Measurement, 2005
In this note, we demonstrate an interesting use of the posterior distributions (and corresponding posterior samples of proficiency) that are yielded by fitting a fully Bayesian test scoring model to a complex assessment. Specifically, we examine the efficacy of the test in combination with the specific passing score that was chosen through expert…
Descriptors: Scoring, Bayesian Statistics
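One simple use of posterior samples of proficiency, in the spirit of the note above, is to compute each examinee's probability of exceeding a chosen passing score; mass near 0.5 flags examinees the cut score classifies with little confidence. The normal draws and the cut value below are fabricated stand-ins for real posterior samples.

```python
import numpy as np

rng = np.random.default_rng(2)

# Stand-in posterior samples of proficiency for three examinees
# (in practice these come from fitting a fully Bayesian scoring model).
posterior = {
    "examinee_A": rng.normal(0.9, 0.30, size=2000),
    "examinee_B": rng.normal(0.5, 0.35, size=2000),
    "examinee_C": rng.normal(-0.2, 0.25, size=2000),
}
cut_score = 0.6   # illustrative passing score on the proficiency scale

for name, draws in posterior.items():
    p_pass = np.mean(draws > cut_score)   # posterior probability of passing
    print(f"{name}: P(theta > cut) = {p_pass:.2f}")
```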