Publication Date

| Range | Records |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 3 |
| Since 2017 (last 10 years) | 11 |
| Since 2007 (last 20 years) | 21 |
Descriptor

| Descriptor | Records |
| --- | --- |
| Bayesian Statistics | 39 |
| Scoring | 39 |
| Item Response Theory | 17 |
| Test Items | 14 |
| Comparative Analysis | 11 |
| Adaptive Testing | 9 |
| Scores | 9 |
| Models | 8 |
| Simulation | 8 |
| Mathematical Models | 7 |
| Measurement Techniques | 7 |
Author

| Author | Records |
| --- | --- |
| Barnes, Tiffany, Ed. | 1 |
| Bradlow, Eric T. | 1 |
| Bruce L. Sherin | 1 |
| Camilli, Gregory | 1 |
| Cavalli-Sforza, Violetta, Ed. | 1 |
| Chan, Greta | 1 |
| Chao, Jessica L. | 1 |
| Chen, Fu | 1 |
| Chernyshenko, Oleksandr S. | 1 |
| Chu, Man-Wai | 1 |
| Cui, Yang | 1 |
Publication Type

| Publication Type | Records |
| --- | --- |
| Reports - Research | 24 |
| Journal Articles | 23 |
| Reports - Evaluative | 7 |
| Speeches/Meeting Papers | 6 |
| Collected Works - Proceedings | 3 |
| Dissertations/Theses -… | 2 |
| Reports - Descriptive | 2 |
| Numerical/Quantitative Data | 1 |
Location

| Location | Records |
| --- | --- |
| Brazil | 1 |
| China | 1 |
| Trinidad and Tobago | 1 |
| Turkey | 1 |
| Uruguay | 1 |
Assessments and Surveys

| Assessment/Survey | Records |
| --- | --- |
| Adjustment Scales for… | 1 |
| Comprehensive Tests of Basic… | 1 |
| Early Childhood Longitudinal… | 1 |
| Graduate Record Examinations | 1 |
| MacArthur Communicative… | 1 |
| Program for International… | 1 |
Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025
To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…
Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory
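Effort-moderated scoring, as examined in the study above, is commonly described as dropping any response whose response time falls below an item time threshold (a rapid guess) and scoring only the remaining effortful responses. A minimal sketch of that idea, with made-up thresholds and a simple proportion-correct score rather than the authors' exact procedure:

```python
def em_score(responses, response_times, thresholds):
    """Effort-moderated scoring sketch: responses faster than the item's
    time threshold are treated as rapid guesses and excluded; the score
    is proportion correct over the remaining (effortful) responses."""
    effortful = [
        correct
        for correct, rt, thr in zip(responses, response_times, thresholds)
        if rt >= thr  # keep only responses at or above the threshold
    ]
    if not effortful:
        return None  # nothing effortful left to score
    return sum(effortful) / len(effortful)

# Item 2, answered in 1.1 s against a 3 s threshold, is flagged as a
# rapid guess and dropped, so the score is 2 correct out of 3 scored.
print(em_score([1, 1, 0, 1], [12.0, 1.1, 8.5, 6.0], [3.0, 3.0, 3.0, 3.0]))
```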
Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023
This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…
Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation
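Two of the scoring methods compared above can be sketched side by side: number-correct scoring and IRT theta scoring under a 2PL model. The item parameters below are invented, and theta is estimated by a crude grid-search maximum likelihood, not the authors' estimation procedure:

```python
import math

def p_correct(theta, a, b):
    """2PL probability of a correct response to an item with
    discrimination a and difficulty b."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def number_correct(responses):
    """Number-correct score: a simple count of correct responses."""
    return sum(responses)

def theta_mle(responses, items, grid=None):
    """Crude grid-search maximum-likelihood estimate of theta
    under the 2PL model, searched over [-4, 4]."""
    if grid is None:
        grid = [x / 100.0 for x in range(-400, 401)]
    def loglik(theta):
        ll = 0.0
        for u, (a, b) in zip(responses, items):
            p = p_correct(theta, a, b)
            ll += math.log(p) if u else math.log(1.0 - p)
        return ll
    return max(grid, key=loglik)

items = [(1.0, -1.0), (1.2, 0.0), (0.8, 1.0)]  # illustrative (a, b) pairs
resp = [1, 1, 0]
print(number_correct(resp), theta_mle(resp, items))
```

The contrast is the point of the comparison: number-correct ignores which items were answered correctly, while the theta estimate weights items by their parameters.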
Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational Measurement, 2018
In typical differential item functioning (DIF) assessments, an item's DIF status is not influenced by its status in previous test administrations. An item that has shown DIF at multiple administrations may be treated the same way as an item that has shown DIF in only the most recent administration. Therefore, much useful information about the…
Descriptors: Test Bias, Testing, Test Items, Bayesian Statistics
Tingir, Seyfullah – ProQuest LLC, 2019
Educators use various statistical techniques to explain relationships between latent and observable variables. One way to model these relationships is to use Bayesian networks as a scoring model. However, adjusting the conditional probability tables (CPT-parameters) to fit a set of observations is still a challenge when using Bayesian networks. A…
Descriptors: Bayesian Statistics, Statistical Analysis, Scoring, Probability
Foster, Colin – International Journal of Science and Mathematics Education, 2022
Confidence assessment (CA) involves students stating alongside each of their answers a confidence rating (e.g. 0 low to 10 high) to express how certain they are that their answer is correct. Each student's score is calculated as the sum of the confidence ratings on the items that they answered correctly, minus the sum of the confidence ratings on…
Descriptors: Mathematics Tests, Mathematics Education, Secondary School Students, Meta Analysis
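The CA scoring rule described above can be sketched directly; the snippet is truncated mid-sentence, so the subtraction of confidence ratings on incorrectly answered items is an assumed completion, matching the standard confidence-assessment rule:

```python
def ca_score(answers_correct, confidences):
    """Confidence-assessment score sketch: add the confidence rating
    (0 low to 10 high) for each correct answer; subtract it for each
    incorrect one (assumed completion of the truncated description)."""
    return sum(c if ok else -c for ok, c in zip(answers_correct, confidences))

# A confident wrong answer costs as much as a confident correct one gains:
print(ca_score([True, True, False], [10, 6, 8]))  # 10 + 6 - 8 = 8
```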
Magliano, Joseph P.; Lampi, Jodi P.; Ray, Melissa; Chan, Greta – Grantee Submission, 2020
Coherent mental models for successful comprehension require inferences that establish semantic "bridges" between discourse constituents and "elaborations" that incorporate relevant background knowledge. While it is established that individual differences in the extent to which postsecondary students engage in these processes…
Descriptors: Reading Comprehension, Reading Strategies, Inferences, Reading Tests
Liu, Yang; Yang, Ji Seung – Journal of Educational and Behavioral Statistics, 2018
The uncertainty arising from item parameter estimation is often not negligible and must be accounted for when calculating latent variable (LV) scores in item response theory (IRT). It is particularly so when the calibration sample size is limited and/or the calibration IRT model is complex. In the current work, we treat two-stage IRT scoring as a…
Descriptors: Intervals, Scores, Item Response Theory, Bayesian Statistics
Cui, Yang; Chu, Man-Wai; Chen, Fu – Journal of Educational Data Mining, 2019
Digital game-based assessments generate student process data that is much more difficult to analyze than traditional assessments. The formative nature of game-based assessments permits students, through applying and practicing the targeted knowledge and skills during gameplay, to gain experiences, receive immediate feedback, and as a result,…
Descriptors: Educational Games, Student Evaluation, Data Analysis, Bayesian Statistics
Wu, Mike; Davis, Richard L.; Domingue, Benjamin W.; Piech, Chris; Goodman, Noah – International Educational Data Mining Society, 2020
Item Response Theory (IRT) is a ubiquitous model for understanding humans based on their responses to questions, used in fields as diverse as education, medicine and psychology. Large modern datasets offer opportunities to capture more nuances in human behavior, potentially improving test scoring and better informing public policy. Yet larger…
Descriptors: Item Response Theory, Accuracy, Data Analysis, Public Policy
Kim, Sooyeon; Moses, Tim; Yoo, Hanwook Henry – ETS Research Report Series, 2015
The purpose of this inquiry was to investigate the effectiveness of item response theory (IRT) proficiency estimators in terms of estimation bias and error under multistage testing (MST). We chose a 2-stage MST design in which 1 adaptation to the examinees' ability levels takes place. It includes 4 modules (1 at Stage 1, 3 at Stage 2) and 3 paths…
Descriptors: Item Response Theory, Computation, Statistical Bias, Error of Measurement
Nicole B. Kersting; Bruce L. Sherin; James W. Stigler – Educational and Psychological Measurement, 2014
In this study, we explored the potential for machine scoring of short written responses to the Classroom-Video-Analysis (CVA) assessment, which is designed to measure teachers' usable mathematics teaching knowledge. We created naïve Bayes classifiers for CVA scales assessing three different topic areas and compared computer-generated scores to…
Descriptors: Scoring, Automation, Video Technology, Teacher Evaluation
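A naïve Bayes classifier of the kind the study above applies to short written responses can be sketched from scratch with multinomial counts and Laplace smoothing; the tokenized responses and score labels below are invented for illustration, not the CVA data:

```python
import math
from collections import Counter, defaultdict

def train_nb(docs, labels):
    """Train a multinomial naive Bayes text classifier.
    docs: list of token lists; labels: parallel list of class labels."""
    class_counts = Counter(labels)          # documents per class (prior)
    word_counts = defaultdict(Counter)      # token counts per class
    vocab = set()
    for tokens, y in zip(docs, labels):
        word_counts[y].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab

def classify_nb(model, tokens):
    """Return the class with the highest smoothed log-posterior."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    def score(y):
        s = math.log(class_counts[y] / total_docs)  # log prior
        denom = sum(word_counts[y].values()) + len(vocab)
        for t in tokens:
            s += math.log((word_counts[y][t] + 1) / denom)  # Laplace smoothing
        return s
    return max(class_counts, key=score)

docs = [["slope", "increases"], ["wrong", "sign"],
        ["slope", "positive"], ["sign", "error"]]
labels = [1, 0, 1, 0]  # hypothetical rubric scores
model = train_nb(docs, labels)
print(classify_nb(model, ["slope", "positive", "increases"]))
```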
McDermott, Paul A.; Watkins, Marley W.; Rhoad, Anna M.; Chao, Jessica L.; Worrell, Frank C.; Hall, Tracey E. – International Journal of School & Educational Psychology, 2015
Given relevant cultural distinctions across nations, it is important to determine the dimensional structure and normative characteristics of psychological assessment devices in each focal population. This article examines the national standardization and validation of the Adjustment Scales for Children and Adolescents (ASCA) with a nationally…
Descriptors: Foreign Countries, Children, Adolescents, Adjustment (to Environment)
He, Wei; Wolfe, Edward W. – Educational and Psychological Measurement, 2012
In administration of individually administered intelligence tests, items are commonly presented in a sequence of increasing difficulty, and test administration is terminated after a predetermined number of incorrect answers. This practice produces stochastically censored data, a form of nonignorable missing data. By manipulating four factors…
Descriptors: Individual Testing, Intelligence Tests, Test Items, Test Length
Suh, Hongwook – ProQuest LLC, 2010
Response time has been regarded as an important source for investigating the relationship between human performance and response speed. It is important to examine the relationship between response time and item characteristics, especially from the perspective of the relationship between response time and various factors that affect examinee's…
Descriptors: Bayesian Statistics, Computation, Reaction Time, Item Response Theory
Kieftenbeld, Vincent; Natesan, Prathiba; Eddy, Colleen – Journal of Psychoeducational Assessment, 2011
The mathematics teaching efficacy beliefs of preservice elementary teachers have been the subject of several studies. A widely used measure in these studies is the Mathematics Teaching Efficacy Beliefs Instrument (MTEBI). The present study provides a detailed analysis of the psychometric properties of the MTEBI using Bayesian item response theory.…
Descriptors: Item Response Theory, Bayesian Statistics, Mathematics Instruction, Preservice Teachers

