ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	5
Since 2007 (last 20 years)	7

Descriptor

Accuracy	7
Difficulty Level	7
Test Format	7
Test Items	6
Foreign Countries	3
Item Response Theory	3
Science Tests	3
Achievement Tests	2
Comparative Analysis	2
Computation	2
Equated Scores	2
Language Tests	2
Multiple Choice Tests	2
Reading Tests	2
Classification	1
Cognitive Processes	1
Computer Assisted Testing	1
Data Interpretation	1
Decision Making	1
Differences	1
Elementary School Students	1
Elementary Secondary Education	1
English (Second Language)	1
Field Tests	1
Grade 3	1
More ▼

Source

ETS Research Report Series	1
Educational Assessment	1
Educational and Psychological…	1
Malaysian Journal of Learning…	1
ProQuest LLC	1
Research in Science Education	1
School Psychology	1

Author

Bulut, Okan	1
Ehrich, John	1
Fadillah, Sarah Meilani	1
Ha, Minsu	1
Howard, Steven J.	1
Indriyanti, Nurma Yunita	1
Liao, Chi-Wen	1
Liou, Pey-Yan	1
Livingston, Samuel A.	1
Morrison, Kristin M.	1
Nuraeni, Eni	1
Steedle, Jeffrey T.	1
Stella Yun Kim	1
Ting Sun	1
Woodcock, Stuart	1
Wu, Yi-Fang	1
More ▼

Publication Type

Journal Articles	6
Reports - Research	6
Dissertations/Theses -…	1

Education Level

Secondary Education	2
Early Childhood Education	1
Elementary Education	1
Elementary Secondary Education	1
Grade 3	1
Grade 8	1
High Schools	1
Higher Education	1
Junior High Schools	1
Middle Schools	1
Postsecondary Education	1
Primary Education	1
More ▼

Audience

Location

Australia	1
Indonesia	1
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment Program…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Evaluating Equating Methods for Varying Levels of Form Difference

Peer reviewed

Direct link

Ting Sun; Stella Yun Kim – Educational and Psychological Measurement, 2024

Equating is a statistical procedure used to adjust for the difference in form difficulty such that scores on those forms can be used and interpreted comparably. In practice, however, equating methods are often implemented without considering the extent to which two forms differ in difficulty. The study aims to examine the effect of the magnitude…

Descriptors: Difficulty Level, Data Interpretation, Equated Scores, High School Students

Exploring Confidence Accuracy and Item Difficulty in Changing Multiple-Choice Answers of Scientific Reasoning Test

Peer reviewed
PDF on ERIC

Download full text

Fadillah, Sarah Meilani; Ha, Minsu; Nuraeni, Eni; Indriyanti, Nurma Yunita – Malaysian Journal of Learning and Instruction, 2023

Purpose: Researchers discovered that when students were given the opportunity to change their answers, a majority changed their responses from incorrect to correct, and this change often increased the overall test results. What prompts students to modify their answers? This study aims to examine the modification of scientific reasoning test, with…

Descriptors: Science Tests, Multiple Choice Tests, Test Items, Decision Making

A Within-Subject Experiment of Item Format Effects on Early Primary Students' Language, Reading, and Numeracy Assessment Results

Peer reviewed

Direct link

Woodcock, Stuart; Howard, Steven J.; Ehrich, John – School Psychology, 2020

Standardized testing is ubiquitous in educational assessment, but questions have been raised about the extent to which these test scores accurately reflect students' genuine knowledge and skills. To more rigorously investigate this issue, the current study employed a within-subject experimental design to examine item format effects on primary…

Descriptors: Elementary School Students, Grade 3, Test Items, Test Format

The Effects of Item Format and Cognitive Domain on Students' Science Performance in TIMSS 2011

Peer reviewed

Direct link

Liou, Pey-Yan; Bulut, Okan – Research in Science Education, 2020

The purpose of this study was to examine eighth-grade students' science performance in terms of two test design components, item format, and cognitive domain. The portion of Taiwanese data came from the 2011 administration of the Trends in International Mathematics and Science Study (TIMSS), one of the major international large-scale assessments…

Descriptors: Foreign Countries, Middle School Students, Grade 8, Science Achievement

Embedded Field Test Item Statistics: Can They Be Trusted for Estimating Student Proficiency?

Peer reviewed

Direct link

Steedle, Jeffrey T.; Morrison, Kristin M. – Educational Assessment, 2019

Assessment items are commonly field tested prior to operational use to observe statistical item properties such as difficulty. Item parameter estimates from field testing may be used to assign scores via pre-equating or computer adaptive designs. This study examined differences between item difficulty estimates based on field test and operational…

Descriptors: Field Tests, Test Items, Statistics, Difficulty Level

Accuracy and Variability of Item Parameter Estimates from Marginal Maximum a Posteriori Estimation and Bayesian Inference via Gibbs Samplers

Direct link

Wu, Yi-Fang – ProQuest LLC, 2015

Item response theory (IRT) uses a family of statistical models for estimating stable characteristics of items and examinees and defining how these characteristics interact in describing item and test performance. With a focus on the three-parameter logistic IRT (Birnbaum, 1968; Lord, 1980) model, the current study examines the accuracy and…

Descriptors: Item Response Theory, Test Items, Accuracy, Computation

Examining an Alternative to Score Equating: A Randomly Equivalent Forms Approach. Research Report. ETS RR-08-14

Peer reviewed
PDF on ERIC

Download full text

Liao, Chi-Wen; Livingston, Samuel A. – ETS Research Report Series, 2008

Randomly equivalent forms (REF) of tests in listening and reading for nonnative speakers of English were created by stratified random assignment of items to forms, stratifying on item content and predicted difficulty. The study included 50 replications of the procedure for each test. Each replication generated 2 REFs. The equivalence of those 2…

Descriptors: Equated Scores, Item Analysis, Test Items, Difficulty Level