ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	9
Since 2007 (last 20 years)	15

Descriptor

Computation	17
Test Format	17
Test Items	17
Item Response Theory	8
Accuracy	6
Multiple Choice Tests	5
Foreign Countries	4
Mathematics Tests	4
Problem Solving	4
Achievement Tests	3
Difficulty Level	3
Elementary Secondary Education	3
Models	3
Responses	3
Simulation	3
Statistical Analysis	3
Test Length	3
Ability	2
Comparative Analysis	2
Computer Assisted Testing	2
Equated Scores	2
Goodness of Fit	2
Markov Processes	2
Mathematics Skills	2
Maximum Likelihood Statistics	2
More ▼

Source

Educational and Psychological…	3
Journal of Educational…	2
Applied Psychological…	1
Behavioral Research and…	1
ETS Research Report Series	1
Educational Assessment	1
Educational Studies in…	1
Grantee Submission	1
International Journal of…	1
Journal for Research in…	1
Journal of Science Education…	1
Practical Assessment,…	1
ProQuest LLC	1
Universal Journal of…	1
More ▼

Publication Type

Journal Articles	14
Reports - Research	14
Reports - Evaluative	2
Dissertations/Theses -…	1
Information Analyses	1
Numerical/Quantitative Data	1

Education Level

Elementary Secondary Education	3
Secondary Education	3
Grade 8	2
Junior High Schools	2
Middle Schools	2
Elementary Education	1
Grade 1	1
Grade 2	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 9	1
High Schools	1
Higher Education	1
Postsecondary Education	1
More ▼

Audience

Practitioners	1
Researchers	1
Teachers	1

Location

Germany	1
Hong Kong	1
Oregon	1
United Kingdom	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	2
Armed Services Vocational…	1
National Assessment of…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

Simultaneous Linear Equating for Scenarios with Optional Test Versions or across Multiple Alternative Anchors

Peer reviewed
PDF on ERIC

Download full text

Tom Benton – Practical Assessment, Research & Evaluation, 2025

This paper proposes an extension of linear equating that may be useful in one of two fairly common assessment scenarios. One is where different students have taken different combinations of test forms. This might occur, for example, where students have some free choice over the exam papers they take within a particular qualification. In this…

Descriptors: Equated Scores, Test Format, Test Items, Computation

The Effect of Polytomous Item Ratio on Ability Estimation in Multistage Tests

Peer reviewed
PDF on ERIC

Download full text

Hasibe Yahsi Sari; Hulya Kelecioglu – International Journal of Assessment Tools in Education, 2025

The aim of the study is to examine the effect of polytomous item ratio on ability estimation in different conditions in multistage tests (MST) using mixed tests. The study is simulation-based research. In the PISA 2018 application, the ability parameters of the individuals and the item pool were created by using the item parameters estimated from…

Descriptors: Test Items, Test Format, Accuracy, Test Length

Examining the Impacts of Ignoring Rater Effects in Mixed-Format Tests

Peer reviewed

Direct link

Guo, Wenjing; Wind, Stefanie A. – Journal of Educational Measurement, 2021

The use of mixed-format tests made up of multiple-choice (MC) items and constructed response (CR) items is popular in large-scale testing programs, including the National Assessment of Educational Progress (NAEP) and many district- and state-level assessments in the United States. Rater effects, or raters' scoring tendencies that result in…

Descriptors: Test Format, Multiple Choice Tests, Scoring, Test Items

How the Length and Characteristics of Routing Module Affect Ability Estimation in ca-MST?

Peer reviewed
PDF on ERIC

Download full text

Öztürk, Nagihan Boztunç – Universal Journal of Educational Research, 2019

In this study, how the length and characteristics of routing module in different panel designs affect measurement precision is examined. In the scope of the study, six different routing module length, nine different routing module characteristics, and two different panel design are handled. At the end of the study, the effects of conditions on…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Length, Test Format

Can Reliability of Multiple Component Measuring Instruments Depend on Response Option Presentation Mode?

Peer reviewed

Direct link

Menold, Natalja; Raykov, Tenko – Educational and Psychological Measurement, 2016

This article examines the possible dependency of composite reliability on presentation format of the elements of a multi-item measuring instrument. Using empirical data and a recent method for interval estimation of group differences in reliability, we demonstrate that the reliability of an instrument need not be the same when polarity of the…

Descriptors: Test Reliability, Test Format, Test Items, Differences

Extension of Caution Indices to Mixed-Format Tests

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip – Grantee Submission, 2018

Tatsuoka (1984) suggested several extended caution indices and their standardized versions that have been used as person-fit statistics by researchers such as Drasgow, Levine, and McLaughlin (1987), Glas and Meijer (2003), and Molenaar and Hoijtink (1990). However, these indices are only defined for tests with dichotomous items. This paper extends…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Error Patterns

Embedded Field Test Item Statistics: Can They Be Trusted for Estimating Student Proficiency?

Peer reviewed

Direct link

Steedle, Jeffrey T.; Morrison, Kristin M. – Educational Assessment, 2019

Assessment items are commonly field tested prior to operational use to observe statistical item properties such as difficulty. Item parameter estimates from field testing may be used to assign scores via pre-equating or computer adaptive designs. This study examined differences between item difficulty estimates based on field test and operational…

Descriptors: Field Tests, Test Items, Statistics, Difficulty Level

Parameter Estimation in Rasch Models for Examinee-Selected Items

Peer reviewed

Direct link

Liu, Chen-Wei; Wang, Wen-Chung – Journal of Educational Measurement, 2017

The examinee-selected-item (ESI) design, in which examinees are required to respond to a fixed number of items in a given set of items (e.g., choose one item to respond from a pair of items), always yields incomplete data (i.e., only the selected items are answered and the others have missing data) that are likely nonignorable. Therefore, using…

Descriptors: Item Response Theory, Models, Maximum Likelihood Statistics, Data Analysis

A Scoping Review of Empirical Research on Recent Computational Thinking Assessments

Peer reviewed

Direct link

Cutumisu, Maria; Adams, Cathy; Lu, Chang – Journal of Science Education and Technology, 2019

Computational thinking (CT) is regarded as an essential twenty-first century competency and it is already embedded in K-12 curricula across the globe. However, research on assessing CT has lagged, with few assessments being implemented and validated. Moreover, there is a lack of systematic grouping of CT assessments. This scoping review examines…

Descriptors: Computation, Thinking Skills, 21st Century Skills, Elementary Secondary Education

Asymmetry in Student Achievement on Multiple-Choice and Constructed-Response Items in Reversible Mathematics Processes

Peer reviewed

Direct link

Sangwin, Christopher J.; Jones, Ian – Educational Studies in Mathematics, 2017

In this paper we report the results of an experiment designed to test the hypothesis that when faced with a question involving the inverse direction of a reversible mathematical process, students solve a multiple-choice version by verifying the answers presented to them by the direct method, not by undertaking the actual inverse calculation.…

Descriptors: Mathematics Achievement, Mathematics Tests, Multiple Choice Tests, Computer Assisted Testing

Effects of Design Properties on Parameter Estimation in Large-Scale Assessments

Peer reviewed

Direct link

Hecht, Martin; Weirich, Sebastian; Siegle, Thilo; Frey, Andreas – Educational and Psychological Measurement, 2015

The selection of an appropriate booklet design is an important element of large-scale assessments of student achievement. Two design properties that are typically optimized are the "balance" with respect to the positions the items are presented and with respect to the mutual occurrence of pairs of items in the same booklet. The purpose…

Descriptors: Measurement, Computation, Test Format, Test Items

Accuracy and Variability of Item Parameter Estimates from Marginal Maximum a Posteriori Estimation and Bayesian Inference via Gibbs Samplers

Direct link

Wu, Yi-Fang – ProQuest LLC, 2015

Item response theory (IRT) uses a family of statistical models for estimating stable characteristics of items and examinees and defining how these characteristics interact in describing item and test performance. With a focus on the three-parameter logistic IRT (Birnbaum, 1968; Lord, 1980) model, the current study examines the accuracy and…

Descriptors: Item Response Theory, Test Items, Accuracy, Computation

Item Response Theory Models for Wording Effects in Mixed-Format Scales

Peer reviewed

Direct link

Wang, Wen-Chung; Chen, Hui-Fang; Jin, Kuan-Yu – Educational and Psychological Measurement, 2015

Many scales contain both positively and negatively worded items. Reverse recoding of negatively worded items might not be enough for them to function as positively worded items do. In this study, we commented on the drawbacks of existing approaches to wording effect in mixed-format scales and used bi-factor item response theory (IRT) models to…

Descriptors: Item Response Theory, Test Format, Language Usage, Test Items

The Effects of Rater Severity and Rater Distribution on Examinees' Ability Estimation for Constructed-Response Items. Research Report. ETS RR-13-23

Peer reviewed
PDF on ERIC

Download full text

Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013

The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…

Descriptors: Test Format, Test Items, Responses, Computation

Examining Item Functioning of Math Screening Measures for Grades 1-8 Students. Technical Report Number 08-04

Download full text

Liu, Kimy; Ketterlin-Geller, Leanne R.; Yovanoff, Paul; Tindal, Gerald – Behavioral Research and Teaching, 2008

BRT Math Screening Measures focus on students' mathematics performance in grade-level standards for students in grades 1-8. A total of 24 test forms are available with three test forms per grade corresponding to fall, winter, and spring testing periods. Each form contains computation problems and application problems. BRT Math Screening Measures…

Descriptors: Test Items, Test Format, Test Construction, Item Response Theory

Previous Page | Next Page »

Pages: 1 | 2

Wang, Wen-Chung	2
Adams, Cathy	1
Chen, Hui-Fang	1
Cutumisu, Maria	1
Frey, Andreas	1
Guo, Wenjing	1
Hasibe Yahsi Sari	1
Hecht, Martin	1
Hulya Kelecioglu	1
Jin, Kuan-Yu	1
Jones, Ian	1
Ketterlin-Geller, Leanne R.	1
Liu, Chen-Wei	1
Liu, Kimy	1
Lu, Chang	1
Menold, Natalja	1
Morrison, Kristin M.	1
Nicewander, W. Alan	1
Quenette, Mary A.	1
Raykov, Tenko	1
Sangwin, Christopher J.	1
Schoen, Harold L.	1
Siegle, Thilo	1
Sinharay, Sandip	1
More ▼