ERIC - Search Results

Publication Date

In 2025	0
Since 2024	3
Since 2021 (last 5 years)	9

Source

Applied Measurement in…

Publication Type

Journal Articles	9
Reports - Research	8
Reports - Evaluative	1
Tests/Questionnaires	1

Education Level

Secondary Education	4
Higher Education	3
Postsecondary Education	3
Elementary Education	1
Elementary Secondary Education	1
Grade 4	1
High Schools	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1

Audience

Location

Canada	1
Germany	1
Iran	1
Iran (Tehran)	1
Japan	1
Oman	1
Singapore	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	2
Trends in International…	1

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Cross-Cultural Validation of the Mathematics Construct and Attribute Profiles: A Differential Item Functioning Approach

Peer reviewed

Direct link

Yi-Hsin Chen – Applied Measurement in Education, 2024

This study aims to apply the differential item functioning (DIF) technique with the deterministic inputs, noisy "and" gate (DINA) model to validate the mathematics construct and diagnostic attribute profiles across American and Singaporean students. Even with the same ability level, every single item is expected to show uniform DIF…

Descriptors: Foreign Countries, Achievement Tests, Elementary Secondary Education, International Assessment

Coefficient [beta] as Extension of KR-21 Reliability for Summed and Scaled Scores for Polytomously-Scored Tests

Peer reviewed

Direct link

Almehrizi, Rashid S. – Applied Measurement in Education, 2021

KR-21 reliability and its extension (coefficient [alpha]) gives the reliability estimate of test scores under the assumption of tau-equivalent forms. KR-21 reliability gives the reliability estimate for summed scores for dichotomous items when items are randomly sampled from an infinite pool of similar items (randomly parallel forms). The article…

Descriptors: Test Reliability, Scores, Scoring, Computation

Exploring Interrelationships among L2 Writing Subskills: Insights from Cognitive Diagnostic Models

Peer reviewed

Direct link

Hamdollah Ravand; Farshad Effatpanah; Wenchao Ma; Jimmy de la Torre; Purya Baghaei; Olga Kunina-Habenicht – Applied Measurement in Education, 2024

The purpose of this study was to explore the nature of interactions among second/foreign language (L2) writing subskills. Two types of relationships were investigated: subskill-item and subskill-subskill relationships. To achieve the first purpose, using writing data obtained from the writing essays of 500 English as a foreign language (EFL)…

Descriptors: Second Language Learning, Writing Instruction, Writing Skills, Writing Tests

Not-Reached Items: An Issue of Time and of Test-Taking Disengagement? The Case of PISA 2015 Reading Data

Peer reviewed

Direct link

Pools, Elodie – Applied Measurement in Education, 2022

Many low-stakes assessments, such as international large-scale surveys, are administered during time-limited testing sessions and some test-takers are not able to endorse the last items of the test, resulting in not-reached (NR) items. However, because the test has no consequence for the respondents, these NR items can also stem from quitting the…

Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students

Dissecting Knowledge, Guessing, and Blunder in Multiple Choice Assessments

Peer reviewed

Direct link

Abu-Ghazalah, Rashid M.; Dubins, David N.; Poon, Gregory M. K. – Applied Measurement in Education, 2023

Multiple choice results are inherently probabilistic outcomes, as correct responses reflect a combination of knowledge and guessing, while incorrect responses additionally reflect blunder, a confidently committed mistake. To objectively resolve knowledge from responses in an MC test structure, we evaluated probabilistic models that explicitly…

Descriptors: Guessing (Tests), Multiple Choice Tests, Probability, Models

Detecting Differential Item Functioning Using Cognitive Diagnosis Models: Applications of the Wald Test and Likelihood Ratio Test in a University Entrance Examination

Peer reviewed

Direct link

Mehrazmay, Roghayeh; Ghonsooly, Behzad; de la Torre, Jimmy – Applied Measurement in Education, 2021

The present study aims to examine gender differential item functioning (DIF) in the reading comprehension section of a high stakes test using cognitive diagnosis models. Based on the multiple-group generalized deterministic, noisy "and" gate (MG G-DINA) model, the Wald test and likelihood ratio test are used to detect DIF. The flagged…

Descriptors: Test Bias, College Entrance Examinations, Gender Differences, Reading Tests

Measurement Invariance in Relation to First Language: An Evaluation of German Reading and Spelling Tests

Peer reviewed

Direct link

Visser, Linda; Cartschau, Friederike; von Goldammer, Ariane; Brandenburg, Janin; Timmerman, Marieke; Hasselhorn, Marcus; Mähler, Claudia – Applied Measurement in Education, 2023

The growing number of children in primary schools in Germany who have German as their second language (L2) has raised questions about the fairness of performance assessment. Fair tests are a prerequisite for distinguishing between L2 learning delay and a specific learning disability. We evaluated five commonly used reading and spelling tests for…

Descriptors: Foreign Countries, Error of Measurement, Second Language Learning, German

Computer-Based Listening Test with Full Video, Visual-Limited Video, and Audio: A Comparative Analysis Based on Difficulty, Discrimination Power, and Response Time

Peer reviewed

Direct link

Takahiro Terao – Applied Measurement in Education, 2024

This study aimed to compare item characteristics and response time between stimulus conditions in computer-delivered listening tests. Listening materials had three variants: regular videos, frame-by-frame videos, and only audios without visuals. Participants were 228 Japanese high school students who were requested to complete one of nine…

Descriptors: Computer Assisted Testing, Audiovisual Aids, Reaction Time, High School Students

Comparing the Robustness of Three Nonparametric DIF Procedures to Differential Rapid Guessing

Peer reviewed

Direct link

Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022

When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…

Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis

Foreign Countries	9
Test Items	5
Achievement Tests	3
International Assessment	3
Reading Tests	3
Classification	2
Difficulty Level	2
English (Second Language)	2
Error of Measurement	2
Guessing (Tests)	2
Item Analysis	2
Models	2
Probability	2
Reaction Time	2
Responses	2
Scores	2
Second Language Learning	2
Secondary School Students	2
Test Bias	2
Anxiety	1
Audiovisual Aids	1
Bayesian Statistics	1
Beginning Teachers	1
Behavior Patterns	1
Biotechnology	1
More ▼

Abu-Ghazalah, Rashid M.	1
Abulela, Mohammed A. A.	1
Almehrizi, Rashid S.	1
Brandenburg, Janin	1
Cartschau, Friederike	1
Dubins, David N.	1
Farshad Effatpanah	1
Ghonsooly, Behzad	1
Hamdollah Ravand	1
Hasselhorn, Marcus	1
Jimmy de la Torre	1
Mehrazmay, Roghayeh	1
Mähler, Claudia	1
Olga Kunina-Habenicht	1
Pools, Elodie	1
Poon, Gregory M. K.	1
Purya Baghaei	1
Rios, Joseph A.	1
Takahiro Terao	1
Timmerman, Marieke	1
Visser, Linda	1
Wenchao Ma	1
Yi-Hsin Chen	1
de la Torre, Jimmy	1
von Goldammer, Ariane	1
More ▼