ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	3
Since 2017 (last 10 years)	5
Since 2007 (last 20 years)	8

Descriptor

Classification	10
Comparative Analysis	10
Test Format	10
Accuracy	6
Item Response Theory	4
Simulation	4
Test Items	4
Computer Assisted Testing	3
Achievement Tests	2
Adaptive Testing	2
Diagnostic Tests	2
Foreign Countries	2
Item Analysis	2
Language Tests	2
Language Usage	2
Models	2
Multiple Choice Tests	2
Alternative Assessment	1
Bayesian Statistics	1
Bias	1
Biology	1
Check Lists	1
Clinical Diagnosis	1
College Entrance Examinations	1
College Preparation	1
More ▼

Source

Educational and Psychological…	2
Journal of Educational…	2
ETS Research Report Series	1
International Journal of…	1
Language Testing	1
ProQuest LLC	1

Publication Type

Journal Articles	7
Reports - Research	6
Reports - Evaluative	3
Speeches/Meeting Papers	2
Dissertations/Theses -…	1
Information Analyses	1
Numerical/Quantitative Data	1

Education Level

Secondary Education	2
Grade 7	1
Grade 9	1
Higher Education	1
Postsecondary Education	1

Audience

Location

Greece	1
Turkey (Ankara)	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…

What Works Clearinghouse Rating

Showing all 10 results Save | Export

Impact of Multidimensionality on Unidimensional IRT Linking and Equating Methods

Direct link

Uk Hyun Cho – ProQuest LLC, 2024

The present study investigates the influence of multidimensionality on linking and equating in a unidimensional IRT. Two hypothetical multidimensional scenarios are explored under a nonequivalent group common-item equating design. The first scenario examines test forms designed to measure multiple constructs, while the second scenario examines a…

Descriptors: Item Response Theory, Classification, Correlation, Test Format

Diagnostic Classification Model for Forced-Choice Items and Noncognitive Tests

Peer reviewed

Direct link

Huang, Hung-Yu – Educational and Psychological Measurement, 2023

The forced-choice (FC) item formats used for noncognitive tests typically develop a set of response options that measure different traits and instruct respondents to make judgments among these options in terms of their preference to control the response biases that are commonly observed in normative tests. Diagnostic classification models (DCMs)…

Descriptors: Test Items, Classification, Bayesian Statistics, Decision Making

IRT Approaches to Modeling Scores on Mixed-Format Tests

Peer reviewed

Direct link

Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020

This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…

Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests

On-the-Fly Constraint-Controlled Assembly Methods for Multistage Adaptive Testing for Cognitive Diagnosis

Peer reviewed

Direct link

Liu, Shuchang; Cai, Yan; Tu, Dongbo – Journal of Educational Measurement, 2018

This study applied the mode of on-the-fly assembled multistage adaptive testing to cognitive diagnosis (CD-OMST). Several and several module assembly methods for CD-OMST were proposed and compared in terms of measurement precision, test security, and constrain management. The module assembly methods in the study included the maximum priority index…

Descriptors: Adaptive Testing, Monte Carlo Methods, Computer Security, Clinical Diagnosis

IRT-Based Classification Analysis of an English Language Reading Proficiency Subtest

Peer reviewed

Direct link

Kaya, Elif; O'Grady, Stefan; Kalender, Ilker – Language Testing, 2022

Language proficiency testing serves an important function of classifying examinees into different categories of ability. However, misclassification is to some extent inevitable and may have important consequences for stakeholders. Recent research suggests that classification efficacy may be enhanced substantially using computerized adaptive…

Descriptors: Item Response Theory, Test Items, Language Tests, Classification

Panel Design Variations in the Multistage Test Using the Mixed-Format Tests

Peer reviewed

Direct link

Kim, Jiseon; Chung, Hyewon; Dodd, Barbara G.; Park, Ryoungsun – Educational and Psychological Measurement, 2012

This study compared various panel designs of the multistage test (MST) using mixed-format tests in the context of classification testing. Simulations varied the design of the first-stage module. The first stage was constructed according to three levels of test information functions (TIFs) with three different TIF centers. Additional computerized…

Descriptors: Test Format, Comparative Analysis, Computer Assisted Testing, Adaptive Testing

PISA Test Items and School-Based Examinations in Greece: Exploring the Relationship between Global and Local Assessment Discourses

Peer reviewed

Direct link

Anagnostopoulou, Kyriaki; Hatzinikita, Vassilia; Christidou, Vasilia; Dimopoulos, Kostas – International Journal of Science Education, 2013

The paper explores the relationship of the global and the local assessment discourses as expressed by Programme for International Student Assessment (PISA) test items and school-based examinations, respectively. To this end, the paper compares PISA test items related to living systems and the context of life, health, and environment, with Greek…

Descriptors: Foreign Countries, Achievement Tests, Secondary School Students, Discourse Analysis

Studies of a Latent-Class Signal-Detection Model for Constructed-Response Scoring. Research Report. ETS RR-08-63

Peer reviewed
PDF on ERIC

Download full text

DeCarlo, Lawrence T. – ETS Research Report Series, 2008

Rater behavior in essay grading can be viewed as a signal-detection task, in that raters attempt to discriminate between latent classes of essays, with the latent classes being defined by a scoring rubric. The present report examines basic aspects of an approach to constructed-response (CR) scoring via a latent-class signal-detection model. The…

Descriptors: Scoring, Responses, Test Format, Bias

Toward an Operational Definition of Educational Performance Assessments.

Download full text

Finch, F. L.; Dost, Marcia A. – 1992

Many state and local entities are developing and using performance assessment programs. Because these initiatives are so diverse, it is very difficult to understand what they are doing, or to compare them in any meaningful way. Multiple-choice tests are contrasted with performance assessments, and preliminary classifications are suggested to…

Descriptors: Alternative Assessment, Classification, Comparative Analysis, Constructed Response

Some Issues in the Testing of Vocabulary Knowledge.

Download full text

Read, John; Nation, Paul – 1986

A review of the literature on a variety of issues related to testing vocabulary knowledge in a second language addresses these topics: problems in estimating vocabulary size, including the related questions of what constitutes a word, how a sample should be selected, and what are the criteria for knowing a word; sampling the basic and specialized…

Descriptors: Achievement Tests, Check Lists, Classification, Comparative Analysis

Anagnostopoulou, Kyriaki	1
Cai, Yan	1
Choi, Jiwon	1
Christidou, Vasilia	1
Chung, Hyewon	1
DeCarlo, Lawrence T.	1
Dimopoulos, Kostas	1
Dodd, Barbara G.	1
Dost, Marcia A.	1
Finch, F. L.	1
Hatzinikita, Vassilia	1
Huang, Hung-Yu	1
Kalender, Ilker	1
Kang, Yujin	1
Kaya, Elif	1
Kim, Jiseon	1
Kim, Stella Y.	1
Lee, Won-Chan	1
Liu, Shuchang	1
Nation, Paul	1
O'Grady, Stefan	1
Park, Ryoungsun	1
Read, John	1
Tu, Dongbo	1
Uk Hyun Cho	1
More ▼