Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025
The growing diversity among test takers in second or foreign language (L2) assessments brings fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…
Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis
Cornelia Eva Neuert – Sociological Methods & Research, 2024
The quality of data in surveys is affected by response burden and questionnaire length. With an increasing number of questions, respondents can become bored, tired, and annoyed and may take shortcuts to reduce the effort needed to complete the survey. In this article, direct evidence is presented on how the position of items within a web…
Descriptors: Online Surveys, Test Items, Test Format, Test Construction
Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024
A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…
Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability
Duru, Erdinc; Ozgungor, Sevgi; Yildirim, Ozen; Duatepe-Paksu, Asuman; Duru, Sibel – International Journal of Assessment Tools in Education, 2022
The aim of this study is to develop a valid and reliable measurement tool that measures the critical thinking skills of university students. The Pamukkale Critical Thinking Skills Scale was developed as two separate forms: multiple choice and open-ended. The validity and reliability studies of the multiple-choice form were constructed on two different…
Descriptors: Critical Thinking, Cognitive Measurement, Test Validity, Test Reliability
Fitria Lafifa; Dadan Rosana – Turkish Online Journal of Distance Education, 2024
This research aims to develop a multiple-choice closed-ended test to assess and evaluate students' digital literacy skills. The sample in this study were students at MTsN 1 Blitar City who were selected using a purposive sampling technique. The test was also validated by experts, namely 2 Doctors of Physics and Science from Yogyakarta State…
Descriptors: Educational Innovation, Student Evaluation, Digital Literacy, Multiple Choice Tests
Calderón Carvajal, Carlos; Ximénez Gómez, Carmen; Lay-Lisboa, Siu; Briceño, Mauricio – Journal of Psychoeducational Assessment, 2021
Kolb's Learning Style Inventory (LSI) continues to generate a great debate among researchers, given the contradictory evidence resulting from its psychometric properties. One primary criticism focuses on the artificiality of the results derived from its internal structure because of the ipsative nature of the forced-choice format. This study seeks…
Descriptors: Factor Structure, Psychometrics, Test Format, Test Validity
Papenberg, Martin; Diedenhofen, Birk; Musch, Jochen – Journal of Experimental Education, 2021
Testwiseness may introduce construct-irrelevant variance to multiple-choice test scores. Presenting response options sequentially has been proposed as a potential solution to this problem. In an experimental validation, we determined the psychometric properties of a test based on the sequential presentation of response options. We created a strong…
Descriptors: Test Wiseness, Test Validity, Test Reliability, Multiple Choice Tests
Wörner, Salome; Becker, Sebastian; Küchemann, Stefan; Scheiter, Katharina; Kuhn, Jochen – Physical Review Physics Education Research, 2022
Optics is a core field in the curricula of secondary physics education. In this study, we present the development and validation of a test instrument in the field of optics, the ray optics in converging lenses concept inventory (ROC-CI). It was developed for and validated with middle school students, but can also be adapted for use in higher…
Descriptors: Optics, Physics, Science Instruction, Concept Formation
Arce-Ferrer, Alvaro J.; Bulut, Okan – Journal of Experimental Education, 2019
This study investigated the performance of four widely used data-collection designs in detecting test-mode effects (i.e., computer-based versus paper-based testing). The experimental conditions included four data-collection designs, two test-administration modes, and the availability of an anchor assessment. The test-level and item-level results…
Descriptors: Data Collection, Test Construction, Test Format, Computer Assisted Testing
Schoen, Robert C.; Yang, Xiaotong; Liu, Sicong; Paek, Insu – Grantee Submission, 2017
The Early Fractions Test v2.2 is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test v2.2 is to serve as a measure of student outcomes in a randomized trial designed to estimate the effect of an educational…
Descriptors: Psychometrics, Mathematics Tests, Mathematics Achievement, Fractions
Culligan, Brent – Language Testing, 2015
This study compared three common vocabulary test formats, the Yes/No test, the Vocabulary Knowledge Scale (VKS), and the Vocabulary Levels Test (VLT), as measures of vocabulary difficulty. Vocabulary difficulty was defined as the item difficulty estimated through Item Response Theory (IRT) analysis. Three tests were given to 165 Japanese students,…
Descriptors: Language Tests, Test Format, Comparative Analysis, Vocabulary
Schwichow, Martin; Christoph, Simon; Boone, William J.; Härtig, Hendrik – International Journal of Science Education, 2016
The so-called control-of-variables strategy (CVS) incorporates the important scientific reasoning skills of designing controlled experiments and interpreting experimental outcomes. As CVS is a prominent component of science standards, appropriate assessment instruments are required to measure these scientific reasoning skills and to evaluate the…
Descriptors: Thinking Skills, Science Instruction, Science Experiments, Science Tests
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
Wan, Lei; Henly, George A. – Applied Measurement in Education, 2012
Many innovative item formats have been proposed over the past decade, but little empirical research has been conducted on their measurement properties. This study examines the reliability, efficiency, and construct validity of two innovative item formats--the figural response (FR) and constructed response (CR) formats used in a K-12 computerized…
Descriptors: Test Items, Test Format, Computer Assisted Testing, Measurement
Prieto, Luis; Alonso, Jordi; Lamarca, Rosa; Wright, Benjamin D. – Journal of Outcome Measurement, 1998
Data from 45 studies involving 9,149 people were used to develop a short form of the Spanish version of the Nottingham Health Profile through Rasch analysis. Results confirmed the validity of using the developed 22-item short form to measure different groups of people categorized by gender, clinical, and health status. (SLD)
Descriptors: Groups, Health, Individual Characteristics, Item Response Theory