ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	12
Since 2017 (last 10 years)	30
Since 2007 (last 20 years)	76

Descriptor

Test Items	143
Test Validity	106
Test Construction	91
Test Reliability	45
Item Analysis	26
Item Response Theory	25
Validity	23
Evaluation Methods	20
Psychometrics	20
Scores	20
Scoring	19
Student Evaluation	19
Testing	18
Foreign Countries	16
Difficulty Level	15
Language Tests	15
Standardized Tests	15
Achievement Tests	14
Content Validity	14
Higher Education	14
Test Use	14
Models	13
Multiple Choice Tests	13
Test Format	13
Mathematics Tests	12
More ▼

Publication Type

Reports - Descriptive	143
Journal Articles	85
Numerical/Quantitative Data	11
Speeches/Meeting Papers	10
Tests/Questionnaires	8
Opinion Papers	6
Guides - Non-Classroom	4
Reports - Research	2
Collected Works - Serials	1
Guides - Classroom - Teacher	1
Information Analyses	1
Reports - Evaluative	1
More ▼

Education Level

Elementary Education	9
Elementary Secondary Education	8
Grade 5	8
Higher Education	8
Secondary Education	8
Grade 4	7
Middle Schools	7
Grade 6	6
Grade 7	6
High Schools	6
Junior High Schools	6
Postsecondary Education	6
Grade 8	5
Early Childhood Education	4
Grade 3	4
Intermediate Grades	4
Primary Education	4
Grade 9	3
Kindergarten	2
Grade 1	1
Grade 10	1
Grade 12	1
Grade 2	1
Preschool Education	1
More ▼

Audience

Practitioners	7
Teachers	7
Administrators	5
Researchers	4
Community	1
Parents	1
Policymakers	1

Location

Australia	3
Massachusetts	3
Missouri	3
Oregon	3
Florida	2
Idaho	2
Japan	2
New Mexico	2
Tennessee	2
Washington	2
Canada	1
Georgia	1
India	1
Kuwait	1
Maryland	1
Mexico	1
Nebraska	1
New York	1
Philippines	1
Puerto Rico	1
Switzerland	1
United Kingdom	1
United Kingdom (Edinburgh)	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	4
Comprehensive Education…	2

Assessments and Surveys

National Assessment of…	3
Program for International…	3
SAT (College Admission Test)	2
Test of English as a Foreign…	2
Alberta Grade Twelve Diploma…	1
Eysenck Personality Inventory	1
Measures of Academic Progress	1
National Teacher Examinations	1
New York State Regents…	1
North Carolina End of Course…	1
Pediatric Evaluation of…	1
Raven Advanced Progressive…	1
Stanford Achievement Tests	1
Test of English for…	1
Work Keys (ACT)	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 143 results Save | Export

Using Content Relevance and Representativeness Indices in Instrument Revision

Peer reviewed

Direct link

Anne Traynor; Sara C. Christopherson – Applied Measurement in Education, 2024

Combining methods from earlier content validity and more contemporary content alignment studies may allow a more complete evaluation of the meaning of test scores than if either set of methods is used on its own. This article distinguishes item relevance indices in the content validity literature from test representativeness indices in the…

Descriptors: Test Validity, Test Items, Achievement Tests, Test Construction

Improving Mathematics Diagnostic Tests Using Item Analysis

Peer reviewed

Direct link

Meike Akveld; George Kinnear – International Journal of Mathematical Education in Science and Technology, 2024

Many universities use diagnostic tests to assess incoming students' preparedness for mathematics courses. Diagnostic test results can help students to identify topics where they need more practice and give lecturers a summary of strengths and weaknesses in their class. We demonstrate a process that can be used to make improvements to a mathematics…

Descriptors: Mathematics Tests, Diagnostic Tests, Test Items, Item Analysis

Response Process Validity Evidence in Chemistry Education Research

Peer reviewed

Direct link

Deng, Jacky M.; Streja, Nicholas; Flynn, Alison B. – Journal of Chemical Education, 2021

Response process validity evidence can provide researchers with insight into how and why participants interpret items on instruments such as tests and questionnaires. In chemistry education research literature and the social sciences more broadly, response process validity evidence has been used and reported in a variety of ways. This paper's…

Descriptors: Chemistry, Science Education, Educational Research, Validity

TOEFL iBT® Technical Manual. TOEFL® Research Series. RR-106. ETS Research Report. RR-25-12

Peer reviewed
PDF on ERIC

Download full text

Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025

This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction

Supporting the Interpretive Validity of Student-Level Claims in Science Assessment with Tiered Claim Structures

Peer reviewed

Direct link

Student, Sanford R.; Gong, Brian – Educational Measurement: Issues and Practice, 2022

We address two persistent challenges in large-scale assessments of the Next Generation Science Standards: (a) the validity of score interpretations that target the standards broadly and (b) how to structure claims for assessments of this complex domain. The NGSS pose a particular challenge for specifying claims about students that evidence from…

Descriptors: Science Tests, Test Validity, Test Items, Test Construction

Designing and Evaluating Tasks to Measure Individual Differences in Experimental Psychology: A Tutorial

Peer reviewed

Direct link

Marc Brysbaert – Cognitive Research: Principles and Implications, 2024

Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many researchers have not learned during their training. The lack of expertise can pose…

Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis

Development and Validation of a Survey Instrument for Measuring Pre-Service Teachers' Pedagogical Content Knowledge

Peer reviewed

Direct link

Martin, David; Jamieson-Proctor, Romina – International Journal of Research & Method in Education, 2020

In Australia, one of the key findings of the Teacher Education Ministerial Advisory Group was that not all graduating pre-service teachers possess adequate pedagogical content knowledge (PCK) to teach effectively. The concern is that higher education providers working with pre-service teachers are using pedagogical practices and assessments which…

Descriptors: Test Construction, Preservice Teachers, Pedagogical Content Knowledge, Foreign Countries

Item Pool Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection

Peer reviewed

Direct link

Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022

In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…

Descriptors: Standardized Tests, Test Items, Test Validity, Scores

Development and Initial Validation of a Multidimensional Questionnaire on the Relationship to Work (RWQ)

Peer reviewed

Direct link

Fournier, Geneviève; Lachance, Lise; Viviers, Simon; Lahrizi, Imane Zineb; Goyer, Liette; Masdonati, Jonas – International Journal for Educational and Vocational Guidance, 2020

The paper presents first the theoretical foundations used to develop a pre-experimental version of a questionnaire on relationship to work, and then the four stages of its initial validation leading to an experimental version. These stages included: (1) Defining the dimensions and sub-dimensions of the relationship to work concept; (2)…

Descriptors: Test Construction, Content Validity, Work Attitudes, Test Items

Argument-Based Validation in Practice: Examples from Mathematics Education

Peer reviewed

Direct link

Krupa, Erin Elizabeth; Carney, Michele; Bostic, Jonathan – Applied Measurement in Education, 2019

This article provides a brief introduction to the set of four articles in the special issue. To provide a foundation for the issue, key terms are defined, a brief historical overview of validity is provided, and a description of several different validation approaches used in the issue are explained. Finally, the contribution of the articles to…

Descriptors: Test Items, Program Validation, Test Validity, Mathematics Education

Establishing Survey Validity: A Practical Guide

Peer reviewed
PDF on ERIC

Download full text

Cobern, William W.; Adams, Betty A. J. – International Journal of Assessment Tools in Education, 2020

What follows is a practical guide for establishing the validity of a survey for research purposes. The motivation for providing this guide is our observation that researchers, not necessarily being survey researchers per se, but wanting to use a survey method, lack a concise resource on validity. There is far more to know about surveys and survey…

Descriptors: Surveys, Test Validity, Test Construction, Test Items

Evaluating Content-Related Validity Evidence Using a Text-Based Machine Learning Procedure

Peer reviewed

Direct link

Anderson, Daniel; Rowley, Brock; Stegenga, Sondra; Irvin, P. Shawn; Rosenberg, Joshua M. – Educational Measurement: Issues and Practice, 2020

Validity evidence based on test content is critical to meaningful interpretation of test scores. Within high-stakes testing and accountability frameworks, content-related validity evidence is typically gathered via alignment studies, with panels of experts providing qualitative judgments on the degree to which test items align with the…

Descriptors: Content Validity, Artificial Intelligence, Test Items, Vocabulary

The Uses of Process Data in Large-Scale Educational Assessments. OECD Education Working Papers. No. 286

Direct link

Maddox, Bryan – OECD Publishing, 2023

The digital transition in educational testing has introduced many new opportunities for technology to enhance large-scale assessments. These include the potential to collect and use log data on test-taker response processes routinely, and on a large scale. Process data has long been recognised as a valuable source of validation evidence in…

Descriptors: Measurement, Inferences, Test Reliability, Computer Assisted Testing

Practical Online Assessment of Mathematical Proof

Peer reviewed

Direct link

Thomas Bickerton, Robert; Sangwin, Chris J. – International Journal of Mathematical Education in Science and Technology, 2022

We discuss a practical method for assessing mathematical proof online. We examine the use of faded worked examples and reading comprehension questions to understand proof. By breaking down a given proof, we formulate a checklist that can be used to generate comprehension questions which can be assessed automatically online. We then provide some…

Descriptors: Mathematics Instruction, Validity, Mathematical Logic, Evaluation Methods

Diagnostic Test Construction: Insights from Cognitive Diagnostic Modeling

Peer reviewed
PDF on ERIC

Download full text

Ketabi, Somaye; Alavi, Seyyed Mohammed; Ravand, Hamdollah – International Journal of Language Testing, 2021

Although Diagnostic Classification Models (DCMs) were introduced to education system decades ago, it seems that these models were not employed for the original aims upon which they had been designed. Using DCMs has been mostly common in analyzing large-scale non-diagnostic tests and these models have been rarely used in developing Cognitive…

Descriptors: Diagnostic Tests, Test Construction, Goodness of Fit, Classification

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10

Educational Measurement:…	11
Behavioral Research and…	4
Journal of Applied Testing…	3
Journal of Educational and…	3
Measurement and Evaluation in…	3
Online Submission	3
Practical Assessment,…	3
Applied Measurement in…	2
Educational Assessment	2
International Journal of…	2
Journal of Chemical Education	2
Journal of Educational…	2
New Meridian Corporation	2
New Mexico Public Education…	2
OECD Publishing	2
Performance and Instruction	2
Structural Equation Modeling:…	2
AMATYC Review	1
Alberta Journal of…	1
American Journal of Sexuality…	1
Assessment and Accountability…	1
Assessment in Education:…	1
Assessment in Higher Education	1
Astronomy Education Review	1
British Journal of…	1
More ▼

Stansfield, Charles W.	6
Liu, Kimy	3
Sireci, Stephen G.	3
Embretson, Susan E.	2
Ferrando, Pere J.	2
Geller, Josh	2
Irvin, P. Shawn	2
Jung, Eunju	2
Ketterlin-Geller, Leanne R.	2
Lee, Yi-Hsuan	2
Petscher, Yaacov	2
Polikoff, Morgan S.	2
Tindal, Gerald	2
Truckenmiller, Adrea	2
Yovanoff, Paul	2
Abedi, Jamal	1
Adams, Betty A. J.	1
Ahmed, S.	1
Alavi, Seyyed Mohammed	1
Alonzo, Julie	1
Anderson, Daniel	1
Andersson, Luanne	1
Andrich, David	1
Anne Traynor	1
More ▼