ERIC - Search Results

Publication Date

In 2026	0
Since 2025	14
Since 2022 (last 5 years)	62
Since 2017 (last 10 years)	133
Since 2007 (last 20 years)	419

Descriptor

Item Analysis	957
Test Validity	957
Test Reliability	535
Test Construction	425
Test Items	303
Foreign Countries	210
Factor Analysis	200
Psychometrics	169
Correlation	116
Statistical Analysis	110
Achievement Tests	109
Higher Education	98
Difficulty Level	95
Multiple Choice Tests	95
Questionnaires	84
Scores	84
Evaluation Methods	76
Test Interpretation	76
Rating Scales	75
Criterion Referenced Tests	74
Test Bias	74
College Students	72
Measures (Individuals)	71
Measurement Techniques	70
Factor Structure	69
More ▼

Education Level

Higher Education	142
Postsecondary Education	104
Secondary Education	72
Elementary Education	63
Elementary Secondary Education	44
High Schools	38
Middle Schools	35
Early Childhood Education	31
Junior High Schools	22
Grade 5	19
Grade 6	17
Intermediate Grades	17
Primary Education	17
Preschool Education	16
Grade 3	15
Grade 4	15
Grade 8	15
Grade 7	13
Adult Education	11
Grade 9	6
Kindergarten	6
Grade 1	5
Grade 10	5
Grade 12	4
Two Year Colleges	4
More ▼

Audience

Researchers	34
Practitioners	14
Teachers	8
Students	2
Administrators	1
Counselors	1
Policymakers	1

Location

Turkey	52
Canada	15
Iran	11
Australia	10
China	10
California	7
India	7
Indonesia	7
United Kingdom	7
Florida	6
Japan	6
Taiwan	6
Arkansas	5
Italy	5
Nigeria	5
Georgia	4
Germany	4
Israel	4
Malaysia	4
New York	4
Singapore	4
Spain	4
Alabama	3
Brazil	3
Greece	3
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	5
Individuals with Disabilities…	4
Elementary and Secondary…	1
Individuals with Disabilities…	1

What Works Clearinghouse Rating

Test Validity X

Showing 46 to 60 of 957 results Save | Export

The Pattern of Test-Taking Effort across Items in Cognitive Ability Test: A Latent Class Analysis

Peer reviewed
PDF on ERIC

Download full text

Akhtar, Hanif – International Association for Development of the Information Society, 2022

When examinees perceive a test as low stakes, it is logical to assume that some of them will not put out their maximum effort. This condition makes the validity of the test results more complicated. Although many studies have investigated motivational fluctuation across tests during a testing session, only a small number of studies have…

Descriptors: Intelligence Tests, Student Motivation, Test Validity, Student Attitudes

Exploring Sex Differences in the Structure of the ADOS-2 in an Early Intervention Sample

Direct link

Brigid Garvin – ProQuest LLC, 2021

Autism Spectrum Disorder (ASD) is diagnosed using the same criteria for males and females (e.g., DSM-5, ICD-10). Our understanding of ASD, including its etiology, symptom presentation, and prevalence has evolved significantly over time motivating several changes to the diagnostic criteria and the tools with which symptoms are measured. One aspect…

Descriptors: Preschool Children, Autism Spectrum Disorders, Diagnostic Tests, Observation

Developing the Diagnostic Test of Misconceptions of Fractions

Peer reviewed
PDF on ERIC

Download full text

Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023

This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…

Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions

Gender Bias in Test Item Formats: Evidence from PISA 2009, 2012, and 2015 Math and Reading Tests

Peer reviewed

Direct link

Shear, Benjamin R. – Journal of Educational Measurement, 2023

Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…

Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests

Item Objective Congruence Analysis for Multidimensional Items: Content Validation of a Reading Test in Sri Lankan University

Peer reviewed
PDF on ERIC

Download full text

Ismail, Fouzul Kareema Mohamed; Zubairi, Ainol Madziah Bt. – English Language Teaching, 2022

This paper presents the findings of a study that intended to seek the content validity (CV) evidence of an instrument to measure the reading ability of university students in Sri Lanka. The reading passages and items were adapted from CEFR aligned Learning Resource Network (LRN) materials. The items were designed based on the cognitive processing…

Descriptors: Foreign Countries, Test Items, Content Validity, Reading Tests

Development of a High-Level Thinking Skills Test (HOTS) in English Writing

Peer reviewed
PDF on ERIC

Download full text

Mardiana – Eurasian Journal of Applied Linguistics, 2023

Written inquiries, which are more frequent and have less of a focus on complex thinking, are issues at school. Students are not taught how to respond to questions found in High-Level Thinking Skills (HOTS) tests, hence, their thinking abilities are generally weak. The issue for teachers is that neither they nor anyone else has been able to create…

Descriptors: Skill Development, Thinking Skills, Check Lists, Models

Probing the Internal Validity of the LLAMA Language Aptitude Tests

Peer reviewed

Direct link

Bokander, Lars; Bylund, Emanuel – Language Learning, 2020

Over the past decade, the LLAMA language aptitude test battery has come to play an increasingly important role as an instrument in research on individual differences in language development. However, a potentially serious problem that has been pointed out by several scholars is that the LLAMA has not yet been carefully validated. We addressed this…

Descriptors: Item Analysis, Language Tests, Test Items, Individual Differences

Ensuring Fairness in Difficulty and Content among Parallel Assessments Generated from a Test-Item Database

Download full text

Parry, James R. – Online Submission, 2020

This paper presents research and provides a method to ensure that parallel assessments, that are generated from a large test-item database, maintain equitable difficulty and content coverage each time the assessment is presented. To maintain fairness and validity it is important that all instances of an assessment, that is intended to test the…

Descriptors: Culture Fair Tests, Difficulty Level, Test Items, Test Validity

Student Perceptions of a Whole Child School Screening Instrument: An Initial Step in Attending to Consequential Validity

Peer reviewed

Direct link

Kimmia Lyon; Jessica B. Koslouski; Sandra M. Chafouleas; Amy M. Briesch; Jacqueline M. Caemmerer – Grantee Submission, 2025

Existing educational assessments have typically been developed without appropriate attention to the intended and unintended consequences of measure implementation and interpretation. We are developing the Expanding Screening to Support Youth (ESSY) Whole Child Screener using a mixed methods approach that attends to the intended and unintended…

Descriptors: Student Attitudes, Screening Tests, Validity, Grade 3

Student Perceptions of a Whole Child School Screening Instrument: An Initial Step in Attending to Consequential Validity

Peer reviewed

Direct link

Kimmia Lyon; Jessica B. Koslouski; Sandra M. Chafouleas; Amy M. Briesch; Jacqueline M. Caemmerer – School Mental Health, 2025

Descriptors: Student Attitudes, Screening Tests, Validity, Grade 3

The Development and Initial Validation of O-WSVLT, a Meaning-Recall Online L2 Spanish Vocabulary Levels Test

Peer reviewed

Direct link

Pablo Robles-García; Stuart McLean; Jeffrey Stewart; Ji-young Shin; Claudia Helena Sánchez-Gutiérrez – Language Assessment Quarterly, 2024

Recent literature in the field of L2 vocabulary assessment has advocated for the development of written receptive vocabulary tests such as Vocabulary Levels Tests (VLTs) that use: (a) meaning-recall item formats, (b) a minimum of 40 item counts per 1,000-frequency band to improve level estimates, and (c) lemmas (not word-families) as the lexical…

Descriptors: Spanish, Test Validity, Test Construction, Vocabulary Development

Implementation of Four-Tier Multiple-Choice Instruments Based on the Partial Credit Model in Evaluating Students' Learning Progress

Peer reviewed
PDF on ERIC

Download full text

Laliyo, Lukman Abdul Rauf; Hamdi, Syukrul; Pikoli, Masrid; Abdullah, Romario; Panigoro, Citra – European Journal of Educational Research, 2021

One of the issues that hinder the students' learning progress is the inability to construct an epistemological explanation of a scientific phenomenon. Four-tier multiple-choice (hereinafter, 4TMC) instrument and Partial-Credit Model were employed to elaborate on the diagnosis process of the aforementioned problem. This study was to develop and…

Descriptors: Learning Processes, Multiple Choice Tests, Models, Test Items

Comparison of DIF Methods for the Student Experience in the Research University Survey: A Validity and Methodological Study

Direct link

Thapelo Ncube Whitfield – ProQuest LLC, 2021

Student Experience surveys are used to measure student attitudes towards their campus as well as to initiate conversations for institutional change. Validity evidence to support the interpretations of these surveys' results, however, is lacking. The first purpose of this study was to compare three Differential Item Functioning (DIF) methods on…

Descriptors: College Students, Student Surveys, Student Experience, Student Attitudes

Exploring the Influence of Judge Proficiency on Standard-Setting Judgments

Peer reviewed

Direct link

Peabody, Michael R.; Wind, Stefanie A. – Journal of Educational Measurement, 2019

Setting performance standards is a judgmental process involving human opinions and values as well as technical and empirical considerations. Although all cut score decisions are by nature somewhat arbitrary, they should not be capricious. Judges selected for standard-setting panels should have the proper qualifications to make the judgments asked…

Descriptors: Standard Setting, Decision Making, Performance Based Assessment, Evaluators

Evaluating Research Reports on the Qualities of Tests of English Language Skills in Indonesian Schools: A Systematic Review

Peer reviewed
PDF on ERIC

Download full text

Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025

The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveal that a number of evaluation studies have been done by Indonesian scholars in the…

Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 64

Educational and Psychological…	57
Journal of Educational…	26
Journal of Psychoeducational…	24
ProQuest LLC	23
Online Submission	16
Educational Sciences: Theory…	12
Language Testing	9
Applied Measurement in…	8
Assessment for Effective…	8
Educational Measurement:…	7
Grantee Submission	7
Journal of Consulting and…	7
Journal of Education and…	6
Journal of Vocational Behavior	6
Measurement and Evaluation in…	6
Research on Social Work…	6
CBE - Life Sciences Education	5
Canadian Journal of School…	5
ETS Research Report Series	5
Educational Assessment	5
International Journal of…	5
Language Assessment Quarterly	5
Measurement and Evaluation in…	5
Applied Psychological…	4
Educational Research and…	4
More ▼

Erford, Bradley T.	6
Haladyna, Tom	5
Dedrick, Robert F.	4
Ferron, John	4
Green, Donald Ross	4
Hambleton, Ronald K.	4
Michael, William B.	4
Pyrczak, Fred	4
Roid, Gale	4
Shaunessy-Dedrick, Elizabeth	4
Suldo, Shannon M.	4
Whitney, Douglas R.	4
Brown, James Dean	3
Echternacht, Gary	3
Filby, Nikola N.	3
Kolstad, Rosemarie K.	3
O'Reilly, Tenaha	3
Plake, Barbara S.	3
Reckase, Mark D.	3
Aaronson, May	2
Abbott, Robert D.	2
Abell, Neil	2
Abrams, Lisa M.	2
Amy M. Briesch	2
More ▼

Reports - Research	577
Journal Articles	494
Reports - Evaluative	106
Speeches/Meeting Papers	92
Tests/Questionnaires	58
Reports - Descriptive	36
Dissertations/Theses -…	24
Information Analyses	14
Numerical/Quantitative Data	14
Opinion Papers	14
Guides - Non-Classroom	12
Guides - General	4
Books	2
Dissertations/Theses	2
Legal/Legislative/Regulatory…	2
Reference Materials -…	2
Collected Works - General	1
Collected Works - Proceedings	1
Collected Works - Serials	1
Dissertations/Theses -…	1
ERIC Publications	1
Guides - Classroom - Learner	1
Guides - Classroom - Teacher	1
Reference Materials -…	1
More ▼

Stanford Achievement Tests	11
California Achievement Tests	10
National Assessment of…	8
Iowa Tests of Basic Skills	7
National Teacher Examinations	6
Peabody Picture Vocabulary…	6
Wechsler Intelligence Scale…	6
Metropolitan Achievement Tests	5
Minnesota Multiphasic…	5
SAT (College Admission Test)	5
Comprehensive Tests of Basic…	4
Program for International…	4
Stanford Binet Intelligence…	4
Adjective Check List	3
Armed Services Vocational…	3
Childrens Manifest Anxiety…	3
General Educational…	3
Graduate Record Examinations	3
Piers Harris Childrens Self…	3
Rosenberg Self Esteem Scale	3
Strong Vocational Interest…	3
Test of English as a Foreign…	3
Trends in International…	3
Wechsler Adult Intelligence…	3
Adaptive Behavior Scale	2
More ▼