ERIC - Search Results

Publication Date

In 2026	0
Since 2025	4
Since 2022 (last 5 years)	26
Since 2017 (last 10 years)	50
Since 2007 (last 20 years)	85

Descriptor

Test Format	243
Test Reliability	243
Test Validity	243
Test Construction	91
Test Items	60
Testing	50
Test Interpretation	43
Higher Education	41
Language Tests	38
Standardized Tests	38
Foreign Countries	37
Test Content	37
Test Use	36
Elementary Secondary Education	33
Test Reviews	32
Student Evaluation	31
Computer Assisted Testing	30
Scoring	30
Multiple Choice Tests	27
Psychometrics	26
Comparative Analysis	25
Screening Tests	25
Disability Identification	24
Evaluation Methods	24
Second Language Learning	24
More ▼

Education Level

Higher Education	33
Postsecondary Education	27
Secondary Education	18
Elementary Education	17
Middle Schools	13
Junior High Schools	12
Grade 8	9
High Schools	8
Grade 4	6
Grade 5	6
Grade 7	6
Intermediate Grades	6
Early Childhood Education	5
Elementary Secondary Education	5
Grade 3	5
Grade 6	5
Primary Education	4
Adult Education	2
Grade 9	1
Kindergarten	1
Preschool Education	1
More ▼

Audience

Practitioners	22
Administrators	15
Teachers	14
Researchers	4
Community	1
Policymakers	1
Students	1
Support Staff	1

Location

New York	8
Canada	3
Israel	3
Turkey	3
Georgia	2
Germany	2
Indonesia	2
Iran	2
Japan	2
Netherlands	2
Singapore	2
Bangladesh	1
Czech Republic	1
Estonia	1
Louisiana	1
Missouri	1
Nebraska	1
New Jersey	1
New York (Albany)	1
New York (Buffalo)	1
New York (New York)	1
New York (Rochester)	1
New York (Syracuse)	1
North Carolina	1
North Dakota	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	1
Job Training Partnership Act…	1
No Child Left Behind Act 2001	1
Pell Grant Program	1

What Works Clearinghouse Rating

Showing 1 to 15 of 243 results Save | Export

A Review of Automatic Item Generation Techniques Leveraging Large Language Models

Peer reviewed
PDF on ERIC

Download full text

Bin Tan; Nour Armoush; Elisabetta Mazzullo; Okan Bulut; Mark J. Gierl – International Journal of Assessment Tools in Education, 2025

This study reviews existing research on the use of large language models (LLMs) for automatic item generation (AIG). We performed a comprehensive literature search across seven research databases, selected studies based on predefined criteria, and summarized 60 relevant studies that employed LLMs in the AIG process. We identified the most commonly…

Descriptors: Artificial Intelligence, Test Items, Automation, Test Format

Do Different Devices Perform Equally Well with Different Numbers of Scale Points and Response Formats? A Test of Measurement Invariance and Reliability

Peer reviewed

Direct link

Natalja Menold; Vera Toepoel – Sociological Methods & Research, 2024

Research on mixed devices in web surveys is in its infancy. Using a randomized experiment, we investigated device effects (desktop PC, tablet and mobile phone) for six response formats and four different numbers of scale points. N = 5,077 members of an online access panel participated in the experiment. An exact test of measurement invariance and…

Descriptors: Online Surveys, Handheld Devices, Telecommunications, Test Reliability

Selecting Technically Adequate Tests

Peer reviewed

Direct link

Susan K. Johnsen – Gifted Child Today, 2024

The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…

Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity

Meta[superscript 2]: A Meta-Analysis and Psychometric Evaluation of the Metacognitive Awareness Inventory (MAI) in the Context of Health Professions Education

Peer reviewed

Direct link

Andrew S. Cale; Elizabeth R. Agosto; Brenda Kucha Anak Ganeng; Megan E. Kruskie; Margaret A. McNulty; Kyle A. Robertson; Cecelia J. Vetter; Sabrina C. Woods; Md. Nazmul Karim; Adam B. Wilson – Anatomical Sciences Education, 2025

To keep pace with medicine's unpredictable changes, medical trainees must learn to accurately monitor and evaluate themselves via metacognition (i.e., thinking about thinking). The Metacognitive Awareness Inventory (MAI) can assess and guide the metacognitive development of trainees. This study summarizes existing psychometric evidence and…

Descriptors: Meta Analysis, Psychometrics, Metacognition, Measures (Individuals)

The DAATS Battery Short Form as a Measure of Teacher Dispositions

Peer reviewed
PDF on ERIC

Download full text

Judy R. Wilkerson; W. Steve Lang; LaSonya Moore – Journal of Research in Education, 2025

The DAATS (Dispositions Assessments Aligned with Teacher Standards) battery is a series of five instruments of different item types that measure teachers' consistency with the critical dispositions embedded in the InTASC Standards. The purpose of this study was to continue a 20-year research project on the development and implementation of…

Descriptors: Educational Assessment, National Standards, Teacher Evaluation, Teacher Competencies

A Two-Level Adaptive Test Battery

Peer reviewed

Direct link

Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024

A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…

Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability

Data Literacy Assessments: A Systematic Literature Review

Peer reviewed

Direct link

Cui, Ying; Chen, Fu; Lutsyk, Alina; Leighton, Jacqueline P.; Cutumisu, Maria – Assessment in Education: Principles, Policy & Practice, 2023

With the exponential increase in the volume of data available in the 21st century, data literacy skills have become vitally important in work places and everyday life. This paper provides a systematic review of available data literacy assessments targeted at different audiences and educational levels. The results can help researchers and…

Descriptors: Data, Information Literacy, 21st Century Skills, Competence

Not Liking the Likert? A Rasch Analysis of Forced-Choice Format and Usefulness in Survey Design

Peer reviewed

Direct link

Celeste Combrinck – SAGE Open, 2024

We have less time and focus than ever before, while the demand for attention is increasing. Therefore, it is no surprise that when answering questionnaires, we often choose to strongly agree or be neutral, producing problematic and unusable data. The current study investigated forced-choice (ipsative) format compared to the same questions on a…

Descriptors: Likert Scales, Test Format, Surveys, Design

Exploring Curriculum-Based Measurement in Elementary Science: Investigating Two Vocabulary-Matching Formats

Peer reviewed

Direct link

Conoyer, Sarah J.; Wagner, Kyle B.; Janssen, Kristen K.; Jewell, Jeremy D.; McKenney, Elizabeth L. W. – Assessment for Effective Intervention, 2023

As content literacy intervention is expanded in schools, data-based decision-making practices need to also advance, especially in the areas of science. Vocabulary-matching curriculum-based measures (VM-CBM) may allow educators to identify students needing additional support in science vocabulary to assist with using and comprehending disciplinary…

Descriptors: Curriculum Based Assessment, Elementary School Science, Vocabulary, Benchmarking

The MSRT: A Critical Review of English Proficiency in Iran

Peer reviewed

Direct link

Muhammed Parviz; Masoud Azizi – Discover Education, 2025

This article offers a critical review of the Ministry of Science, Research, and Technology English Proficiency Test (MSRT), a high-stakes exam required for postgraduate graduation, scholarships, and certain employment positions in Iran. Despite its widespread use, the design and implementation of the MSRT raise concerns about its validity and…

Descriptors: Language Tests, Language Proficiency, English (Second Language), Second Language Learning

Measuring Mathematical Skills in Early Childhood: A Systematic Review of the Psychometric Properties of Early Maths Assessments and Screeners

Peer reviewed

Direct link

Laura A. Outhwaite; Pirjo Aunio; Jaimie Ka Yu Leung; Jo Van Herwegen – Educational Psychology Review, 2024

Successful early mathematical development is vital to children's later education, employment, and wellbeing outcomes. However, established measurement tools are infrequently used to (i) assess children's mathematical skills and (ii) identify children with or at-risk of mathematical learning difficulties. In response, this pre-registered systematic…

Descriptors: Mathematics Tests, Screening Tests, Mathematics Skills, At Risk Students

Establishing Survey Validity: A Practical Guide

Peer reviewed
PDF on ERIC

Download full text

Cobern, William W.; Adams, Betty A. J. – International Journal of Assessment Tools in Education, 2020

What follows is a practical guide for establishing the validity of a survey for research purposes. The motivation for providing this guide is our observation that researchers, not necessarily being survey researchers per se, but wanting to use a survey method, lack a concise resource on validity. There is far more to know about surveys and survey…

Descriptors: Surveys, Test Validity, Test Construction, Test Items

Pamukkale Critical Thinking Skill Scale: A Validity and Reliability Study

Peer reviewed
PDF on ERIC

Download full text

Duru, Erdinc; Ozgungor, Sevgi; Yildirim, Ozen; Duatepe-Paksu, Asuman; Duru, Sibel – International Journal of Assessment Tools in Education, 2022

The aim of this study is to develop a valid and reliable measurement tool that measures critical thinking skills of university students. Pamukkale Critical Thinking Skills Scale was developed as two separate forms; multiple choice and open-ended. The validity and reliability studies of the multiple-choice form were constructed on two different…

Descriptors: Critical Thinking, Cognitive Measurement, Test Validity, Test Reliability

The Canadian English Language Proficiency Index Program (CELPIP) Test

Peer reviewed

Direct link

McLeod, Melissa; Cheng, Liying – Language Assessment Quarterly, 2023

The Canadian English Language Proficiency Index Program (CELPIP) Test was designed for immigration and citizenship in Canada. CELPIP is a computer-based English-language proficiency test which covers all four skills. This test review provides a description of the test and its construct, tasks, and delivery. Then, it appraises CELPIP for…

Descriptors: Language Tests, Language Proficiency, English (Second Language), Second Language Learning

Reliability and Validity of Methods to Assess Undergraduate Healthcare Student Performance in Pharmacology: Comparison of Open Book versus Time-Limited Closed Book Examinations

Peer reviewed
PDF on ERIC

Download full text

David Bell; Vikki O'Neill; Vivienne Crawford – Practitioner Research in Higher Education, 2023

We compared the influence of open-book extended duration versus closed book time-limited format on reliability and validity of written assessments of pharmacology learning outcomes within our medical and dental courses. Our dental cohort undertake a mid-year test (30xfree-response short answer to a question, SAQ) and end-of-year paper (4xSAQ,…

Descriptors: Undergraduate Students, Pharmacology, Pharmaceutical Education, Test Format

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 17

Diagnostique	26
Educational and Psychological…	8
New York State Education…	8
Journal of Reading	5
Journal of Educational…	4
Assessment for Effective…	3
International Journal of…	3
Journal of Experimental…	3
Language Assessment Quarterly	3
Language Testing	3
Psychological Assessment	3
Annual Review of Applied…	2
Applied Psychological…	2
Assessment	2
Assessment in Education:…	2
College Board	2
Grantee Submission	2
International Journal of…	2
Journal of Psychoeducational…	2
Online Submission	2
Academic Medicine	1
Advances in Language and…	1
Anatomical Sciences Education	1
Applied Measurement in…	1
Asia-Pacific Journal of…	1
More ▼

Federico, Pat-Anthony	3
Hambleton, Ronald K.	3
Stansfield, Charles W.	3
Straus, Murray A.	3
Conoyer, Sarah J.	2
Eignor, Daniel R.	2
Green, Kathy	2
Hamby, Sherry L.	2
Hendrickson, Amy	2
Liskin-Gasparro, Judith E.	2
Patterson, Brian	2
Sax, Gilbert	2
Schriesheim, Chester A.	2
Trevisan, Michael S.	2
Adam B. Wilson	1
Adams, Betty A. J.	1
Ahmed, Md. Kawser	1
Ahnberg, Jamie L.	1
Al-Jarf, Reima	1
Alberola Colomar, María Pilar	1
Alderson, J. Charles	1
Alemi, Minoo	1
Ali Hashemi	1
Ali, Syed Haris	1
More ▼

Journal Articles	151
Reports - Research	108
Reports - Descriptive	48
Reports - Evaluative	34
Speeches/Meeting Papers	33
Information Analyses	21
Guides - Non-Classroom	17
Opinion Papers	17
Tests/Questionnaires	11
Guides - Classroom - Teacher	8
Guides - General	5
Reference Materials -…	3
Books	2
Dissertations/Theses -…	1
ERIC Publications	1
Guides - Classroom - Learner	1
Non-Print Media	1
Numerical/Quantitative Data	1
Reference Materials - General	1
More ▼

Beck Depression Inventory	2
Peabody Picture Vocabulary…	2
SAT (College Admission Test)	2
Wechsler Intelligence Scale…	2
ACT Assessment	1
Armed Services Vocational…	1
Behavior Assessment System…	1
Canfield Learning Styles…	1
Computer Attitude Scale	1
Conflict Tactics Scale	1
Conners Rating Scales	1
Cornell Critical Thinking Test	1
Defining Issues Test	1
Developmental Indicators for…	1
Embedded Figures Test	1
English Proficiency Test	1
Graduate Management Admission…	1
Gregorc Style Delineator	1
International English…	1
Kaufman Brief Intelligence…	1
Kaufman Test of Educational…	1
Keymath Diagnostic Arithmetic…	1
Measures of Academic Progress	1
Minnesota Multiphasic…	1
Myers Briggs Type Indicator	1
More ▼