ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	8
Since 2017 (last 10 years)	9
Since 2007 (last 20 years)	11

Descriptor

Accuracy	12
Test Use	12
Test Items	6
Psychometrics	5
Test Construction	5
Elementary School Students	4
Scores	4
Test Interpretation	4
Achievement Tests	3
Screening Tests	3
Academic Achievement	2
Administrator Attitudes	2
Alignment (Education)	2
Caregiver Attitudes	2
Decision Making	2
Evaluation Methods	2
Item Analysis	2
Item Response Theory	2
Parent Attitudes	2
School Personnel	2
Selection Criteria	2
Standardized Tests	2
Statistical Analysis	2
Student Needs	2
Teacher Attitudes	2
More ▼

Source

Journal of Educational…	2
Discover Education	1
Educational Administration…	1
Educational Measurement:…	1
Grantee Submission	1
International Journal of…	1
Journal of Speech, Language,…	1
Language Testing in Asia	1
Office of Education, US…	1
School Mental Health	1
School Psychology Review	1
More ▼

Publication Type

Journal Articles	10
Reports - Research	9
Information Analyses	2
Reports - Descriptive	2
Historical Materials	1

Education Level

Elementary Education	5
Early Childhood Education	2
Elementary Secondary Education	2
Higher Education	2
Postsecondary Education	2
Primary Education	2
Grade 1	1
Grade 2	1
Grade 3	1
Grade 4	1
Grade 5	1
High Schools	1
Intermediate Grades	1
Middle Schools	1
Secondary Education	1
More ▼

Audience

Administrators	1
Counselors	1
Teachers	1

Location

France

Laws, Policies, & Programs

National Defense Education Act

Assessments and Surveys

Iowa Tests of Basic Skills

What Works Clearinghouse Rating

Showing all 12 results Save | Export

The Test Adaptation Reporting Standards (TARES): Reporting Test Adaptations

Peer reviewed

Direct link

Dragos Iliescu; Dave Bartram; Pia Zeinoun; Matthias Ziegler; Paula Elosua; Stephen Sireci; Kurt F. Geisinger; Aletta Odendaal; Maria Elena Oliveri; Jon Twing; Wayne Camara – International Journal of Testing, 2024

The "Test Adaptation Reporting Standards" (TARES), or "TARES statement" was developed to alleviate the problems arising from inadequate reporting of test adaptation procedures. The TARES contains a short preamble and a checklist, that comprises an evidence-based minimum set of information for reporting in test adaptations. The…

Descriptors: Test Use, Outcome Measures, Check Lists, Evidence Based Practice

Using Multiple Maximum Exposure Rates in Computerized Adaptive Testing

Peer reviewed

Direct link

Kylie Gorney; Mark D. Reckase – Journal of Educational Measurement, 2025

In computerized adaptive testing, item exposure control methods are often used to provide a more balanced usage of the item pool. Many of the most popular methods, including the restricted method (Revuelta and Ponsoda), use a single maximum exposure rate to limit the proportion of times that each item is administered. However, Barrada et al.…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Item Banks

Using Multilabel Neural Network to Score High-Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment

Peer reviewed

Direct link

Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025

Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…

Descriptors: Tests, Testing, Scores, Test Construction

The Intent of ChatGPT Usage and Its Robustness in Medical Proficiency Exams: A Systematic Review

Peer reviewed

Direct link

Tatiana Chaiban; Zeinab Nahle; Ghaith Assi; Michelle Cherfane – Discover Education, 2024

Background: Since it was first launched, ChatGPT, a Large Language Model (LLM), has been widely used across different disciplines, particularly the medical field. Objective: The main aim of this review is to thoroughly assess the performance of the distinct version of ChatGPT in subspecialty written medical proficiency exams and the factors that…

Descriptors: Medical Education, Accuracy, Artificial Intelligence, Computer Software

Spelling Errors in French Elementary School Students: A Linguistic Analysis

Peer reviewed

Direct link

Joye, Nelly; Broc, Lucie; Marshall, Chloë Ruth; Dockrell, Julie Elizabeth – Journal of Speech, Language, and Hearing Research, 2022

Purpose: This study offers the first description of misspellings across elementary school using the Phonological, Orthographic and Morphological Assessment of Spelling (POMAS), a linguistic framework based on Triple Word Form theory, adapted for French (POMAS-FR). It aims to test the "universality" of POMAS and its suitability to track…

Descriptors: Spelling, Elementary School Students, French, Error Patterns

Developing a Whole Child School Screening Instrument: Evaluating Perceived Usability as an Initial Step in Planning for Consequential Validity

Peer reviewed

Direct link

Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – School Mental Health, 2024

We are developing the Equitable Screening to Support Youth (ESSY) Whole Child Screener to address concerns prevalent in existing school-based screenings that impede goals to advance educational equity using universal screeners. Traditional assessment development does not include end users in the early development phases, instead relying on a…

Descriptors: Screening Tests, Psychometrics, Validity, Child Development

A Special Case of Brennan's Index for Tests That Aim to Select a Limited Number of Students: A Monte Carlo Simulation Study

Peer reviewed

Direct link

Arikan, Serkan; Aybek, Eren Can – Educational Measurement: Issues and Practice, 2022

Many scholars compared various item discrimination indices in real or simulated data. Item discrimination indices, such as item-total correlation, item-rest correlation, and IRT item discrimination parameter, provide information about individual differences among all participants. However, there are tests that aim to select a very limited number…

Descriptors: Monte Carlo Methods, Item Analysis, Correlation, Individual Differences

Developing a Whole Child School Screening Instrument: Evaluating Perceived Usability as an Initial Step in Planning for Consequential Validity

Peer reviewed

Direct link

Jessica B. Koslouski; Sandra M. Chafouleas; Amy Briesch; Jacqueline M. Caemmerer; Brittany Melo – Grantee Submission, 2024

Descriptors: Screening Tests, Usability, Decision Making, Validity

Investigating the Psychometric Properties of the Qiyas for L1 Arabic Language Test Using a Rasch Measurement Framework

Peer reviewed

Direct link

Al-Owidha, Amjed A. – Language Testing in Asia, 2018

Background: This study investigated the psychometric properties of the recently developed Qiyas for L1 Arabic language test using a Rasch measurement framework. Methods: Responses from 271 examinees were analyzed in this study. The test is hypothesized to involve one dominant factor that assesses four skills: reading comprehension, rhetorical…

Descriptors: Semitic Languages, Language Tests, Psychometrics, Reading Comprehension

A Bridge Too Far? Challenges in Evaluating Principal Effectiveness

Peer reviewed

Direct link

Fuller, Edward J.; Hollingworth, Liz – Educational Administration Quarterly, 2014

Purpose: The purpose of this article is to examine the assumptions underlying efforts to evaluate principal effectiveness in terms of student test scores, to review extant research on efforts to estimate principal effectiveness, and to discuss the appropriateness of including estimates of principal effectiveness in evaluations of principals.…

Descriptors: Principals, Administrator Effectiveness, Administrator Evaluation, Computation

Evaluating the Interpretations and Use of Curriculum-Based Measurement in Reading and Word Lists for Universal Screening in First and Second Grade

Peer reviewed
PDF on ERIC

Download full text

January, Stacy-Ann A.; Ardoin, Scott P.; Christ, Theodore J.; Eckert, Tanya L.; White, Mary Jane – School Psychology Review, 2016

Universal screening in elementary schools often includes administering curriculum-based measurement in reading (CBM-R); but in first grade, nonsense word fluency (NWF) and, to a lesser extent, word identification fluency (WIF) are used because of concerns that CBM-R is too difficult for emerging readers. This study used Kane's argument-based…

Descriptors: Curriculum Based Assessment, Reading Tests, Test Interpretation, Test Use

Interpretation of Test Results. Bulletin, 1964, No. 7. OE-25038

Download full text

McLaughlin, Kenneth F. – Office of Education, US Department of Health, Education, and Welfare, 1964

Under Title V, Guidance, Counseling, and Testing of the "National Defense Education Act of 1958," the Congress of the United States has recognized the value of tests as a tool which may be used to help make an early determination of the aptitudes and abilities of the students in U.S. schools. This bulletin attempts to explain the use and…

Descriptors: Educational History, School Guidance, Educational Testing, Aptitude Tests

Amy Briesch	2
Brittany Melo	2
Jacqueline M. Caemmerer	2
Jessica B. Koslouski	2
Sandra M. Chafouleas	2
Al-Owidha, Amjed A.	1
Aletta Odendaal	1
Amery D. Wu	1
Ardoin, Scott P.	1
Arikan, Serkan	1
Aybek, Eren Can	1
Broc, Lucie	1
Christ, Theodore J.	1
Dave Bartram	1
Dockrell, Julie Elizabeth	1
Dragos Iliescu	1
Eckert, Tanya L.	1
Fuller, Edward J.	1
Ghaith Assi	1
Hollingworth, Liz	1
Jake Stone	1
January, Stacy-Ann A.	1
Jon Twing	1
Joye, Nelly	1
Kurt F. Geisinger	1
More ▼