ERIC - Search Results

Showing all 12 results

A Comparison of Yen's Q3 Coefficient and Rasch Testlet Modeling for Identifying Local Item Dependence: Evidence from Two Vocabulary Matching Tests

Peer reviewed

Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Assessment Quarterly, 2025

This article compares two methods for detecting local item dependence (LID): residual correlation examination and Rasch testlet modeling (RTM), in a commonly used 3:6 matching format and an extended matching test (EMT) format. The two formats are hypothesized to facilitate different levels of item dependency due to differences in the number of…

Descriptors: Comparative Analysis, Language Tests, Test Items, Item Analysis

The Effects of Reverse Items on Psychometric Properties and Respondents' Scale Scores According to Different Item Reversal Strategies

Peer reviewed

Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024

This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale in the research were developed: 10 positive items were integrated in the first form (Form-P) and five positive and five reverse items in the other two…

Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)

Do Different Devices Perform Equally Well with Different Numbers of Scale Points and Response Formats? A Test of Measurement Invariance and Reliability

Peer reviewed

Natalja Menold; Vera Toepoel – Sociological Methods & Research, 2024

Research on mixed devices in web surveys is in its infancy. Using a randomized experiment, we investigated device effects (desktop PC, tablet and mobile phone) for six response formats and four different numbers of scale points. N = 5,077 members of an online access panel participated in the experiment. An exact test of measurement invariance and…

Descriptors: Online Surveys, Handheld Devices, Telecommunications, Test Reliability

Selecting Technically Adequate Tests

Peer reviewed

Susan K. Johnsen – Gifted Child Today, 2024

The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…

Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity

Evaluating the Evaluators: A Comparative Study of AI and Teacher Assessments in Higher Education

Peer reviewed

Tugra Karademir Coskun; Ayfer Alper – Digital Education Review, 2024

This study aims to examine the potential differences between teacher evaluations and artificial intelligence (AI) tool-based assessment systems in university examinations. The research has evaluated a wide spectrum of exams including numerical and verbal course exams, exams with different assessment styles (project, test exam, traditional exam),…

Descriptors: Artificial Intelligence, Visual Aids, Video Technology, Tests

A Two-Level Adaptive Test Battery

Peer reviewed

Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024

A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…

Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability

Not Liking the Likert? A Rasch Analysis of Forced-Choice Format and Usefulness in Survey Design

Peer reviewed

Celeste Combrinck – SAGE Open, 2024

We have less time and focus than ever before, while the demand for attention is increasing. Therefore, it is no surprise that when answering questionnaires, we often choose to strongly agree or be neutral, producing problematic and unusable data. The current study investigated forced-choice (ipsative) format compared to the same questions on a…

Descriptors: Likert Scales, Test Format, Surveys, Design

Measuring Mathematical Skills in Early Childhood: A Systematic Review of the Psychometric Properties of Early Maths Assessments and Screeners

Peer reviewed

Laura A. Outhwaite; Pirjo Aunio; Jaimie Ka Yu Leung; Jo Van Herwegen – Educational Psychology Review, 2024

Successful early mathematical development is vital to children's later education, employment, and wellbeing outcomes. However, established measurement tools are infrequently used to (i) assess children's mathematical skills and (ii) identify children with or at risk of mathematical learning difficulties. In response, this pre-registered systematic…

Descriptors: Mathematics Tests, Screening Tests, Mathematics Skills, At Risk Students

Innovations in Assessing Students' Digital Literacy Skills in Learning Science: Effective Multiple Choice Closed-Ended Tests Using Rasch Model

Peer reviewed

Fitria Lafifa; Dadan Rosana – Turkish Online Journal of Distance Education, 2024

This research aims to develop a multiple-choice closed-ended test to assess and evaluate students' digital literacy skills. The sample in this study comprised students at MTsN 1 Blitar City who were selected using a purposive sampling technique. The test was also validated by experts, namely 2 Doctors of Physics and Science from Yogyakarta State…

Descriptors: Educational Innovation, Student Evaluation, Digital Literacy, Multiple Choice Tests

New York State Testing Program: Grades 6 and 7 English Language Arts Paper-Based Tests. Teacher's Directions. Spring 2024

New York State Education Department, 2024

The New York State Education Department (NYSED) has a partnership with NWEA for the development of the 2024 Grades 3-8 English Language Arts Tests. Teachers from across the State work with NYSED in a variety of activities to ensure the validity and reliability of the New York State Testing Program (NYSTP). The 2024 Grades 6 and 7 English Language…

Descriptors: Language Tests, Test Format, Language Arts, English Instruction

New York State Testing Program: English Language Arts, Mathematics, and Science Tests. School Administrator's Manual, 2024. Grades 3-8

New York State Education Department, 2024

The instructions in this manual explain the responsibilities of school administrators for the New York State Testing Program (NYSTP) Grades 3-8 English Language Arts, Mathematics, and Grades 5 & 8 Science Tests. School administrators must be thoroughly familiar with the contents of the manual, and the policies and procedures must be followed…

Descriptors: Testing Programs, Language Arts, Mathematics Tests, Science Tests

Charting the Future of Assessments. Research Report. ETS RR-24-13

Peer reviewed

Patrick Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Report Series, 2024

Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…

Descriptors: Assessment Literacy, Testing, Test Bias, Test Construction
