ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	3
Since 2017 (last 10 years)	6
Since 2007 (last 20 years)	13

Descriptor

Difficulty Level	19
Test Construction	19
Test Content	19
Test Items	16
Foreign Countries	7
Item Analysis	5
Item Response Theory	5
Psychometrics	5
English (Second Language)	4
Test Validity	4
Educational Assessment	3
Evaluation Methods	3
Language Tests	3
Mathematics Tests	3
Scoring	3
Second Language Learning	3
Statistical Analysis	3
Student Evaluation	3
Test Use	3
College Entrance Examinations	2
Comparative Analysis	2
Correlation	2
Data Analysis	2
Equated Scores	2
Grade 12	2

Source

National Assessment Governing…	2
Applied Measurement in…	1
Assessment in Education:…	1
Canadian Journal of Special…	1
ETS Research Report Series	1
Educational Measurement:…	1
International Journal of…	1
International Multilingual…	1
Journal of Applied Testing…	1
Journal of Chemical Education	1
Ministerial Council on…	1
PASAA: Journal of Language…	1
Practical Assessment,…	1
ProQuest LLC	1

Publication Type

Journal Articles	11
Reports - Descriptive	7
Reports - Evaluative	6
Reports - Research	4
Guides - Non-Classroom	2
Dissertations/Theses -…	1
Information Analyses	1
Numerical/Quantitative Data	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Higher Education	6
Postsecondary Education	6
Elementary Education	4
Secondary Education	3
Elementary Secondary Education	2
Grade 12	2
Grade 4	2
Grade 8	2
High Schools	2
Intermediate Grades	2
Junior High Schools	2
Middle Schools	2
Early Childhood Education	1
Grade 6	1
Kindergarten	1
Primary Education	1

Audience

Teachers	4
Practitioners	2
Policymakers	1
Researchers	1

Location

Australia	1
Canada	1
Indonesia	1
Nigeria	1
Oman	1

Assessments and Surveys

Graduate Record Examinations	2
National Assessment of…	2
Bender Visual Motor Gestalt…	1
Goodenough Harris Drawing Test	1
Program for International…	1

Showing 1 to 15 of 19 results

Idea-Sharing Crafting Item Difficulty in TOEFL iBT Listening Tests

Peer reviewed

Alan Shaw – PASAA: Journal of Language Teaching and Learning in Thailand, 2023

Although the TOEFL iBT Listening test is sometimes used for other purposes, it was designed primarily for use as a college entrance examination. Item difficulty in TOEFL iBT Listening tests is the product of interactions between two sets of complex relationships: 1) relationships among numerous item characteristics themselves, and 2) relationships…

Descriptors: English (Second Language), Second Language Instruction, Listening Skills, Language Tests

From Investigating the Alignment of a Priori Item Characteristics Based on the CTT and Four-Parameter Logistic (4-PL) IRT Models to Further Exploring the Comparability of the Two Models

Peer reviewed

Agus Santoso; Heri Retnawati; Timbul Pardede; Ibnu Rafi; Munaya Nikma Rosyada; Gulzhaina K. Kassymova; Xu Wenxin – Practical Assessment, Research & Evaluation, 2024

The test blueprint is important in test development, where it guides the test item writer in creating test items according to the desired objectives and specifications or characteristics (so-called a priori item characteristics), such as the level of item difficulty in the category and the distribution of items based on their difficulty level.…

Descriptors: Foreign Countries, Undergraduate Students, Business English, Test Construction

Test Item Writing Competence among Oman College of Health Sciences Nurse Faculty

Mohammed Ambusaidi – ProQuest LLC, 2022

There is an increased demand on nursing faculty to provide quality teaching and assessment. Nursing faculty are required to ensure accurate assessment of learning through testing and outcome measurement that are critical elements of the evaluation process. Likewise, nursing faculty should implement a logical evaluation system. However, the…

Descriptors: Nursing Education, College Faculty, Test Construction, Test Validity

On the Choice of Anchor Tests in Equating

Peer reviewed

Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018

The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…

Descriptors: Test Content, Difficulty Level, Test Items, Test Construction

Mathematics Framework for the 2019 National Assessment of Educational Progress

National Assessment Governing Board, 2019

Since 1973, the National Assessment of Educational Progress (NAEP) has gathered information about student achievement in mathematics. The NAEP assessment in mathematics has two components that differ in purpose. One assessment measures long-term trends in achievement among 9-, 13-, and 17-year-old students by using the same basic design each time.…

Descriptors: National Competency Tests, Mathematics Achievement, Grade 4, Grade 8

Assessment Accommodations for Emergent Bilinguals in Mainstream Classroom Assessments: A Targeted Literature Review

Peer reviewed

Yang, Xuexue – International Multilingual Research Journal, 2020

Despite the importance of assessment accommodations, little is known about their use in the context of classroom assessments. To provide guidance for teachers on how to best support their emergent bilinguals during classroom assessments, there may be ideas from large-scale assessments that can be used in the classrooms. This article, a targeted…

Descriptors: Testing Accommodations, Measurement, Bilingualism, Second Language Learning

Using a Table of Specifications to Improve Teacher-Constructed Traditional Tests: An Experimental Design

Peer reviewed

DiDonato-Barnes, Nicole; Fives, Helenrose; Krause, Emily S. – Assessment in Education: Principles, Policy & Practice, 2014

We investigated if instruction on a Table of Specifications (TOS) would influence the quality of classroom test construction. Results should prove informative for educational researchers, teacher educators, and practising teachers interested in evidenced-based strategies that may improve assessment-related practices. Fifty-three college…

Descriptors: Teacher Made Tests, Test Construction, Tables (Data), Alignment (Education)

Assessment Engineering Task Model Maps, Task Models and Templates as a New Way to Develop and Implement Test Specifications

Peer reviewed

Luecht, Richard M. – Journal of Applied Testing Technology, 2013

Assessment engineering is a new way to design and implement scalable, sustainable and ideally lower-cost solutions to the complexities of designing and developing tests. It represents a merger of sorts between cognitive task modeling and engineering design principles--a merger that requires some new thinking about the nature of score scales, item…

Descriptors: Engineering, Test Construction, Test Items, Models

Guide to Developing High-Quality, Reliable, and Valid Multiple-Choice Assessments

Peer reviewed

Towns, Marcy H. – Journal of Chemical Education, 2014

Chemistry faculty members are highly skilled in obtaining, analyzing, and interpreting physical measurements, but often they are less skilled in measuring student learning. This work provides guidance for chemistry faculty from the research literature on multiple-choice item development in chemistry. Areas covered include content, stem, and…

Descriptors: Multiple Choice Tests, Test Construction, Psychometrics, Test Items

Evaluation of Northwest University, Kano Post-UTME Test Items Using Item Response Theory

Peer reviewed

Bichi, Ado Abdu; Hafiz, Hadiza; Bello, Samira Abdullahi – International Journal of Evaluation and Research in Education, 2016

High-stakes testing is used for the purposes of providing results that have important consequences. Validity is the cornerstone upon which all measurement systems are built. This study applied the Item Response Theory principles to analyse Northwest University Kano Post-UTME Economics test items. The developed fifty (50) economics test items were…

Descriptors: Item Response Theory, Test Items, Difficulty Level, Statistical Analysis

Mathematics Framework for the 2015 National Assessment of Educational Progress

National Assessment Governing Board, 2014

Since 1973, the National Assessment of Educational Progress (NAEP) has gathered information about student achievement in mathematics. Results of these periodic assessments, produced in print and web-based formats, provide valuable information to a wide variety of audiences. They inform citizens about the nature of students' comprehension of the…

Descriptors: National Competency Tests, Mathematics Achievement, Mathematics Skills, Grade 4

Item Position and Item Difficulty Change in an IRT-Based Common Item Equating Design

Peer reviewed

Meyers, Jason L.; Miller, G. Edward; Way, Walter D. – Applied Measurement in Education, 2009

In operational testing programs using item response theory (IRT), item parameter invariance is threatened when an item appears in a different location on the live test than it did when it was field tested. This study utilizes data from a large state's assessments to model change in Rasch item difficulty (RID) as a function of item position change,…

Descriptors: Test Items, Test Content, Testing Programs, Simulation

Content Characteristics of GRE Analytical Reasoning Items. GRE Board Professional Report No. 84-14P.

Chalifour, Clark; Powers, Donald E. – 1988

In actual test development practice, the number of test items that must be developed and pretested is typically greater, and sometimes much greater, than the number eventually judged suitable for use in operational test forms. This has proven to be especially true for analytical reasoning items, which currently form the bulk of the analytical…

Descriptors: Coding, Difficulty Level, Higher Education, Test Construction

Basic Precepts in Test Construction: Recommendations from Various Measurement Textbooks.

Mathieu, Cindy K. – 1997

This paper presents six steps in test construction generally recommended by measurement textbook authors. The focus is primarily on paper-and-pencil achievement tests as used by class instructors, although the discussion touches on the construction of other types of assessment. The six steps are: (1) determine the test purpose; (2) determine the…

Descriptors: Achievement Tests, Difficulty Level, Measurement Techniques, Selection

Evaluating the Test Results.

Kitao, Kenji; Kitao, S. Kathleen – 1996

After tests are administered, they are scored and the scores are given back to the students. If the real purpose of the test is to improve student learning, simply returning the scores is not sufficient. The first step in evaluating test results is to be sure that the test has tested the intended concepts and content. Calculating the mean and the…

Descriptors: Difficulty Level, English (Second Language), Evaluation Methods, Feedback


Author

Kitao, Kenji	2
Kitao, S. Kathleen	2
Agus Santoso	1
Alan Shaw	1
Bello, Samira Abdullahi	1
Bichi, Ado Abdu	1
Chalifour, Clark	1
DiDonato-Barnes, Nicole	1
Donovan, Jenny	1
Fives, Helenrose	1
Graf, Edith Aurora	1
Gulzhaina K. Kassymova	1
Hafiz, Hadiza	1
Heri Retnawati	1
Hutton, Penny	1
Ibnu Rafi	1
Krause, Emily S.	1
Lawless, René	1
Lennon, Melissa	1
Luecht, Richard M.	1
Mathieu, Cindy K.	1
Meyers, Jason L.	1
Miller, G. Edward	1
Mohammed Ambusaidi	1