ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	5
Since 2007 (last 20 years)	9

Descriptor

Test Items	15
Test Reliability	15
Test Construction	13
Test Validity	12
Standard Setting	10
Scoring	8
Psychometrics	7
Academic Standards	5
Criterion Referenced Tests	5
Cutting Scores	5
English	5
Standard Setting (Scoring)	5
Test Results	5
Difficulty Level	4
Item Response Theory	4
Mathematics Tests	4
Middle School Students	4
Science Tests	4
Summative Evaluation	4
Benchmarking	3
Career Readiness	3
College Readiness	3
Elementary School Students	3
Error of Measurement	3
Evaluation Methods	3
More ▼

Source

Nebraska Department of…	4
New Mexico Public Education…	2
English Language Teaching	1
Evaluation and the Health…	1
International Journal of…	1
Journal of Applied Testing…	1
Online Submission	1
Practical Assessment,…	1

Publication Type

Numerical/Quantitative Data	6
Reports - Evaluative	6
Journal Articles	5
Reports - Research	5
Reports - Descriptive	3
Guides - Classroom - Teacher	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Secondary Education	5
Junior High Schools	4
Middle Schools	4
Elementary Education	3
Elementary Secondary Education	2
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Higher Education	1
Intermediate Grades	1
Postsecondary Education	1
More ▼

Audience

Practitioners

Location

Nebraska	4
New Mexico	2
Europe	1
Nigeria	1
Thailand	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 15 results Save | Export

2023-2024 NSCAS Growth: English Language Arts, Mathematics, and Science Technical Report

Download full text

Nebraska Department of Education, 2024

The Nebraska Student-Centered Assessment System (NSCAS) is a statewide assessment system that embodies Nebraska's holistic view of students and helps them prepare for success in postsecondary education, career, and civic life. It uses multiple measures throughout the year to provide educators and decision-makers at all levels with the insights…

Descriptors: Student Evaluation, Evaluation Methods, Elementary School Students, Middle School Students

The Development of STEP, the CEFR-Based English Proficiency Test

Peer reviewed
PDF on ERIC

Download full text

Sridhanyarat, Kietnawin; Pathong, Supakarn; Suranakkharin, Todsapon; Ammaralikit, Amornrat – English Language Teaching, 2021

This study aimed at developing the Silpakorn Test of English Proficiency (STEP), in alignment with the Common European Framework of Reference for Languages (CEFR), and in accordance with the theoretical framework established by Alderson et al. (2006). Four major steps were involved in the test construction. First, English language lecturers who…

Descriptors: Language Tests, Language Proficiency, Second Language Learning, Second Language Instruction

Spring 2021 NSCAS Phase I Pilot ELA, Mathematics, and Science Technical Report

Download full text

Nebraska Department of Education, 2021

This technical report documents the processes and procedures implemented to support the Spring 2021 Nebraska Student-Centered Assessment System (NSCAS) Phase I Pilot in English Language Arts (ELA), Mathematics, and Science assessments by NWEA® under the supervision of the Nebraska Department of Education (NDE). The technical report shows how the…

Descriptors: Psychometrics, Standard Setting, English, Language Arts

Spring 2020 NSCAS General Summative ELA, Mathematics, and Science Technical Report

Download full text

Nebraska Department of Education, 2020

The Spring 2020 Nebraska Student-Centered Assessment System (NSCAS) General Summative testing was cancelled due to COVID-19. This technical report documents the processes and procedures that had been implemented to support the Spring 2020 assessments prior to the cancellation. The following sections are presented in this technical report: (1)…

Descriptors: English, Language Arts, Mathematics Tests, Science Tests

Spring 2019 NSCAS Summative ELA, Mathematics, and Science Technical Report

Download full text

Nebraska Department of Education, 2019

This technical report documents the processes and procedures implemented to support the Spring 2019 Nebraska Student-Centered Assessment System (NSCAS) General Summative English Language Arts (ELA), Mathematics, and Science assessments by NWEA® under the supervision of the Nebraska Department of Education (NDE). The technical report shows how the…

Descriptors: English, Language Arts, Summative Evaluation, Mathematics Tests

Evaluating the Appropriateness and Use of Domain Critical Errors

Peer reviewed
PDF on ERIC

Download full text

Buckendahl, Chad W.; Davis-Becker, Susan L. – Practical Assessment, Research & Evaluation, 2012

The consequences associated with the uses and interpretations of scores for many credentialing testing programs have important implications for a range of stakeholders. Within licensure settings specifically, results from examination programs are often one of the final steps in the process of assessing whether individuals will be allowed to enter…

Descriptors: Licensing Examinations (Professions), Test Items, Dentistry, Minimum Competency Testing

Evaluation of Northwest University, Kano Post-UTME Test Items Using Item Response Theory

Peer reviewed
PDF on ERIC

Download full text

Bichi, Ado Abdu; Hafiz, Hadiza; Bello, Samira Abdullahi – International Journal of Evaluation and Research in Education, 2016

High-stakes testing is used for the purposes of providing results that have important consequences. Validity is the cornerstone upon which all measurement systems are built. This study applied the Item Response Theory principles to analyse Northwest University Kano Post-UTME Economics test items. The developed fifty (50) economics test items was…

Descriptors: Item Response Theory, Test Items, Difficulty Level, Statistical Analysis

Combining the Best of Two Standard Setting Methods: The Ordered Item Booklet Angoff

Peer reviewed

Direct link

Smith, Russell W.; Davis-Becker, Susan L.; O'Leary, Lisa S. – Journal of Applied Testing Technology, 2014

This article describes a hybrid standard setting method that combines characteristics of the Angoff (1971) and Bookmark (Mitzel, Lewis, Patz & Green, 2001) methods. The proposed approach utilizes strengths of each method while addressing weaknesses. An ordered item booklet, with items sorted based on item difficulty, is used in combination…

Descriptors: Standard Setting, Difficulty Level, Test Items, Rating Scales

Item Analysis for Criterion-Referenced Tests

Download full text

McCowan, Richard J.; McCowan, Sheila C. – Online Submission, 1999

This paper describes major concepts related to item analysis for criterion-referenced tests including validity, reliability, item difficulty, and item discrimination, particularly in relation to criterion-referenced tests. The paper discussed how these concepts can be used to revise and improve items and listed suggestions regarding general…

Descriptors: Criterion Referenced Tests, Standard Setting, Item Analysis, Item Response Theory

Setting, Evaluating, and Maintaining Certification Standards with the Rasch Model.

Peer reviewed

Grosse, Martin E.; Wright, Benjamin D. – Evaluation and the Health Professions, 1986

Based on the standard setting procedures or the American Board of Preventive Medicine for their Core Test, this article describes how Rasch measurement can facilitate using test content judgments in setting a standard. Rasch measurement can then be used to evaluate and improve the precision of the standard and to hold it constant across time.…

Descriptors: Certification, Criterion Referenced Tests, Difficulty Level, Health Personnel

Assessing Inconsistencies in Standard Setting with the Angoff or Nedelsky Technique.

Download full text

van der Linden, Wim J. – 1982

A latent trait method is presented to investigate the possibility that Angoff or Nedelsky judges specify inconsistent probabilities in standard setting techniques for objectives-based instructional programs. It is suggested that judges frequently specify a low probability of success for an easy item but a large probability for a hard item. The…

Descriptors: Criterion Referenced Tests, Cutting Scores, Error of Measurement, Interrater Reliability

Standard Setting Study of the UT Austin Test for Credit in Japanese: Fall 1991 through Spring 1993. Research Bulletin 93-2.

Download full text

Fitzpatrick, Steven J.; And Others – 1994

In 1991 the Measurement and Evaluation Center of the University of Texas at Austin was asked to develop a test for credit by examination in four lower division courses in Japanese. The test (in Japanese) was constructed from locally developed items provided by instructors of Japanese. The developed test consisted of 80 items distributed among…

Descriptors: College Students, Cutting Scores, Equivalency Tests, Higher Education

New Mexico Standards-Based Assessment Technical Report: Spring 2007 Administration

Download full text

New Mexico Public Education Department, 2007

The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of and technical characteristics of the 2007 NMSBA. The 2007 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Summary of student performance; (4) Statistical analyses of item and…

Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring

A Practitioner's Guide to Criterion-Referenced Test Development, Validation, and Test Score Usage (Second Edition). Laboratory of Psychometric and Evaluation Research Report No. 70.

Download full text

Hambleton, Ronald K.; Eignor, Daniel R. – 1979

This instructional training package introduces practitioners to methods for developing, validating, using, and reporting criterion-referenced tests. It provides a comprehensive presentation of criterion-referenced testing technology. The package emphasizes the most recent substantive and technological advances in the field that are both important…

Descriptors: Criterion Referenced Tests, Cutting Scores, Evaluation Methods, Mastery Tests

New Mexico Standards Based Assessment (NMSBA) Technical Report: 2006 Spring Administration

Download full text

Griph, Gerald W. – New Mexico Public Education Department, 2006

The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of and technical characteristics of the 2006 NMSBA. The 2006 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Calibration, scaling, and equating procedures; (4) Standard setting;…

Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring

Davis-Becker, Susan L.	2
Ammaralikit, Amornrat	1
Bello, Samira Abdullahi	1
Bichi, Ado Abdu	1
Buckendahl, Chad W.	1
Eignor, Daniel R.	1
Fitzpatrick, Steven J.	1
Griph, Gerald W.	1
Grosse, Martin E.	1
Hafiz, Hadiza	1
Hambleton, Ronald K.	1
McCowan, Richard J.	1
McCowan, Sheila C.	1
O'Leary, Lisa S.	1
Pathong, Supakarn	1
Smith, Russell W.	1
Sridhanyarat, Kietnawin	1
Suranakkharin, Todsapon	1
Wright, Benjamin D.	1
van der Linden, Wim J.	1
More ▼