Showing 1 to 15 of 40 results
Peer reviewed
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2022
Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores, and hence to incomplete data, on credentialing tests such as the United States Medical Licensing Examination. Feinberg compared four approaches for reporting pass-fail decisions to the examinees with incomplete data on credentialing…
Descriptors: Testing Problems, High Stakes Tests, Credentials, Test Items
Peer reviewed
Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024
In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…
Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment
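The treatment of missing responses described in this abstract can be illustrated with a minimal sketch (hypothetical data, not from the study): scoring skipped items as incorrect versus dropping them yields different classical difficulty estimates for the same item.

```python
# Sketch: effect of missing-response treatment on an item's difficulty
# (proportion-correct). Hypothetical responses: 1 = correct, 0 = incorrect,
# None = skipped.
responses = [1, 0, None, 1, None, 0, 1, 1]

# Treatment A: score skipped responses as incorrect.
as_incorrect = [r if r is not None else 0 for r in responses]
p_incorrect = sum(as_incorrect) / len(as_incorrect)  # 4/8 = 0.5

# Treatment B: drop skipped responses from the calculation.
observed = [r for r in responses if r is not None]
p_ignored = sum(observed) / len(observed)  # 4/6 ≈ 0.667
```

The gap between the two estimates grows with the skip rate, which is why the choice of treatment propagates into item difficulty and ability estimation.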
Peer reviewed
Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…
Descriptors: Standardized Tests, Test Items, Test Validity, Scores
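Sequential monitoring of a reused item can be sketched with a generic change-point detector; the one-sided CUSUM below is a standard illustration, not the article's specific procedure, and all numbers are hypothetical.

```python
# Sketch: one-sided CUSUM on an item's per-administration p-value,
# flagging administrations where cumulative downward drift from the
# historical target exceeds a decision limit h.
def cusum_flags(values, target, k=0.02, h=0.06):
    """Flag each administration once accumulated drift below target exceeds h."""
    s, flags = 0.0, []
    for v in values:
        s = max(0.0, s + (target - v) - k)  # accumulate drift, allowance k
        flags.append(s > h)
    return flags

p_values = [0.70, 0.69, 0.71, 0.55, 0.54, 0.53]  # hypothetical item p-values
cusum_flags(p_values, target=0.70)  # flags the last three administrations
```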
Peer reviewed
PDF on ERIC
Camenares, Devin – International Journal for the Scholarship of Teaching and Learning, 2022
Balancing assessment of learning outcomes with the expectations of students is a perennial challenge in education. Difficult exams, in which many students perform poorly, exacerbate this problem and can inspire a wide variety of interventions, such as a grading curve. However, addressing poor performance can sometimes distort or inflate grades and…
Descriptors: College Students, Student Evaluation, Tests, Test Items
Peer reviewed
PDF on ERIC
Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021
In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…
Descriptors: Testing, Distance Education, Comparative Analysis, Test Items
Peer reviewed
PDF on ERIC
Fu, Jianbin; Qu, Yanxuan – ETS Research Report Series, 2018
Various subscore estimation methods that use auxiliary information to improve subscore accuracy and stability have been developed. This report provides a review of various subscore estimation methods described in the literature. The methodology of each method is described, then research studies on these subscore estimation methods are summarized.…
Descriptors: Scores, Evaluation Methods, Item Response Theory, Test Items
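The simplest estimator in the family this report reviews is Kelley's regressed score, which shrinks an observed subscore toward the group mean in proportion to its reliability. A minimal sketch with hypothetical numbers:

```python
# Sketch: Kelley's formula for a true-score estimate of a subscore.
# Low reliability pulls the estimate strongly toward the group mean;
# reliability of 1.0 leaves the observed subscore unchanged.
def kelley_estimate(observed, group_mean, reliability):
    """Regress the observed subscore toward the group mean."""
    return group_mean + reliability * (observed - group_mean)

kelley_estimate(18.0, 12.0, 0.6)  # -> 15.6
```

The auxiliary-information methods surveyed in the report extend this idea by borrowing strength from other subscores or the total score.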
Peer reviewed
Sinharay, Sandip – Journal of Educational Measurement, 2017
Person-fit assessment (PFA) is concerned with uncovering atypical test performance as reflected in the pattern of scores on individual items on a test. Existing person-fit statistics (PFSs) include both parametric and nonparametric statistics. Comparison of PFSs has been a popular research topic in PFA, but almost all comparisons have employed…
Descriptors: Goodness of Fit, Testing, Test Items, Scores
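One classic nonparametric person-fit statistic is the count of Guttman errors: pairs where an examinee answers a harder item correctly but an easier item incorrectly. The sketch below illustrates that idea with hypothetical data; it is not the specific set of statistics compared in the article.

```python
# Sketch: count Guttman errors in one examinee's score pattern.
# scores[i] is 0/1 for item i; difficulties[i] is item i's difficulty.
def guttman_errors(scores, difficulties):
    """Count (easier item wrong, harder item right) pairs."""
    errors = 0
    n = len(scores)
    for i in range(n):
        for j in range(n):
            if difficulties[i] < difficulties[j] and scores[i] == 0 and scores[j] == 1:
                errors += 1
    return errors

# Hypothetical aberrant pattern: misses the two easiest items,
# answers the two hardest correctly.
guttman_errors([0, 0, 1, 1], [0.2, 0.5, 1.0, 1.5])  # -> 4
```

A perfectly consistent (Guttman) pattern yields zero errors; large counts suggest atypical test performance.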
Peer reviewed
Guo, Hongwen; Rios, Joseph A.; Haberman, Shelby; Liu, Ou Lydia; Wang, Jing; Paek, Insu – Applied Measurement in Education, 2016
Unmotivated test takers using rapid guessing in item responses can affect validity studies and teacher and institution performance evaluation negatively, making it critical to identify these test takers. The authors propose a new nonparametric method for finding response-time thresholds for flagging item responses that result from rapid-guessing…
Descriptors: Guessing (Tests), Reaction Time, Nonparametric Statistics, Models
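Applying a response-time threshold to flag rapid guesses can be sketched as follows. The threshold rule here (a fixed fraction of the median item time) is a common heuristic used only for illustration; it is not the nonparametric threshold-finding method the authors propose, and the times are hypothetical.

```python
# Sketch: flag responses faster than a fraction of the median item time
# as likely rapid guesses.
import statistics

def flag_rapid_guesses(times, fraction=0.10):
    """Return booleans marking responses below the time threshold."""
    threshold = fraction * statistics.median(times)
    return [t < threshold for t in times]

times = [42.0, 38.5, 2.1, 40.2, 1.5, 36.8]  # seconds per item (hypothetical)
flag_rapid_guesses(times)  # flags the 2.1 s and 1.5 s responses
```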
Peer reviewed
Penfield, Randall D.; Gattamorta, Karina; Childs, Ruth A. – Educational Measurement: Issues and Practice, 2009
Traditional methods for examining differential item functioning (DIF) in polytomously scored test items yield a single item-level index of DIF and thus provide no information concerning which score levels are implicated in the DIF effect. To address this limitation of DIF methodology, the framework of differential step functioning (DSF) has…
Descriptors: Test Bias, Test Items, Evaluation Methods, Scores
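The step-level view that distinguishes DSF from a single item-level DIF index can be sketched with a log odds ratio computed separately at each score step. The counts below are hypothetical and the statistic is a generic illustration, not the article's specific DSF estimator.

```python
# Sketch: differential step functioning examines each score step of a
# polytomous item separately. Here, a log odds ratio of clearing one
# step for the reference vs. focal group.
import math

def step_log_odds_ratio(ref_pass, ref_fail, foc_pass, foc_fail):
    """Log odds ratio of clearing a score step, reference vs. focal group."""
    return math.log((ref_pass / ref_fail) / (foc_pass / foc_fail))

# Step 1 shows little DIF; step 2 favors the reference group --
# a pattern a single item-level index would average away.
step_log_odds_ratio(80, 20, 78, 22)  # ~ 0.12
step_log_odds_ratio(60, 40, 40, 60)  # ~ 0.81
```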
Townsend, Michael A. R.; Mahoney, Peggy – 1980
The roles of humor and anxiety in test performance were investigated. Measures of trait anxiety, state anxiety and achievement were obtained on a sample of undergraduate students; the A-Trait and A-State scales of the State-Trait Anxiety Inventory were used. Half of the students received additional humorous items in the achievement test. The…
Descriptors: Achievement Tests, Anxiety, Higher Education, Humor
Haenn, Joseph F. – 1981
Procedures for conducting functional level testing have been available for use by practitioners for some time. However, the Title I Evaluation and Reporting System (TIERS), developed in response to the Education Amendments of 1974 to the Elementary and Secondary Education Act (ESEA), has provided the impetus for widespread adoption of this…
Descriptors: Achievement Tests, Difficulty Level, Scores, Scoring
Curry, Allen R.; Riegel, N. Blyth – 1978
The Rasch model of test theory is described in general terms, compared with latent trait theory, and shown to have interesting applications for the measurement of affective as well as cognitive traits. Three assumptions of the Rasch model are stated to support the conclusion that calibration of the items and tests is independent of the examinee…
Descriptors: Affective Measures, Goodness of Fit, Item Analysis, Latent Trait Theory
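The Rasch model described in this entry has a compact form: the probability of a correct response depends only on the difference between person ability and item difficulty, both on the logit scale. A minimal sketch:

```python
# Sketch of the Rasch (1PL) model: P(correct) is a logistic function of
# the difference between person ability (theta) and item difficulty (b).
import math

def rasch_p(theta, b):
    """Probability of a correct response under the Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

rasch_p(0.0, 0.0)  # ability equal to difficulty -> 0.5
rasch_p(1.0, 0.0)  # more able examinee -> higher probability (~0.731)
```

Because ability and difficulty enter only through their difference, item calibration does not depend on which examinees happen to take the test, which is the separability property the abstract alludes to.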
Klein, Stephen P.; Bolus, Roger – 1983
One way to reduce the likelihood of one examinee copying another's answers on large-scale tests that require all examinees to answer the same set of questions is to use multiple test forms that differ in item ordering. This study was conducted to determine whether varying the sequence in which blocks of items were presented to…
Descriptors: Adults, Cheating, Cost Effectiveness, Item Analysis
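Building forms that differ only in block ordering can be sketched with a simple cyclic rotation (block labels hypothetical), so neighboring examinees see the same questions in different sequences:

```python
# Sketch: generate multiple test forms as cyclic rotations of item blocks.
def rotated_forms(blocks, n_forms):
    """Return n_forms orderings, each a cyclic rotation of the blocks."""
    return [blocks[i:] + blocks[:i] for i in range(n_forms)]

rotated_forms(["A", "B", "C"], 3)
# [['A', 'B', 'C'], ['B', 'C', 'A'], ['C', 'A', 'B']]
```

Whether such reordering alters item difficulty is exactly the empirical question the study addresses.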
Mills, Craig N.; Hambleton, Ronald K. – 1980
General guidelines exist for reporting and interpreting test scores, but there are shortcomings in the available technology, especially when applied to criterion-referenced tests. Concerns that have been expressed in the educational measurement literature address the uses of test scores, the manner of reporting scores, limited testing knowledge…
Descriptors: Criterion Referenced Tests, Educational Objectives, Elementary Secondary Education, Guidelines
Peer reviewed
Wilson, Sandra Meachan; Hiscox, Michael D. – Educational Measurement: Issues and Practice, 1984
This article presents a model that local school districts can use to reanalyze standardized test results and obtain a more valid assessment of local learning objectives. The model can be used to identify strengths and weaknesses of existing programs as well as of individual students. (EGS)
Descriptors: Educational Objectives, Item Analysis, Models, School Districts