ERIC - Search Results

Showing all 6 results

Detecting Compromised Items with Response Times Using a Bayesian Change-Point Approach

Peer reviewed

Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025

Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…

Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment

A Consumers' Guide to Criterion-Referenced Test Reliability.

Peer reviewed

Berk, Ronald A. – Journal of Educational Measurement, 1980

A dozen different approaches that yield 13 reliability indices for criterion-referenced tests were identified and grouped into three categories: threshold loss function, squared-error loss function, and domain score estimation. Indices were evaluated within each category. (Author/RL)

Descriptors: Classification, Criterion Referenced Tests, Cutting Scores, Evaluation Methods

TIFAID: A Test Item Format Selection Job Aid for Use by Instructional Developers.

Llaneras, Robert E.; And Others – Performance and Instruction, 1993

Presents a job aid for determining test-item format called TIFAID (Test Item Format Job Aid), based on adequately constructed instructional objectives. The four sections of the job aid are described: (1) a task classification system; (2) task-related questions; (3) a flowchart; and (4) a tips and techniques guide. (Contains four references.) (LRW)

Descriptors: Classification, Educational Objectives, Evaluation Methods, Flow Charts

The Effect of Grouped versus Randomized Questionnaire Format on Scale Reliability and Validity: A Three-Study Investigation.

Peer reviewed

Schriesheim, Chester A.; And Others – Educational and Psychological Measurement, 1989

Three studies explored the effects of grouping versus randomized items in questionnaires on internal consistency and test-retest reliability with samples of 80, 80, and 100, respectively, university students and undergraduates. The 2 correlational and 1 experimental studies were reasonably consistent in demonstrating that neither format was…

Descriptors: Classification, College Students, Evaluation Methods, Higher Education

Tests of Computer Simulation Validity: What Do They Measure?

Peer reviewed

Stanislaw, Harold – Simulation and Games, 1986

Provides a framework for understanding the concept of validity and how it applies to simulation research. Extant tests of validity are discussed in terms of this framework, and a general schema for testing validity is proposed. (MBR)

Descriptors: Classification, Computer Simulation, Computer Software, Definitions

An Adaptive Testing Simulation for a Certifying Examination.

Reshetar, Rosemary A.; And Others – 1992

This study examined performance of a simulated computerized adaptive test that was designed to help direct the development of a medical recertification examination. The item pool consisted of 229 single-best-answer items from a random sample of 3,000 examinees, calibrated using the two-parameter logistic model. Examinees' responses were known. For…

Descriptors: Adaptive Testing, Classification, Computer Assisted Testing, Computer Simulation