ERIC - Search Results

Showing 1 to 15 of 19 results

The Impact of Scoring Later on Mixed Format Adaptive Testing

Jing Ma – ProQuest LLC, 2024

This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, and numbers and locations of polytomous items. Results showed that while…

Descriptors: Scoring, Adaptive Testing, Test Items, Classification

Development of a Multiple-Choice Problem-Solving Categorization Test for Assessment of Student Knowledge Structure

Peer reviewed

Chen, Qingwei; Zhu, Guangtian; Liu, Qiaoyi; Han, Jing; Fu, Zhao; Bao, Lei – Physical Review Physics Education Research, 2020

Problem-solving categorization tasks have been well studied and used as an effective tool for assessment of student knowledge structure. In this study, a traditional free-response categorization test has been modified into a multiple-choice format, and the effectiveness of this new assessment is evaluated. Through randomized testing with Chinese…

Descriptors: Foreign Countries, Test Construction, Multiple Choice Tests, Problem Solving

The Influence of Passage Cohesion on Cloze Test Item Difficulty

Peer reviewed

Jonathan Trace – Language Teaching Research Quarterly, 2023

The role of context in cloze tests has long been seen as both a benefit as well as a complication in their usefulness as a measure of second language comprehension (Brown, 2013). Passage cohesion, in particular, would seem to have a relevant and important effect on the degree to which cloze items function and the interpretability of performances…

Descriptors: Language Tests, Cloze Procedure, Connected Discourse, Test Items

Does Maximizing Information at the Cut Score Always Maximize Classification Accuracy and Consistency?

Peer reviewed

Wyse, Adam E.; Babcock, Ben – Journal of Educational Measurement, 2016

A common suggestion made in the psychometric literature for fixed-length classification tests is that one should design tests so that they have maximum information at the cut score. Designing tests in this way is believed to maximize the classification accuracy and consistency of the assessment. This article uses simulated examples to illustrate…

Descriptors: Cutting Scores, Psychometrics, Test Construction, Classification

Multiple Choice Testing for Introductory Astronomy: Design Theory Using Bloom's Taxonomy

Peer reviewed

Young, Arthur; Shawl, Stephen J. – Astronomy Education Review, 2013

Professors who teach introductory astronomy to students not majoring in science desire them to comprehend the concepts and theories that form the basis of the science. They are usually less concerned about the myriad of detailed facts and information that accompanies the science. As such, professors prefer to test the students for such…

Descriptors: Multiple Choice Tests, Classification, Astronomy, Introductory Courses

The Potential Impact of Not Being Able to Create Parallel Tests on Expected Classification Accuracy

Peer reviewed

Wyse, Adam E. – Applied Psychological Measurement, 2011

In many practical testing situations, alternate test forms from the same testing program are not strictly parallel to each other and instead the test forms exhibit small psychometric differences. This article investigates the potential practical impact that these small psychometric differences can have on expected classification accuracy. Ten…

Descriptors: Test Format, Test Construction, Testing Programs, Psychometrics

Techniques for Developing Higher-Level Objective Test Questions.

Stape, Christopher J. – Performance and Instruction, 1995

Suggests methods for developing higher level objective test questions. Taxonomies that define learning outcomes are discussed; and examples for various test types are presented, including multiple correct answers; more complex forms, including classification and multiple true-false; relations and correlates; and interpretive exercises. (LRW)

Descriptors: Classification, Objective Tests, Outcomes of Education, Test Construction

Domain-Referenced Versus Mastery Conceptualization of Criterion-Referenced Measurement: A Clarification.

Berk, Ronald A. – 1980

Two approaches to criterion-referenced measurement are described and contrasted--domain-referenced testing and mastery testing. This paper is organized according to ten issues or stages in test construction: (1) content domain specification; (2) item construction; (3) item domain specification; (4) item analysis; (5) item selection; (6) parallel…

Descriptors: Classification, Criterion Referenced Tests, Mastery Tests, Measurement Objectives

Validity of a Taxonomy of Multiple-Choice Item-Writing Rules.

Peer reviewed

Haladyna, Thomas M.; Downing, Steven M. – Applied Measurement in Education, 1989

Results of 96 theoretical/empirical studies were reviewed to see if they support a taxonomy of 43 rules for writing multiple-choice test items. The taxonomy is the result of an analysis of 46 textbooks dealing with multiple-choice item writing. For nearly half of the rules, no research was found. (SLD)

Descriptors: Classification, Literature Reviews, Multiple Choice Tests, Test Construction

Order of Elicited Responses on a Questionnaire as a Measure of Topic Salience.

Hensley, Wayne E. – 1992

Two studies among U.S. college students (n=88 and n=329) examined the relationships between the order in which responses are offered on a questionnaire and the ranked importance of those responses. Study 1 included 36 males and 52 females, and Study 2 included 127 males and 202 females. Both studies found that approximately one-third (32 percent…

Descriptors: Classification, College Students, Higher Education, Questionnaires

A Taxonomy of Multiple-Choice Item-Writing Rules.

Peer reviewed

Haladyna, Thomas M.; Downing, Steven M. – Applied Measurement in Education, 1989

A taxonomy of 43 rules for writing multiple-choice test items is presented, based on a consensus of 46 textbooks. These guidelines are presented as complete and authoritative, with solid consensus apparent for 33 of the rules. Four rules lack consensus, and 5 rules were cited fewer than 10 times. (SLD)

Descriptors: Classification, Interrater Reliability, Multiple Choice Tests, Objective Tests

The Effect of Grouped versus Randomized Questionnaire Format on Scale Reliability and Validity: A Three-Study Investigation.

Peer reviewed

Schriesheim, Chester A.; And Others – Educational and Psychological Measurement, 1989

Three studies explored the effects of grouping versus randomized items in questionnaires on internal consistency and test-retest reliability with samples of 80, 80, and 100, respectively, university students and undergraduates. The 2 correlational and 1 experimental studies were reasonably consistent in demonstrating that neither format was…

Descriptors: Classification, College Students, Evaluation Methods, Higher Education

Word Processing for Item Banking and Test Production. Final Report.

Boyd, Joseph L. – 1982

This report describes the sequence of activities that took place as the Examination Division of the New Jersey Department of Civil Service introduced a word processing system for a test item bank and for production of camera-ready test copy. The equipment selection, installation and orientation procedures are discussed. Keyboard and CRT terminals,…

Descriptors: Classification, Computer Assisted Testing, Item Banks, Occupational Tests

An Analysis of Factors Affecting the Difficulty of Dialogue Items in TOEFL Listening Comprehension. TOEFL Research Reports, 51.

Nissan, Susan; And Others – 1996

One of the item types in the Listening Comprehension section of the Test of English as a Foreign Language (TOEFL) test is the dialogue. Because the dialogue item pool needs to have an appropriate balance of items at a range of difficulty levels, test developers have examined items at various difficulty levels in an attempt to identify their…

Descriptors: Classification, Dialogs (Language), Difficulty Level, English (Second Language)

Test Form Accuracy.

Wise, Lauress – 1993

As high-stakes use of tests increases, it becomes vital that test developers and test users communicate clearly about the accuracy and limitations of the scores generated by a test after it is assembled and used. A procedure is described for portraying the accuracy of test scores. It can be used in setting accuracy targets during form construction…

Descriptors: Classification, High Stakes Tests, Item Response Theory, Military Personnel
