Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 3 |
| Since 2007 (last 20 years) | 6 |
Descriptor
| Classification | 19 |
| Test Construction | 19 |
| Test Format | 19 |
| Test Items | 11 |
| Computer Assisted Testing | 4 |
| Multiple Choice Tests | 4 |
| College Students | 3 |
| Cutting Scores | 3 |
| Difficulty Level | 3 |
| Higher Education | 3 |
| Language Tests | 3 |
| More ▼ | |
Source
Author
| Downing, Steven M. | 2 |
| Haladyna, Thomas M. | 2 |
| Wyse, Adam E. | 2 |
| Babcock, Ben | 1 |
| Bao, Lei | 1 |
| Berk, Ronald A. | 1 |
| Boyd, Joseph L. | 1 |
| Chen, Qingwei | 1 |
| Fu, Zhao | 1 |
| Gifford, Bernard | 1 |
| Han, Jing | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 10 |
| Reports - Research | 8 |
| Speeches/Meeting Papers | 6 |
| Reports - Evaluative | 5 |
| Reports - Descriptive | 4 |
| Information Analyses | 3 |
| Dissertations/Theses -… | 1 |
| Opinion Papers | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Higher Education | 3 |
| Postsecondary Education | 3 |
| Elementary Secondary Education | 1 |
| Secondary Education | 1 |
Audience
Location
| China | 1 |
| Japan | 1 |
| New Jersey | 1 |
| Russia | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| ACT Assessment | 1 |
| Armed Services Vocational… | 1 |
| Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Jing Ma – ProQuest LLC, 2024
This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, lengths, number and location of polytomous item. Results showed that while…
Descriptors: Scoring, Adaptive Testing, Test Items, Classification
Chen, Qingwei; Zhu, Guangtian; Liu, Qiaoyi; Han, Jing; Fu, Zhao; Bao, Lei – Physical Review Physics Education Research, 2020
Problem-solving categorization tasks have been well studied and used as an effective tool for assessment of student knowledge structure. In this study, a traditional free-response categorization test has been modified into a multiple-choice format, and the effectiveness of this new assessment is evaluated. Through randomized testing with Chinese…
Descriptors: Foreign Countries, Test Construction, Multiple Choice Tests, Problem Solving
Jonathan Trace – Language Teaching Research Quarterly, 2023
The role of context in cloze tests has long been seen as both a benefit as well as a complication in their usefulness as a measure of second language comprehension (Brown, 2013). Passage cohesion, in particular, would seem to have a relevant and important effect on the degree to which cloze items function and the interpretability of performances…
Descriptors: Language Tests, Cloze Procedure, Connected Discourse, Test Items
Wyse, Adam E.; Babcock, Ben – Journal of Educational Measurement, 2016
A common suggestion made in the psychometric literature for fixed-length classification tests is that one should design tests so that they have maximum information at the cut score. Designing tests in this way is believed to maximize the classification accuracy and consistency of the assessment. This article uses simulated examples to illustrate…
Descriptors: Cutting Scores, Psychometrics, Test Construction, Classification
Young, Arthur; Shawl, Stephen J. – Astronomy Education Review, 2013
Professors who teach introductory astronomy to students not majoring in science desire them to comprehend the concepts and theories that form the basis of the science. They are usually less concerned about the myriad of
detailed facts and information that accompanies the science. As such, professors prefer to test the students for such…
Descriptors: Multiple Choice Tests, Classification, Astronomy, Introductory Courses
Wyse, Adam E. – Applied Psychological Measurement, 2011
In many practical testing situations, alternate test forms from the same testing program are not strictly parallel to each other and instead the test forms exhibit small psychometric differences. This article investigates the potential practical impact that these small psychometric differences can have on expected classification accuracy. Ten…
Descriptors: Test Format, Test Construction, Testing Programs, Psychometrics
Stape, Christopher J. – Performance and Instruction, 1995
Suggests methods for developing higher level objective test questions. Taxonomies that define learning outcomes are discussed; and examples for various test types are presented, including multiple correct answers; more complex forms, including classification and multiple true-false; relations and correlates; and interpretive exercises. (LRW)
Descriptors: Classification, Objective Tests, Outcomes of Education, Test Construction
Berk, Ronald A. – 1980
Two approaches to criterion-referenced measurement are described and contrasted--domain-referenced testing and mastery testing. This paper is organized according to ten issues or stages in test construction: (1) content domain specification; (2) item construction; (3) item domain specification; (4) item analysis; (5) item selection; (6) parallel…
Descriptors: Classification, Criterion Referenced Tests, Mastery Tests, Measurement Objectives
Peer reviewedHaladyna, Thomas M.; Downing, Steven M. – Applied Measurement in Education, 1989
Results of 96 theoretical/empirical studies were reviewed to see if they support a taxonomy of 43 rules for writing multiple-choice test items. The taxonomy is the result of an analysis of 46 textbooks dealing with multiple-choice item writing. For nearly half of the rules, no research was found. (SLD)
Descriptors: Classification, Literature Reviews, Multiple Choice Tests, Test Construction
Hensley, Wayne E. – 1992
Two studies among U.S. college students (n=88 and n=329) examined the relationships between the order in which responses are offered on a questionnaire and the ranked importance of those responses. Study 1 included 36 males and 52 females, and Study 2 included 127 males and 202 females. Both studies found that approximately one-third (32 percent…
Descriptors: Classification, College Students, Higher Education, Questionnaires
Peer reviewedHaladyna, Thomas M.; Downing, Steven M. – Applied Measurement in Education, 1989
A taxonomy of 43 rules for writing multiple-choice test items is presented, based on a consensus of 46 textbooks. These guidelines are presented as complete and authoritative, with solid consensus apparent for 33 of the rules. Four rules lack consensus, and 5 rules were cited fewer than 10 times. (SLD)
Descriptors: Classification, Interrater Reliability, Multiple Choice Tests, Objective Tests
Peer reviewedSchriesheim, Chester A.; And Others – Educational and Psychological Measurement, 1989
Three studies explored the effects of grouping versus randomized items in questionnaires on internal consistency and test-retest reliability with samples of 80, 80, and 100, respectively, university students and undergraduates. The 2 correlational and 1 experimental studies were reasonably consistent in demonstrating that neither format was…
Descriptors: Classification, College Students, Evaluation Methods, Higher Education
Boyd, Joseph L. – 1982
This report describes the sequence of activities that took place as the Examination Division of the New Jersey Department of Civil Service introduced a word processing system for a test item bank and for production of camera-ready test copy. The equipment selection, installation and orientation procedures are discussed. Keyboard and CRT terminals,…
Descriptors: Classification, Computer Assisted Testing, Item Banks, Occupational Tests
Nissan, Susan; And Others – 1996
One of the item types in the Listening Comprehension section of the Test of English as a Foreign Language (TOEFL) test is the dialogue. Because the dialogue item pool needs to have an appropriate balance of items at a range of difficulty levels, test developers have examined items at various difficulty levels in an attempt to identify their…
Descriptors: Classification, Dialogs (Language), Difficulty Level, English (Second Language)
Wise, Lauress – 1993
As high-stakes use of tests increases, it becomes vital that test developers and test users communicate clearly about the accuracy and limitations of the scores generated by a test after it is assembled and used. A procedure is described for portraying the accuracy of test scores. It can be used in setting accuracy targets during form construction…
Descriptors: Classification, High Stakes Tests, Item Response Theory, Military Personnel
Previous Page | Next Page ยป
Pages: 1 | 2
Direct link
