Publication Date
| In 2026 | 0 |
| Since 2025 | 8 |
| Since 2022 (last 5 years) | 57 |
| Since 2017 (last 10 years) | 148 |
| Since 2007 (last 20 years) | 246 |
Descriptor
| Multiple Choice Tests | 526 |
| Test Format | 526 |
| Test Items | 260 |
| Foreign Countries | 145 |
| Test Construction | 139 |
| Higher Education | 115 |
| Difficulty Level | 96 |
| Comparative Analysis | 93 |
| Scores | 86 |
| Test Reliability | 68 |
| Computer Assisted Testing | 64 |
Audience
| Practitioners | 25 |
| Teachers | 21 |
| Researchers | 17 |
| Students | 7 |
| Administrators | 1 |
| Parents | 1 |
Location
| Canada | 13 |
| Turkey | 12 |
| Netherlands | 9 |
| Germany | 8 |
| Australia | 6 |
| Japan | 6 |
| California | 5 |
| Iran | 5 |
| South Korea | 5 |
| United Kingdom | 5 |
| China | 4 |
Tamir, Pinchas – 1989
An investigation of biology matriculation tests in Israel was undertaken to assess the use of justifications with multiple-choice items and to compare the effect of three item formats on students' performance. More specifically, the study was designed to determine the: (1) extent to which justifications will differ if the correct answer is made…
Descriptors: Biology, College Entrance Examinations, Comparative Analysis, Cross Cultural Studies
Ory, John C.; Ryan, Katherine E. – 1993
This book for college faculty provides a resource for developing, using, and grading classroom exams. The first chapter addresses ways to determine what content should be included on an exam. The second chapter identifies testing considerations such as number of exams, difficulty level of items, and test length. Chapters 3 and 4 provide guidelines…
Descriptors: Classroom Techniques, Codes of Ethics, Essay Tests, Evaluation Methods
Melancon, Janet G.; Thompson, Bruce – 1990
Classical measurement theory was used to investigate measurement characteristics of both parts of the Finding Embedded Figures Test (FEFT) when the test was: administered in either a "no guessing" supply format or a multiple-choice selection format; administered to either undergraduate college students or middle school students; and…
Descriptors: Comparative Testing, Construct Validity, Guessing (Tests), Higher Education
Trevisan, Michael S.; Sax, Gilbert – 1990
Reliability and validity of multiple-choice examinations as a function of the number of options per item and of student ability were computed for 435 junior class parochial high school students in the tri-county area of Portland (Oregon). The verbal section of the Washington Pre-College Test Battery was used. The least discriminating options were…
Descriptors: Ability Grouping, Academic Ability, College Bound Students, High Achievement
Murchan, Damian P. – 1989
The reliability, content validity, and construct validity were compared for two test formats in a public examination used to assess a secondary school geography course. The 11-item geography portion of the Intermediate Certificate Examination (essay examination) was administered in June 1987 to 400 secondary school students in Ireland who also…
Descriptors: Achievement Tests, Comparative Testing, Construct Validity, Content Validity
Johnson, Patricia – 1987
A proficiency examination developed for placing non-native English-speakers in appropriate expository writing courses is described. The instrument is a multiple-choice examination containing items that test specific expository writing skills through reading skills. The rationale for including such items for placement in expository writing courses,…
Descriptors: Cloze Procedure, Cohesion (Written Composition), Correlation, English (Second Language)
Scheuneman, Janice – 1985
A number of hypotheses were tested concerning elements of Graduate Record Examinations (GRE) items that might affect the performance of blacks and whites differently. These elements were characteristics common to several items that otherwise measured different concepts. Seven general hypotheses were tested in the form of sixteen specific…
Descriptors: Black Students, College Entrance Examinations, Graduate Study, Higher Education
Walstad, William B.; Robson, Denise – Journal of Economic Education, 1997 (Peer reviewed)
Applies Item Response Theory methods to data from the national norming of the Test of Economic Literacy to identify test questions with large male-female differences. Regression analysis showed a significant decrease in the magnitude of gender difference, although a difference was still present. (MJP)
Descriptors: Academic Aptitude, Comparative Testing, Economics, Economics Education
Kleinke, David J. – 1979
Four forms of a 36-item adaptation of the Stanford Achievement Test were administered to 484 fourth graders. External factors potentially influencing test performance were examined, namely: (1) item order (easy-to-difficult vs. uniform); (2) response location (left column vs. right column); (3) handedness which may interact with response location;…
Descriptors: Achievement Tests, Answer Sheets, Difficulty Level, Eye Hand Coordination
Chang, Shu-Nu; Chiu, Mei-Hung – International Journal of Science and Mathematics Education, 2005
Scientific literacy and authenticity have gained a lot of attention worldwide in the past few decades. The goal of the study was to develop various authentic assessments to investigate students' scientific literacy, in line with Taiwan's 1997 curriculum reform. In the process, whether ninth graders were able to apply school…
Descriptors: Curriculum Development, Test Items, Educational Assessment, Scientific Principles
Macpherson, Colin R.; Rowley, Glenn L. – 1986
Teacher-made mastery tests were administered in a classroom-sized sample to study their decision consistency. Decision-consistency of criterion-referenced tests is usually defined in terms of the proportion of examinees who are classified in the same way after two test administrations. Single-administration estimates of decision consistency were…
Descriptors: Classroom Research, Comparative Testing, Criterion Referenced Tests, Cutting Scores
Lombard, Juliana V. – 1988
The validity and reliability of two techniques of assessing writing proficiency were compared in a sample of 300 South African students (in Standard 8) for whom English was a second language. The objective, multiple choice method was compared with a subjective, essay method. Students completed a Standard English Second Language Item Bank Test and…
Descriptors: Comparative Analysis, Educational Assessment, English (Second Language), Essay Tests
Hamilton, Laura S. – 1994
Despite the number of studies investigating affective aspects of test taking, little is known about how students perceive the kinds of extended performance assessments currently being developed for state and local testing programs. This paper presents two studies that address these issues. In the first, hands-on science tasks were administered to…
Descriptors: Affective Behavior, Alternative Assessment, Constructed Response, Educational Assessment
Brantmeier, Cindy – Forum on Public Policy Online, 2006
Bernhardt (2003) claims that half of the variance in second language (L2) reading is accounted for by first language literacy (20%) and second language knowledge (30%), and that one of the central goals of current L2 reading research should be to investigate the 50% of variance that remains unexplained. Part of this variance consists of…
Descriptors: Second Language Learning, Reading Research, Gender Differences, Test Format
van Weeren, J., Ed. – 1983
Presented in this symposium reader are nine papers, four of which deal with the theory and impact of the Rasch model on language testing and five of which discuss final examinations in secondary schools in both general and specific terms. The papers are: "Introduction to Rasch Measurement: Some Implications for Language Testing" (J. J.…
Descriptors: Adolescents, Comparative Analysis, Comparative Education, Difficulty Level

