Publication Date
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 42 |
| Since 2017 (last 10 years) | 120 |
| Since 2007 (last 20 years) | 213 |
Descriptor
| Multiple Choice Tests | 534 |
| Test Reliability | 534 |
| Test Validity | 303 |
| Test Construction | 240 |
| Test Items | 173 |
| Foreign Countries | 116 |
| Item Analysis | 102 |
| Higher Education | 90 |
| Difficulty Level | 86 |
| Guessing (Tests) | 74 |
| Scoring | 69 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 10 |
| Frary, Robert B. | 9 |
| Alonzo, Julie | 7 |
| Frisbie, David A. | 6 |
| Irvin, P. Shawn | 6 |
| Lai, Cheng-Fei | 6 |
| Park, Bitnara Jasmine | 6 |
| Tindal, Gerald | 6 |
| Wilcox, Rand R. | 5 |
| Albanese, Mark A. | 4 |
| Biancarosa, Gina | 4 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 11 |
| Practitioners | 8 |
| Teachers | 5 |
Location
| Turkey | 18 |
| Indonesia | 17 |
| Germany | 9 |
| Iran | 8 |
| Canada | 6 |
| Malaysia | 4 |
| Nigeria | 4 |
| Australia | 3 |
| Florida | 3 |
| Japan | 3 |
| Pakistan | 3 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Arth, Thomas O. – 1986
The process of revising and validating two English language tests used by the United States armed forces in hiring foreign nationals overseas is described. Development of the item banks and classification of items are outlined, and field testing in the United States and overseas is described. The tests were the basic and intermediate level…
Descriptors: Armed Forces, Comparative Analysis, Correlation, Difficulty Level
Weber, George – 1974
It is asserted in this paper that some standardized tests do not do a good job of what they claim to do, and for some testing purposes nonstandardized tests are more appropriate or efficient. Under present circumstances, group I.Q. tests should be abolished. They provide no useful information that cannot be gained from achievement tests. And what…
Descriptors: Achievement Tests, College Entrance Examinations, Criterion Referenced Tests, Cultural Influences
Jackson, Douglas N.; And Others – 1971
In a comparative evaluation of a standard true-false format for personality assessment and a forced-choice format, subjects from college residential units were assigned randomly to respond either to the forced-choice or standard true-false form of the Personality Research Form (PRF). All subjects also rated themselves and the members of their…
Descriptors: Behavior Rating Scales, College Housing, College Students, Comparative Analysis
Peer reviewedWhitley, Theodore W. – Nursing Outlook, 1979
Gives examples of major sources of unreliability in multiple choice items in health professions classroom achievement tests (clues to response combinations, mutually exclusive alternatives, implausible distractors) and offers some suggestions for eliminating them when writing such tests. (MF)
Descriptors: Achievement Tests, Allied Health Occupations Education, Guessing (Tests), Item Analysis
Strong, Gregory – Thought Currents in English Literature, 1995
This paper traces developments in educational psychology and measurement that led to the Test of English as a Foreign Language (TOEFL) and the test of English for International Communication (TOEIC) and the application of educational measurement terms such as validity and reliability to testing. Use of a table of specifications for planning…
Descriptors: Cloze Procedure, Difficulty Level, English (Second Language), Foreign Countries
Trevisan, Michael S.; Sax, Gilbert – 1990
Reliability and validity of multiple-choice examinations as a function of the number of options per item and of student ability were computed for 435 junior class parochial high school students in the tri-county area of Portland (Oregon). The verbal section of the Washington Pre-College Test Battery was used. The least discriminating options were…
Descriptors: Ability Grouping, Academic Ability, College Bound Students, High Achievement
Murchan, Damian P. – 1989
The reliability, content validity, and construct validity were compared for two test formats in a public examination used to assess a secondary school geography course. The 11-item geography portion of the Intermediate Certificate Examination (essay examination) was administered in June 1987 to 400 secondary school students in Ireland who also…
Descriptors: Achievement Tests, Comparative Testing, Construct Validity, Content Validity
Johnson, Patricia – 1987
A proficiency examination developed for placing non-native English-speakers in appropriate expository writing courses is described. The instrument is a multiple-choice examination containing items that test specific expository writing skills through reading skills. The rationale for including such items for placement in expository writing courses,…
Descriptors: Cloze Procedure, Cohesion (Written Composition), Correlation, English (Second Language)
Levine, Michael V.; Rubin, Donald B. – 1976
Appropriateness indexes (statistical formulas) for detecting suspiciously high or low scores on aptitude tests were presented, based on a simulation of the Scholastic Aptitude Test (SAT) with 3,000 simulated scores--2,800 normal and 200 suspicious. The traditional index--marginal probability--uses a model for the normal examinee's test-taking…
Descriptors: Academic Ability, Aptitude Tests, College Entrance Examinations, High Schools
Peer reviewedKirst, Michael W. – Educational Researcher, 1991
Discusses the movement toward authentic assessment, also called direct or performance assessment, as an alternative to multiple-choice, standardized, norm-referenced testing. Authentic testing involves assessment tasks that are real instances of extended criterion performances rather than proxies of actual learning goals. Questions use of…
Descriptors: Competency Based Education, Criterion Referenced Tests, Educational Assessment, Educational Diagnosis
Isham, Steven P.; Allen, Nancy L. – 1992
As a result of the dual roles of the National Assessment of Educational Progress (NAEP) to measure trends in academic achievement over time and to measure what students know and can do, a scale anchoring procedure was developed. Although the NAEP provides norm-referenced information about student proficiency, the scale anchoring procedure gives…
Descriptors: Academic Achievement, Criterion Referenced Tests, Elementary School Students, Elementary Secondary Education
Macpherson, Colin R.; Rowley, Glenn L. – 1986
Teacher-made mastery tests were administered in a classroom-sized sample to study their decision consistency. Decision-consistency of criterion-referenced tests is usually defined in terms of the proportion of examinees who are classified in the same way after two test administrations. Single-administration estimates of decision consistency were…
Descriptors: Classroom Research, Comparative Testing, Criterion Referenced Tests, Cutting Scores
Scholz, George E.; Scholz, Celeste M. – 1981
This study examines the relationship between an open-ended cloze test and its multiple-choice versions generated from three sources: (1) interlingual learner-generated distractors--distractors selected from one language group and administered to a different language group, (2) intralingual learner-generated distractors--distractors selected from…
Descriptors: Chinese, Cloze Procedure, English for Special Purposes, Language Proficiency
Oller, John W., Jr. – 1977
This paper questions the purpose of testing in second language instruction. Comments are based on an examination of tests used by the Defense Language Institute for students of English as a second language. Two kinds of tests are used: the English Comprehension Level (ECL), used primarily as a basis for setting exit requirements, and "Book…
Descriptors: Communicative Competence (Languages), English (Second Language), Formative Evaluation, Item Analysis
Park, James – 1972
The use of videotape tests is presented. Such tests enable the educator to assess student performance more directly than traditional paper and pencil tests. Test 1 was exploratory. Test 2 was designed to measure empathetic understanding. It contains 16 scenes, each about one minute long, which show five individuals in a group situation. The…
Descriptors: Academic Achievement, Audiovisual Aids, College Students, Educational Psychology


