Peer reviewed: Allalouf, Avi; Hambleton, Ronald K.; Sireci, Stephen G. – Journal of Educational Measurement, 1999
Focused on whether differential item functioning (DIF) is related to item type in translated test items, and on the causes of DIF, using data from an Israeli college entrance test in Hebrew and its Russian translation. Results from 24,304 college applicants indicate that 34% of items functioned differently across the two language versions. (SLD)
Descriptors: College Applicants, College Entrance Examinations, Foreign Countries, Hebrew
Peer reviewed: Wang, Wen-Chung; Cheng, Ying-Yao – Journal of Applied Measurement, 2001
Explored the measurement issues in a two-stage evaluation for an outstanding faculty award. Thirty college teachers were rated by 293 students using a newly developed inventory. Items fit a Rasch model fairly well, and the separation reliability for the teachers was high. A cut score was established, and a short version was developed. (SLD)
Descriptors: College Faculty, Cutting Scores, Evaluation Methods, Higher Education
Peer reviewed: Greidanus, Tine; Nienhuis, Lydius – Modern Language Journal, 2001
Examined the development of the word knowledge of two groups of advanced learners of French as a second language by means of a slightly revised version of a particular test format. Studied the type of distractor most suited to the participants, distinguished the role of three types of associations, and linked qualitative aspects of word knowledge…
Descriptors: Advanced Students, French, Language Tests, Second Language Instruction
DiBattista, David; Mitterer, John O.; Gosse, Leanne – Teaching in Higher Education, 2004
Undergraduates completed a questionnaire after using the Immediate Feedback Assessment Technique (IFAT), a commercially available answer form for multiple-choice (MC) testing that can be used easily and conveniently with large classes. This simple new technique for MC testing provides immediate feedback for each item in an answer-until-correct…
Descriptors: Multiple Choice Tests, Testing, Feedback, Guessing (Tests)
Prestera, Gustavo E.; Clariana, Roy; Peck, Andrew – Journal of Educational Multimedia and Hypermedia, 2005
In this experimental study, 44 undergraduates completed five computer-based instructional lessons and either two multiple-choice tests or two fill-in-the-blank tests. Color-coded borders were displayed during the lesson, adjacent to the screen text and illustrations. In the experimental condition, corresponding border colors were shown at posttest.…
Descriptors: Experimental Groups, Computer Assisted Instruction, Instructional Effectiveness, Multiple Choice Tests
Osterlind, Steven J.; Miao, Danmin; Sheng, Yanyan; Chia, Rosina C. – International Journal of Testing, 2004
This study investigated the interaction between different cultural groups and item type, and the ensuing effect on construct validity for a psychological inventory, the Myers-Briggs Type Indicator (MBTI, Form G). The authors analyzed 94 items from 2 Chinese-translated versions of the MBTI (Form G) for factorial differences among groups of…
Descriptors: Test Format, Undergraduate Students, Cultural Differences, Test Validity
Liao, Yan; Fukuya, Yoshinori J. – Language Learning, 2004
This study investigates the avoidance of English phrasal verbs by Chinese learners. Six groups of Chinese learners (intermediate and advanced; a total of 70) took one of 3 tests (multiple-choice, translation, or recall), each of which included literal and figurative phrasal verbs, while 15 native speakers took the multiple-choice test. The results show that…
Descriptors: Test Format, Semantics, Native Speakers, Interlanguage
Deane, Paul; Odendahl, Nora; Quinlan, Thomas; Fowles, Mary; Welsh, Cyndi; Bivens-Tatum, Jennifer – ETS Research Report Series, 2008
This paper undertakes a review of the literature on writing cognition, writing instruction, and writing assessment with the goal of developing a framework and competency model for a new approach to writing assessment. The model developed is part of the Cognitively Based Assessments of, for, and as Learning (CBAL) initiative, an ongoing research…
Descriptors: Writing Skills, Writing Instruction, Schemata (Cognition), Writing Evaluation
Zheng, Ying; Cheng, Liying; Klinger, Don A. – TESL Canada Journal, 2007
Large scale testing in English affects second-language students not only greatly but also differently than first-language learners. The research literature reports that confounding factors in such large-scale testing such as varying test formats may differentially affect the performance of students from diverse backgrounds. An investigation of…
Descriptors: Reading Comprehension, Reading Tests, Test Format, Educational Testing
Chun, Christian W. – Language Assessment Quarterly, 2006
This article presents an analysis of Ordinate Corporation's PhonePass Spoken English Test-10. The company promotes this product as being a useful assessment tool for screening job candidates' ability in spoken English. In the real-life domain of the work environment, one of the primary target language use tasks involves extended production…
Descriptors: Language Tests, English (Second Language), Speech Tests, Screening Tests
Hamzah, Hanizah; Ariffin, Siti Rahayah; Yassin, Ruhizan Mohd – Journal of Science and Mathematics Education in Southeast Asia, 2006
This study explored the differential performances of mathematics test items used to test secondary school girls and boys in the national examination. The main purpose was to find out whether type of items is the reason for girls' overachievement in the Malaysian mathematics national examination. To investigate seven types of items, Differential…
Descriptors: Test Items, Test Format, Females, Overachievement
Freedle, Roy; Kostin, Irene – 1993
Prediction of the difficulty (equated delta) of a large sample (n=213) of reading comprehension items from the Test of English as a Foreign Language (TOEFL) was studied using main idea, inference, and supporting statement items. A related purpose was to examine whether text and text-related variables play a significant role in predicting item…
Descriptors: Construct Validity, Difficulty Level, Multiple Choice Tests, Prediction
Carlson, Sybil B.; Ward, William C. – 1988
Issues concerning the cost and feasibility of using Formulating Hypotheses (FH) test item types for the Graduate Record Examinations have slowed research into their use. This project focused on two major issues that need to be addressed in considering FH items for operational use: the costs of scoring and the assignment of scores along a range of…
Descriptors: Adaptive Testing, Computer Assisted Testing, Costs, Pilot Projects
Lunz, Mary E.; Bergstrom, Betty A. – 1995
The Board of Registry (BOR) certifies medical technologists and other laboratory personnel. The BOR has studied adaptive testing for over 6 years and now administers all 17 BOR certification examinations using computerized adaptive testing (CAT). This paper presents an overview of the major research efforts from 1989 to the present related to test…
Descriptors: Adaptive Testing, Computer Assisted Testing, Decision Making, Equated Scores
Wainer, Howard; And Others – 1991
A series of computer simulations was run to measure the relationship between testlet validity and the factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Results confirmed the generality of earlier empirical findings of H. Wainer and others (1991) that making a testlet adaptive yields only marginal…
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Simulation, Item Banks

