Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 3 |
| Since 2017 (last 10 years) | 7 |
| Since 2007 (last 20 years) | 9 |
Descriptor
| Evaluation Methods | 14 |
| Language Tests | 14 |
| Multiple Choice Tests | 14 |
| English (Second Language) | 8 |
| Second Language Learning | 8 |
| Foreign Countries | 7 |
| Second Language Instruction | 7 |
| Student Evaluation | 5 |
| Comparative Analysis | 4 |
| Test Format | 4 |
| Test Items | 4 |
| More ▼ | |
Source
Author
Publication Type
| Reports - Research | 10 |
| Journal Articles | 8 |
| Speeches/Meeting Papers | 2 |
| Tests/Questionnaires | 2 |
| Collected Works - Proceedings | 1 |
| Dissertations/Theses -… | 1 |
| Numerical/Quantitative Data | 1 |
| Reports - Evaluative | 1 |
Education Level
| Higher Education | 3 |
| Postsecondary Education | 3 |
| Elementary Education | 2 |
| Elementary Secondary Education | 1 |
| Junior High Schools | 1 |
| Middle Schools | 1 |
| Secondary Education | 1 |
Audience
Location
| Canada | 1 |
| Iran | 1 |
| Netherlands | 1 |
| Philippines | 1 |
| Saudi Arabia | 1 |
| Sweden | 1 |
| Taiwan | 1 |
| Thailand | 1 |
| United Kingdom (England) | 1 |
| United Kingdom (Northern… | 1 |
| United Kingdom (Wales) | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| Sequential Tests of… | 1 |
What Works Clearinghouse Rating
Tomkowicz, Joanna; Kim, Dong-In; Wan, Ping – Online Submission, 2022
In this study we evaluated the stability of item parameters and student scores, using the pre-equated (pre-pandemic) parameters from Spring 2019 and post-equated (post-pandemic) parameters from Spring 2021 in two calibration and equating designs related to item parameter treatment: re-estimating all anchor parameters (Design 1) and holding the…
Descriptors: Equated Scores, Test Items, Evaluation Methods, Pandemics
Abdullah Al Fraidan – International Journal of Distance Education Technologies, 2025
This study explores vocabulary assessment practices in Saudi Arabia's hybrid EFL ecosystem, leveraging platforms like Blackboard and Google Forms. The focus is on identifying prevalent test formats and evaluating their alignment with modern pedagogical goals. To classify vocabulary assessment formats in hybridized EFL contexts and recommend the…
Descriptors: Vocabulary Development, English (Second Language), Second Language Learning, Second Language Instruction
Malec, Wojciech; Krzeminska-Adamek, Malgorzata – Practical Assessment, Research & Evaluation, 2020
The main objective of the article is to compare several methods of evaluating multiple-choice options through classical item analysis. The methods subjected to examination include the tabulation of choice distribution, the interpretation of trace lines, the point-biserial correlation, the categorical analysis of trace lines, and the investigation…
Descriptors: Comparative Analysis, Evaluation Methods, Multiple Choice Tests, Item Analysis
Bachelor, Jeremy W. – Language Learning Journal, 2022
The objective of this study was to employ a new tool for assessing pragmatic "proficiency" (over "performance") in L2 students and to determine if pragmatic lessons on compliment sequences had a positive impact on L2 Spanish students' intercultural competence. To this end, students at a Midwestern college engaged in live video…
Descriptors: Pragmatics, Second Language Learning, Second Language Instruction, Teaching Methods
Mehri Kamrood, Ali; Davoudi, Mohammad; Ghaniabadi, Saeed; Amirian, Seyyed Mohammad Reza – Computer Assisted Language Learning, 2021
Dynamic Assessment (DA) is proposed as a workable diagnostic tool in second or foreign language context. Compared to traditional non-dynamic testing, DA presents a more comprehensive account of human beings' abilities through addressing both the fully internalized abilities and the abilities that are in the process of being internalized. However,…
Descriptors: Language Tests, Computer Assisted Testing, Second Language Learning, Second Language Instruction
Todd, Richard Watson – THAITESOL Journal, 2019
In the test-centric Thai education system, results on national exams are often viewed as indicators of educational success. These exams use multiple-choice which can have detrimental effects on students' attitudes and learning. If school assessments also rely on multiple-choice exams, the situation would be worrying, yet there is little data…
Descriptors: Foreign Countries, English (Second Language), Second Language Instruction, Second Language Learning
Genon, Lynrose Jane Dumandan; Torres, Chezka Bianca P. – English Language Teaching Educational Journal, 2020
This qualitative study identified the language assessment practices in terms of purpose, type, and timing in four elementary language classes in the Philippines. It then evaluated the constructive alignment and content validity of the assessment and described how the constructive alignment reflects the quality of teaching and learning in these…
Descriptors: Alignment (Education), Second Language Learning, Second Language Instruction, Teaching Methods
Chen, I-Jung – Computer Assisted Language Learning, 2016
This study compared how three different gloss modes affected college students' L2 reading comprehension and vocabulary acquisition. The study also compared how results on comprehension and vocabulary acquisition may differ depending on the four assessment methods used. A between-subjects design was employed with three groups of Mandarin-speaking…
Descriptors: Reading Comprehension, Multivariate Analysis, Scores, Multiple Choice Tests
Mbella, Kinge Keka – ProQuest LLC, 2012
Mixed-format assessments are increasingly being used in large scale standardized assessments to measure a continuum of skills ranging from basic recall to higher order thinking skills. These assessments are usually comprised of a combination of (a) multiple-choice items which can be efficiently scored, have stable psychometric properties, and…
Descriptors: Educational Assessment, Test Format, Evaluation Methods, Multiple Choice Tests
Madsen, Harold S. – 1987
A study investigated the effectiveness of the Rasch procedure in measuring response appropriateness, especially for the detection of cheating on multiple-choice language tests. The report gives background information on appropriateness measurement and its potential uses, reviews recent research on cheating and its detection, and describes three…
Descriptors: Cheating, English (Second Language), Evaluation Methods, Language Tests
Huntley, Renee M.; Welch, Catherine – 1988
A study compared student performance on language-usage test items embedded in a passage when the location of the answer was varied. American College Testing (ACT) Assessment experimental units were constructed that presented 35 items whose sequence of foils was varied so that each foil appeared once as the "no change" option embedded in…
Descriptors: College Entrance Examinations, Difficulty Level, Distractors (Tests), Evaluation Methods
Madaus, George F.; Rippey, Robert M. – Journal of Educational Measurement, 1966
The validity of the multiple-choice Sequential Tests of Educational Progress (STEP) Writing Test (1957) was tested by the University of Chicago Center for the Cooperative Study of Instruction. Seven criteria developed by the center to score essay assignments were used to determine the relationship between STEP and actual writing behavior. Of the…
Descriptors: Communication (Thought Transfer), Educational Testing, English Instruction, Evaluation Criteria
Rupp, Andre A.; Ferne, Tracy; Choi, Hyeran – Language Testing, 2006
This article provides renewed converging empirical evidence for the hypothesis that asking test-takers to respond to text passages with multiple-choice questions induces response processes that are strikingly different from those that respondents would draw on when reading in non-testing contexts. Moreover, the article shows that the construct of…
Descriptors: Foreign Countries, Language Tests, Reading Comprehension, Evaluation Methods
van Weeren, J., Ed. – 1983
Presented in this symposium reader are nine papers, four of which deal with the theory and impact of the Rasch model on language testing and five of which discuss final examinations in secondary schools in both general and specific terms. The papers are: "Introduction to Rasch Measurement: Some Implications for Language Testing" (J. J.…
Descriptors: Adolescents, Comparative Analysis, Comparative Education, Difficulty Level

Peer reviewed
Direct link
