Publication Date
| In 2026 | 0 |
| Since 2025 | 59 |
| Since 2022 (last 5 years) | 385 |
| Since 2017 (last 10 years) | 828 |
| Since 2007 (last 20 years) | 1342 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 195 |
| Teachers | 161 |
| Researchers | 93 |
| Administrators | 50 |
| Students | 34 |
| Policymakers | 15 |
| Parents | 12 |
| Counselors | 2 |
| Community | 1 |
| Media Staff | 1 |
| Support Staff | 1 |
| More ▼ | |
Location
| Canada | 62 |
| Turkey | 59 |
| Germany | 40 |
| Australia | 36 |
| United Kingdom | 36 |
| Japan | 35 |
| China | 33 |
| United States | 32 |
| California | 25 |
| Iran | 25 |
| United Kingdom (England) | 25 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Fukuzawa, Sherry; deBraga, Michael – Journal of Curriculum and Teaching, 2019
Graded Response Method (GRM) is an alternative to multiple-choice testing where students rank options according to their relevance to the question. GRM requires discrimination and inference between statements and is a cost-effective critical thinking assessment in large courses where open-ended answers are not feasible. This study examined…
Descriptors: Alternative Assessment, Multiple Choice Tests, Test Items, Test Format
El Rassi, Mary Ann Barbour – International Association for Development of the Information Society, 2019
It has long been debated whether the Open-Book-Open-Web exam was useful and efficient as the traditional closed book exams. Some scholars and practitioners have doubted the efficiency and the possibility of cheating in the OBOW as it is not directly monitored. This paper tends to investigate the effectiveness of OBOW exams by comparing them with…
Descriptors: Developing Nations, Test Format, Tests, Cheating
Mbonigaba, Josue; Oumar, Saidou B. – Africa Education Review, 2017
Whether or not the scores for multiple-choice questions (MCQs) and written questions (WQs) in formative assessments are the same, has been a subject of intense scrutiny. However, the evidence for their similarity at different levels of cognitive ability in applied courses has not been sufficiently documented. This study analysed the comparability…
Descriptors: Foreign Countries, College Students, Student Evaluation, Multiple Choice Tests
Kapuza, A. V.; Tyumeneva, Yu. A. – Russian Education & Society, 2017
One of the ways of controlling for the influence of social expectations on the answers given by survey respondents is to use a social desirability scale together with the main questions. The social desirability scale, which was included in the Teaching and Learning International Survey (TALIS) international comparative study for this purpose, was…
Descriptors: Surveys, Social Desirability, Measures (Individuals), Test Reliability
Hohensinn, Christine; Baghaei, Purya – Psicologica: International Journal of Methodology and Experimental Psychology, 2017
In large scale multiple-choice (MC) tests alternate forms of a test may be developed to prevent cheating by changing the order of items or by changing the position of the response options. The assumption is that since the content of the test forms are the same the order of items or the positions of the response options do not have any effect on…
Descriptors: Multiple Choice Tests, Test Format, Test Items, Difficulty Level
Muniroglu, S.; Subak, E. – Journal of Education and Training Studies, 2018
The football referees perform many actions as jogging, running, sprinting, side steps and backward steps during a football match. Further, the football referees change match activities every 5-6 seconds. Many tests are being conducted to determine the physical levels and competences of football referees like 50 m running, 200 m running, 12 minutes…
Descriptors: Judges, Physical Education, Team Sports, Athletics
Kiat, John Emmanuel; Ong, Ai Rene; Ganesan, Asha – Educational Psychology, 2018
Multiple-choice questions (MCQs) play a key role in standardised testing and in-class assessment. Research into the influence of within-item response order on MCQ characteristics has been mixed. While some researchers have shown preferential selection of response options presented earlier in the answer list, others have failed to replicate these…
Descriptors: Undergraduate Students, Multiple Choice Tests, Attention Control, Item Response Theory
Agus, Mirian; Peró-Cebollero, Maribel; Guàrdia-Olmos, Joan; Portoghese, Igor; Mascia, Maria Lidia; Penna, Maria Pietronilla – EURASIA Journal of Mathematics, Science and Technology Education, 2020
This paper reports some experiments on probabilistic reasoning designed to investigate the impact of the probabilistic problem presentation format (verbal-numerical and graphical-pictorial) on subjects' confidence in the correctness of their performance, other than the calibration between confidence and accuracy. To understand the potential effect…
Descriptors: Accuracy, Self Efficacy, Context Effect, Statistics
Yarahmadzehi, Nahid; Goodarzi, Mostafa – Turkish Online Journal of Distance Education, 2020
Throughout this study technology and especially mobile phones was utilized in EFL classrooms in order to see whether it can influence the process of vocabulary formative assessment and consequently improve vocabulary learning of Iranian pre-intermediate EFL learners or not. Two groups of pre-intermediate EFL learners participated in this study.…
Descriptors: Formative Evaluation, Computer Assisted Testing, Vocabulary Development, English (Second Language)
Mihele, Roxana – Romanian Review of Geographical Education, 2021
The COVID-19 pandemic pushed the limits and limitations of all educational systems, teachers and students around the world. The solution adopted -- distance, online teaching, learning and assessment -- has proven to be of a longer duration than initially anticipated, to the frustration of students, parents, and teachers alike. Nonetheless,…
Descriptors: Electronic Learning, Blended Learning, Online Courses, Distance Education
King, Rosemary; Blayney, Paul; Sweller, John – Accounting Education, 2021
This study offers evidence of the impact of language background on the performance of students enrolled in an accounting study unit. It aims to quantify the effects of language background on performance in essay questions, compared to calculation questions requiring an application of procedures. Marks were collected from 2850 students. The results…
Descriptors: Cognitive Ability, Accounting, Native Language, Second Language Learning
Shin, Sun-Young; Lee, Senyung; Lidster, Ryan – Language Testing, 2021
In this study we investigated the potential for a shared-first-language (shared-L1) effect on second language (L2) listening test scores using differential item functioning (DIF) analyses. We did this in order to understand how accented speech may influence performance at the item level, while controlling for key variables including listening…
Descriptors: Listening Comprehension Tests, Language Tests, Native Language, Scores
Kaya, Elif; O'Grady, Stefan; Kalender, Ilker – Language Testing, 2022
Language proficiency testing serves an important function of classifying examinees into different categories of ability. However, misclassification is to some extent inevitable and may have important consequences for stakeholders. Recent research suggests that classification efficacy may be enhanced substantially using computerized adaptive…
Descriptors: Item Response Theory, Test Items, Language Tests, Classification
Isbell, Dan; Winke, Paula – Language Testing, 2019
The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…
Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning
DiStefano, Christine; Barth, Steven G.; Greer, Fred – Journal of Psychoeducational Assessment, 2019
This study investigated the effect of item position on descriptive statistics, psychometric information, and factor structure of the Pediatric Symptoms Checklist 17-item social-emotional screening instrument (PSC-17). The goal was to determine whether item position, either grouped by factor or mixed across constructs, produced similar results.…
Descriptors: Check Lists, Test Items, Factor Structure, Screening Tests

Peer reviewed
Direct link
