Publication Date
| In 2026 | 0 |
| Since 2025 | 26 |
| Since 2022 (last 5 years) | 112 |
| Since 2017 (last 10 years) | 279 |
| Since 2007 (last 20 years) | 516 |
Audience
| Practitioners | 248 |
| Researchers | 220 |
| Teachers | 81 |
| Administrators | 35 |
| Policymakers | 34 |
| Parents | 15 |
| Counselors | 13 |
| Students | 5 |
| Community | 3 |
| Support Staff | 2 |
Location
| Canada | 52 |
| Australia | 45 |
| California | 44 |
| United Kingdom | 37 |
| United States | 36 |
| United Kingdom (England) | 31 |
| China | 28 |
| Netherlands | 26 |
| Florida | 25 |
| New York | 25 |
| United Kingdom (Great Britain) | 24 |
What Works Clearinghouse Rating
| Meets WWC Standards with or without Reservations | 1 |
Haladyna, Thomas M.; Rodriguez, Michael C.; Stevens, Craig – Applied Measurement in Education, 2019
The evidence is mounting regarding the guidance to employ more three-option multiple-choice items. From theoretical analyses, empirical results, and practical considerations, such items are of equal or higher quality than four- or five-option items, and more items can be administered to improve content coverage. This study looks at 58 tests,…
Descriptors: Multiple Choice Tests, Test Items, Testing Problems, Guessing (Tests)
Schneider, W. Joel; Roman, Zachary – Journal of Psychoeducational Assessment, 2018
We used data simulations to test whether composites consisting of cohesive subtest scores are more accurate than composites consisting of divergent subtest scores. We demonstrate that when multivariate normality holds, divergent and cohesive scores are equally accurate. Furthermore, excluding divergent scores results in biased estimates of…
Descriptors: Statistical Data, Simulation, Testing, Scores
FIPC Linking across Multidimensional Test Forms: Effects of Confounding Difficulty within Dimensions
Kim, Sohee; Cole, Ki Lynn; Mwavita, Mwarumba – International Journal of Testing, 2018
This study investigated the effects of linking potentially multidimensional test forms using the fixed item parameter calibration. Forms had equal or unequal total test difficulty with and without confounding difficulty. The mean square errors and bias of estimated item and ability parameters were compared across the various confounding tests. The…
Descriptors: Test Items, Item Response Theory, Test Format, Difficulty Level
Khan, Shana Sanam – Education and Urban Society, 2020
Standardized testing is an applauded system of testing due to the uniformity that it offers. The idea is that in standardized testing, because every student is being asked exactly the same question and each question has only one specific answer, standardized examinations are neutral, value free, and exonerated from the subjectivity that an…
Descriptors: Standardized Tests, Aptitude Tests, Bilingual Students, Minority Group Students
Marsden, Emma; Dudley, Amber; Hawkes, Rachel – Modern Language Journal, 2023
The awarding organizations that create and administer high-stakes assessments for beginner-to-low-intermediate 16-year-old learners of French, German, and Spanish in England provide optional topic-driven word lists as guides for teachers and textbook writers. Given that these lists are developed by the awarding organizations, they exert a powerful…
Descriptors: Word Lists, Word Frequency, Secondary School Students, Second Language Instruction
Dong, Manxia; Fan, Jason; Xu, Jian – Asia Pacific Journal of Education, 2023
Understanding of the differential washback effects of high-stakes tests on students' learning remains limited. This study attempts to fill this research gap by investigating the differential washback effects of the National Matriculation English Test (NMET) in China on students' English learning process across genders, grades and English…
Descriptors: Testing Problems, English (Second Language), Second Language Learning, Second Language Instruction
Rezai, Afsheen; Alibakhshi, Gudarz; Farokhipour, Sajjad; Miri, Mowla – Language Testing in Asia, 2021
This study aims to disclose the Iranian university teachers' perceptions of the fundamentals of language assessment literacy (LAL). To this aim, using purposive sampling, eighteen university teachers from two Iranian universities were invited to participate in semi-structured interviews. Their viewpoints were audio-recorded, transcribed, and…
Descriptors: Foreign Countries, Phenomenology, Alternative Assessment, Testing Problems
Roelle, Julian; Roelle, Detlev; Berthold, Kirsten – Journal of Experimental Education, 2019
Providing test questions after an initial study phase is a common instructional technique. In theory, questions that require higher-level (deep) processing should be more beneficial than those that require lower-level (shallow) processing. However, empirical evidence on the matter is inconsistent. To shed light on two potential reasons for these…
Descriptors: Testing Problems, Test Items, Cognitive Processes, Problem Based Learning
Miller, Jeff – Educational and Psychological Measurement, 2017
Critics of null hypothesis significance testing suggest that (a) its basic logic is invalid and (b) it addresses a question that is of no interest. In contrast to (a), I argue that the underlying logic of hypothesis testing is actually extremely straightforward and compelling. To substantiate that, I present examples showing that hypothesis…
Descriptors: Hypothesis Testing, Testing Problems, Test Validity, Relevance (Education)
Sinharay, Sandip; Johnson, Matthew S. – Educational and Psychological Measurement, 2017
In a pioneering research article, Wollack and colleagues suggested the "erasure detection index" (EDI) to detect test tampering. The EDI can be used with or without a continuity correction and is assumed to follow the standard normal distribution under the null hypothesis of no test tampering. When used without a continuity correction,…
Descriptors: Deception, Identification, Testing Problems, Error of Measurement
Kato, Pamela M.; de Klerk, Sebastiaan – Journal of Applied Testing Technology, 2017
Serious games are increasingly being explored for use as assessment tools in broad domains. Drawing from research in these domains, we present important advantages and challenges that arise when using games for assessment. In light of this context and as an introduction to this special issue on Serious Games and Assessments, we introduce the…
Descriptors: Evaluation Methods, Formative Evaluation, Design, Educational Games
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2017
An increasing concern of producers of educational assessments is fraudulent behavior during the assessment (van der Linden, 2009). Benefiting from item preknowledge (e.g., Eckerly, 2017; McLeod, Lewis, & Thissen, 2003) is one type of fraudulent behavior. This article suggests two new test statistics for detecting individuals who may have…
Descriptors: Test Items, Cheating, Testing Problems, Identification
Paneerselvam, Bavani – ProQuest LLC, 2017
Multiple-choice retrieval practice with additional lures reduces retention on a later test (Roediger & Marsh, 2005). However, the mechanism underlying the negative outcomes with additional lures is poorly understood. Given that the positive outcomes of retrieval practice are associated with enhanced relational and item-specific processing…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Recall (Psychology)
Reed, Deborah K.; Stevenson, Nathan; LeBeau, Brandon C. – Elementary School Journal, 2019
This study investigated the effects of imposing task- or process-oriented reading behaviors on reading comprehension assessment performance. Students in grades 5-8 (N = 275) were randomly assigned to hear multiple-choice items read aloud before or after reading a test passage and when they were and were not allowed access to the passage while…
Descriptors: Reading Comprehension, Reading Tests, Multiple Choice Tests, Reading Aloud to Others
Hipkins, Rosemary – set: Research Information for Teachers, 2019
PISA [Programme for International Student Assessment] will be in the news again this year. The 2018 results are due to be released at the end of 2019 and they usually generate media interest. This Rangahau Whakarapopoto is a research brief which outlines things to watch out for as you think about what the results might mean.
Descriptors: Achievement Tests, Foreign Countries, Secondary School Students, International Assessment

