Publication Date
| In 2026 | 0 |
| Since 2025 | 28 |
| Since 2022 (last 5 years) | 114 |
| Since 2017 (last 10 years) | 281 |
| Since 2007 (last 20 years) | 518 |
Descriptor
| Testing Problems | 4851 |
| Elementary Secondary Education | 1262 |
| Test Validity | 1008 |
| Test Construction | 801 |
| Standardized Tests | 790 |
| Higher Education | 658 |
| Test Reliability | 607 |
| Student Evaluation | 583 |
| Testing | 564 |
| Test Bias | 562 |
| Achievement Tests | 555 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 248 |
| Researchers | 220 |
| Teachers | 81 |
| Administrators | 35 |
| Policymakers | 34 |
| Parents | 15 |
| Counselors | 13 |
| Students | 5 |
| Community | 3 |
| Support Staff | 2 |
Location
| Canada | 52 |
| Australia | 45 |
| California | 44 |
| United Kingdom | 37 |
| United States | 36 |
| United Kingdom (England) | 31 |
| China | 29 |
| Netherlands | 26 |
| Florida | 25 |
| New York | 25 |
| United Kingdom (Great Britain) | 24 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards with or without Reservations | 1 |
Bramley, Tom; Crisp, Victoria – Assessment in Education: Principles, Policy & Practice, 2019
For many years, question choice has been used in some UK public examinations, with students free to choose which questions they answer from a selection (within certain parameters). There has been little published research on choice of exam questions in recent years in the UK. In this article we distinguish different scenarios in which choice…
Descriptors: Test Items, Test Construction, Difficulty Level, Foreign Countries
Schweizer, Karl; Reiß, Siegbert; Troche, Stefan – Educational and Psychological Measurement, 2019
The article reports three simulation studies conducted to find out whether the effect of a time limit for testing impairs model fit in investigations of structural validity, whether the representation of the assumed source of the effect prevents impairment of model fit and whether it is possible to identify and discriminate this method effect from…
Descriptors: Timed Tests, Testing, Barriers, Testing Problems
Rios, Joseph A.; Deng, Jiayi; Ihlenfeldt, Samuel D. – Educational Assessment, 2022
The present meta-analysis sought to quantify the average degree of aggregated test score distortion due to rapid guessing (RG). Included studies group-administered a low-stakes cognitive assessment, identified RG via response times, and reported the rate of examinees engaging in RG, the percentage of RG responses observed, and/or the degree of…
Descriptors: Guessing (Tests), Testing Problems, Scores, Item Response Theory
Danielle R. Blazek; Jason T. Siegel – International Journal of Social Research Methodology, 2024
Social scientists have long agreed that satisficing behavior increases error and reduces the validity of survey data. There have been numerous reviews on detecting satisficing behavior, but preventing this behavior has received less attention. The current narrative review provides empirically supported guidance on preventing satisficing by…
Descriptors: Response Style (Tests), Responses, Reaction Time, Test Interpretation
Agwu, Prince; Odii, Aloysius; Orjiakor, Tochukwu; Roy, Pallavi; Nzeadibe, Chidi; Onalu, Chinyere; Okoye, Uzoma Odera; Onwujekwe, Obinna – Quality Assurance in Education: An International Perspective, 2022
Purpose: The purpose of this study is to describe the nature and operations of schools commonly regarded as "Miracle Examination Centres (MECs)" in Nigeria, through the lens of stakeholders in education. This study also assessed stakeholders' perspectives on the possible solutions to the problem of MECs. Design/methodology/approach: The…
Descriptors: Foreign Countries, Educational Malpractice, Cheating, Stakeholders
Keating, Xiaofen; Liu, Xiaolu; Stephenson, Rachyl; Guan, Jianmin; Hodges, Michael – European Physical Education Review, 2020
If used appropriately in schools, youth fitness testing can play a significant role in promoting a physically active lifestyle among school-age children. Unfortunately, many issues exist when testing students' health-related fitness (HRF) components, such as privacy concerns, misuse of testing results, and time-consuming test procedures. This…
Descriptors: Health Related Fitness, Physical Education, Self Evaluation (Individuals), Testing Problems
Rivas, Axel; Scasso, Martín Guillermo – Journal of Education Policy, 2021
Since 2000, the PISA test implemented by OECD has become the prime benchmark for international comparisons in education. The 2015 PISA edition introduced methodological changes that altered the nature of its results. PISA made no longer valid non-reached items of the final part of the test, assuming that those unanswered questions were more a…
Descriptors: Test Validity, Computer Assisted Testing, Foreign Countries, Achievement Tests
von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023
Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…
Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit
Saskia van Laar; Jianan Chen; Johan Braeken – Measurement: Interdisciplinary Research and Perspectives, 2024
Questionnaires in educational research assessing students' attitudes and beliefs are low-stakes for the students. As a consequence, students might not always consistently respond to a questionnaire scale but instead provide more random response patterns with no clear link to items' contents. We study inter-individual differences in students'…
Descriptors: Foreign Countries, Response Style (Tests), Grade 8, Secondary School Students
Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024
In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…
Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment
Rezai, Afsheen; Alibakhshi, Gudarz; Farokhipour, Sajjad; Miri, Mowla – Language Testing in Asia, 2021
This study aims to disclose the Iranian university teachers' perceptions of the fundamentals of language assessment literacy (LAL). To this aim, using purposive sampling, eighteen university teachers from two Iranian universities were invited to participate in semi-structured interviews. Their viewpoints were audio-recorded, transcribed, and…
Descriptors: Foreign Countries, Phenomenology, Alternative Assessment, Testing Problems
Benton, Tom; Williamson, Joanna – Research Matters, 2022
Equating methods are designed to adjust between alternate versions of assessments targeting the same content at the same level, with the aim that scores from the different versions can be used interchangeably. The statistical processes used in equating have, however, been extended to statistically "link" assessments that differ, such as…
Descriptors: Statistical Analysis, Equated Scores, Definitions, Alternative Assessment
Atehortua, Laura – ProQuest LLC, 2022
Intelligence tests are used in a variety of settings such as schools, clinics, and courts to assess the intellectual capacity of individuals of all ages. Intelligence tests are used to make high-stakes decisions such as special education placement, employment, eligibility for social security services, and determination of the death penalty.…
Descriptors: Adults, Intelligence Tests, Children, Error of Measurement
Campione-Barr, Nicole; Lindell, Anna K.; Giron, Sonia E. – Developmental Psychology, 2020
The use of differences scores to assess agreement/disagreement has a long and contentious history. Laird (2020) notes, however, that developmentalists have been particularly resistant to discontinue the use of difference scores. One area of developmental science where difference scores are still in regular use is that of parental differential…
Descriptors: Educational Research, Hypothesis Testing, Differences, Scores
Behforouz, Behnam – Journal of Language and Linguistic Studies, 2022
The present study aimed to cover a holistic viewpoint toward assessment and its features. It discussed the problems in this area during the dominance of COVID-19. This study sought to present some notes on the current online assessment strategies used by the institutions. It measured the effects of the implemented techniques on the nature and…
Descriptors: Computer Assisted Testing, Second Language Learning, Second Language Instruction, Holistic Approach

Peer reviewed
Direct link
