Publication Date
| In 2026 | 0 |
| Since 2025 | 28 |
| Since 2022 (last 5 years) | 114 |
| Since 2017 (last 10 years) | 281 |
| Since 2007 (last 20 years) | 518 |
Descriptor
| Testing Problems | 4851 |
| Elementary Secondary Education | 1262 |
| Test Validity | 1008 |
| Test Construction | 801 |
| Standardized Tests | 790 |
| Higher Education | 658 |
| Test Reliability | 607 |
| Student Evaluation | 583 |
| Testing | 564 |
| Test Bias | 562 |
| Achievement Tests | 555 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 248 |
| Researchers | 220 |
| Teachers | 81 |
| Administrators | 35 |
| Policymakers | 34 |
| Parents | 15 |
| Counselors | 13 |
| Students | 5 |
| Community | 3 |
| Support Staff | 2 |
Location
| Canada | 52 |
| Australia | 45 |
| California | 44 |
| United Kingdom | 37 |
| United States | 36 |
| United Kingdom (England) | 31 |
| China | 29 |
| Netherlands | 26 |
| Florida | 25 |
| New York | 25 |
| United Kingdom (Great Britain) | 24 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards with or without Reservations | 1 |
Salmani Nodoushan, Mohammad Ali – Online Submission, 2021
This paper follows a line of logical argumentation to claim that what Samuel Messick conceptualized about construct validation has probably been misunderstood by some educational policy makers, practicing educators, and classroom teachers. It argues that, while Messick's unified theory of test validation aimed at (a) warning educational…
Descriptors: Construct Validity, Test Theory, Test Use, Affordances
Chang, Kuo-Feng – ProQuest LLC, 2022
This dissertation was designed to foster a deeper understanding of population invariance in the context of composite-score equating and provide practitioners with guidelines for addressing score equity concerns at the composite score level. The purpose of this dissertation was threefold. The first was to compare different composite equating…
Descriptors: Test Items, Equated Scores, Methods, Design
Phelps, Richard P. – Online Submission, 2020
Ten years ago, the author worked as the Director of Assessments for the District of Columbia Public Schools (DCPS). For temporal context, the author arrived after the first of the infamous test cheating scandals and left just before the incident that spawned a second. Indeed, the author filled a new position created to both manage test security…
Descriptors: Educational Change, School Districts, Public Schools, Testing
David Allen; Rie Koizumi – Language Assessment Quarterly, 2024
"The English Speaking Achievement Test for Japanese Junior High School Students" (ESAT-J) was introduced to contribute to levelling up public English education in Tokyo in 2022. Critics, however, have made claims in the mass media against the use of the test and stakeholder groups have called for its cancellation. This paper presents an…
Descriptors: English (Second Language), Second Language Learning, Foreign Countries, Junior High School Students
James D. Weese; Ronna C. Turner; Allison Ames; Xinya Liang; Brandon Crawford – Journal of Experimental Education, 2024
In this study a standardized effect size was created for use with the SIBTEST procedure. Using this standardized effect size, a single set of heuristics was developed that are appropriate for data fitting different item response models (e.g., 2-parameter logistic, 3-parameter logistic). The standardized effect size rescales the raw beta-uni value…
Descriptors: Test Bias, Test Items, Item Response Theory, Effect Size
Paul T. von Hippel – Annenberg Institute for School Reform at Brown University, 2023
Longitudinal studies can produce biased estimates of learning if children miss tests. In an application to summer learning, we illustrate how missing test scores can create an illusion of large summer learning gaps when true gaps are close to zero. We demonstrate two methods that reduce bias by exploiting the correlations between missing and…
Descriptors: Testing Problems, Scores, Educational Research, Longitudinal Studies
Olson, John F.; Lazarus, Sheryl S.; Thurlow, Martha L.; Quanbeck, Mari – National Center on Educational Outcomes, 2021
This report provides a snapshot of how accommodated tests for students with disabilities, accessibility, alternate assessments, and other related issues were addressed in states' test security policies for 2020-21. Strong test security policies and procedures are needed to help ensure the integrity and validity of state assessments, yet some test…
Descriptors: Testing Accommodations, Information Security, State Policy, Policy Analysis
Tavares, Walter; Kuper, Ayelet; Kulasegaram, Kulamakan; Whitehead, Cynthia – Advances in Health Sciences Education, 2020
The array of different philosophical positions underlying contemporary views on competence, assessment strategies and justification have led to advances in assessment science. Challenges may arise when these philosophical positions are not considered in assessment design. These can include (a) a logical incompatibility leading to varied or…
Descriptors: Performance Based Assessment, Educational Testing, Test Interpretation, Test Results
Abdulrahman Alshammari – ProQuest LLC, 2024
A critical component of modern software development practices, particularly continuous integration (CI), is the halt of development activities in response to test failures which requires further investigation and debugging. As software changes, regression testing becomes vital to verify that new code does not affect existing functionality.…
Descriptors: Computer Software, Programming, Coding, Test Reliability
Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025
The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveal that a number of evaluation studies have been done by Indonesian scholars in the…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning
Curdt, Wiebke; Schreiber-Barsch, Silke – International Review of Education, 2020
In the past decade, the numeracy component in adult basic education has gained scholarly attention. The issue has been addressed by large-scale assessments of adults' skills and intergovernmental policy agendas, but also by qualitative research into numeracy from the perspective of social practice theory. However, some aspects of numeracy are…
Descriptors: Participatory Research, Numeracy, Adult Basic Education, Testing
Darina Scully; Mary Carroll; Sarah Clarke; Gráinne Guirke – Assessment in Education: Principles, Policy & Practice, 2025
In recent years, the lower secondary school curriculum in the Republic of Ireland has been subject to assessment-led reform, many elements of which, such as the increased focus on continuous school-based assessment, reflect those of similar initiatives introduced in other countries. This paper explores the extent to which this reformed approach is…
Descriptors: Foreign Countries, Secondary School Teachers, Secondary School Curriculum, Educational Assessment
Hyeryung Lee; Walter P. Vispoel – Journal of Educational Measurement, 2025
Traditional methods for detecting cheating on assessments tend to focus on either identifying cheaters or compromised items in isolation, overlooking their interconnection. In this study, we present a novel biclustering approach that simultaneously detects both cheaters and compromised items by identifying coherent subgroups of examinees and items…
Descriptors: Identification, Cheating, Test Wiseness, Test Items
Pornphan Sureeyatanapas; Panitas Sureeyatanapas; Uthumporn Panitanarak; Jittima Kraisriwattana; Patchanan Sarootyanapat; Daniel O'Connell – Language Testing in Asia, 2024
Ensuring consistent and reliable scoring is paramount in education, especially in performance-based assessments. This study delves into the critical issue of marking consistency, focusing on speaking proficiency tests in English language learning, which often face greater reliability challenges. While existing literature has explored various…
Descriptors: Foreign Countries, Students, English Language Learners, Speech
Linda Borger; Stefan Johansson; Rolf Strietholt – Educational Assessment, Evaluation and Accountability, 2024
PISA aims to serve as a "global yardstick" for educational success, as measured by student performance. For comparisons to be meaningful across countries or over time, PISA samples must be representative of the population of 15-year-old students in each country. Exclusions and non-response can undermine this representativeness and…
Descriptors: Achievement Tests, International Assessment, Foreign Countries, Secondary School Students

Direct link
Peer reviewed
