Publication Date
In 2025 | 22 |
Since 2024 | 51 |
Since 2021 (last 5 years) | 133 |
Since 2016 (last 10 years) | 307 |
Since 2006 (last 20 years) | 520 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Practitioners | 248 |
Researchers | 220 |
Teachers | 81 |
Administrators | 35 |
Policymakers | 34 |
Parents | 15 |
Counselors | 13 |
Students | 5 |
Community | 3 |
Support Staff | 2 |
Location
Canada | 52 |
Australia | 45 |
California | 44 |
United Kingdom | 37 |
United States | 34 |
United Kingdom (England) | 31 |
China | 28 |
Netherlands | 26 |
Florida | 25 |
New York | 25 |
United Kingdom (Great Britain) | 24 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards with or without Reservations | 1 |
Wind, Stefanie A. – Educational and Psychological Measurement, 2023
Rating scale analysis techniques provide researchers with practical tools for examining the degree to which ordinal rating scales (e.g., Likert-type scales or performance assessment rating scales) function in psychometrically useful ways. When rating scales function as expected, researchers can interpret ratings in the intended direction (i.e.,…
Descriptors: Rating Scales, Testing Problems, Item Response Theory, Models
Ken O'Connor; Matt Townsley – Phi Delta Kappan, 2025
Decisions about assessment are often built on myths about teacher professional judgment and subjectivity that prioritize standardized assessment over classroom assessment. Ken O'Connor and Matt Townsley discuss some of the most common myths and explain how to dispel them by developing clear guidelines in which teachers can exercise their judgment,…
Descriptors: Decision Making, Student Evaluation, Standardized Tests, Testing Problems
Glory Tobiason; Adrienne Lavine – Change: The Magazine of Higher Learning, 2025
Current methods for evaluating faculty teaching fall short, and one way to address this is through campus-wide initiatives that focus on change at the level of academic units. The complex context of higher education makes meaningful teaching evaluation difficult; in particular, four sobering realities of this context must be taken into account in…
Descriptors: Teacher Evaluation, Evaluation Methods, Testing Problems, Educational Change
Carlos Cinelli; Andrew Forney; Judea Pearl – Sociological Methods & Research, 2024
Many students of statistics and econometrics express frustration with the way a problem known as "bad control" is treated in the traditional literature. The issue arises when the addition of a variable to a regression equation produces an unintended discrepancy between the regression coefficient and the effect that the coefficient is…
Descriptors: Regression (Statistics), Robustness (Statistics), Error of Measurement, Testing Problems
Natia Afriana Suri; Festiyed; Minda Azhar; Yerimadesi; Yuni Ahda; Heffi Alberida – Research in Learning Technology, 2025
Digital literacy is a critical competency in education across all levels, from primary to higher education. It includes skills such as technical proficiency, information evaluation, online collaboration, creativity and ethical technology use. This study conducts a Systematic Literature Review (SLR), following Preferred Reporting Items for…
Descriptors: Digital Literacy, Measures (Individuals), Core Competencies, Testing Problems
Lewis, Jennifer; Sireci, Stephen G. – Educational Measurement: Issues and Practice, 2022
This module is designed for educators, educational researchers, and psychometricians who would like to develop an understanding of the basic concepts of validity theory, test validation, and documenting a "validity argument." It also describes how an in-depth understanding of the purposes and uses of educational tests sets the foundation…
Descriptors: Test Validity, Tests, Testing Problems, Faculty Development
Alex Buckley – Studies in Higher Education, 2024
Despite a large amount of critical research literature, traditional examinations continue to be widely used in higher education. This article reviews recent literature in order to assess the role played by the approaches adopted by researchers in the gap between research on exams, and the way exams are used. Viviane Robinson's 'problem-based…
Descriptors: Literature Reviews, Testing, Higher Education, Testing Problems
Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024
This research paper investigated the importance of conducting measurement invariance analysis in developing measurement tools for assessing differences between and among study variables. Most of the studies, which tended to develop an inventory to assess the existence of an attitude, behavior, belief, IQ, or an intuition in a person's…
Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures
Chang, Kuo-Feng – ProQuest LLC, 2022
This dissertation was designed to foster a deeper understanding of population invariance in the context of composite-score equating and provide practitioners with guidelines for addressing score equity concerns at the composite score level. The purpose of this dissertation was threefold. The first was to compare different composite equating…
Descriptors: Test Items, Equated Scores, Methods, Design
James D. Weese; Ronna C. Turner; Allison Ames; Xinya Liang; Brandon Crawford – Journal of Experimental Education, 2024
In this study a standardized effect size was created for use with the SIBTEST procedure. Using this standardized effect size, a single set of heuristics was developed that are appropriate for data fitting different item response models (e.g., 2-parameter logistic, 3-parameter logistic). The standardized effect size rescales the raw beta-uni value…
Descriptors: Test Bias, Test Items, Item Response Theory, Effect Size
Paul T. von Hippel – Annenberg Institute for School Reform at Brown University, 2023
Longitudinal studies can produce biased estimates of learning if children miss tests. In an application to summer learning, we illustrate how missing test scores can create an illusion of large summer learning gaps when true gaps are close to zero. We demonstrate two methods that reduce bias by exploiting the correlations between missing and…
Descriptors: Testing Problems, Scores, Educational Research, Longitudinal Studies
Abdulrahman Alshammari – ProQuest LLC, 2024
A critical component of modern software development practices, particularly continuous integration (CI), is the halt of development activities in response to test failures which requires further investigation and debugging. As software changes, regression testing becomes vital to verify that new code does not affect existing functionality.…
Descriptors: Computer Software, Programming, Coding, Test Reliability
Brunfaut, Tineke – Language Testing, 2023
In this invited Viewpoint on the occasion of the 40th anniversary of the journal "Language Testing," I argue that at the core of future challenges and opportunities for the field--both in scholarly and operational respects--remain basic questions and principles in language testing and assessment. Despite the high levels of sophistication…
Descriptors: Language Tests, Testing, Language Usage, Testing Problems
Pornphan Sureeyatanapas; Panitas Sureeyatanapas; Uthumporn Panitanarak; Jittima Kraisriwattana; Patchanan Sarootyanapat; Daniel O'Connell – Language Testing in Asia, 2024
Ensuring consistent and reliable scoring is paramount in education, especially in performance-based assessments. This study delves into the critical issue of marking consistency, focusing on speaking proficiency tests in English language learning, which often face greater reliability challenges. While existing literature has explored various…
Descriptors: Foreign Countries, Students, English Language Learners, Speech
Linda Borger; Stefan Johansson; Rolf Strietholt – Educational Assessment, Evaluation and Accountability, 2024
PISA aims to serve as a "global yardstick" for educational success, as measured by student performance. For comparisons to be meaningful across countries or over time, PISA samples must be representative of the population of 15-year-old students in each country. Exclusions and non-response can undermine this representativeness and…
Descriptors: Achievement Tests, International Assessment, Foreign Countries, Secondary School Students