Publication Date
| In 2026 | 0 |
| Since 2025 | 5 |
| Since 2022 (last 5 years) | 17 |
| Since 2017 (last 10 years) | 31 |
| Since 2007 (last 20 years) | 60 |
Descriptor
| Test Reliability | 607 |
| Testing Problems | 607 |
| Test Validity | 328 |
| Test Construction | 156 |
| Elementary Secondary Education | 98 |
| Standardized Tests | 97 |
| Achievement Tests | 87 |
| Test Interpretation | 87 |
| Higher Education | 78 |
| Test Bias | 77 |
| Testing | 76 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 5 |
| Ysseldyke, James E. | 4 |
| Green, Donald Ross | 3 |
| Popham, W. James | 3 |
| Weiss, David J. | 3 |
| Wilcox, Rand R. | 3 |
| Aiken, Lewis R. | 2 |
| Andrulis, Richard S. | 2 |
| Bao, Lei | 2 |
| Bennett, Randy Elliot | 2 |
| Bormuth, John R. | 2 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 28 |
| Researchers | 23 |
| Teachers | 11 |
| Counselors | 3 |
| Administrators | 1 |
| Parents | 1 |
| Policymakers | 1 |
| Students | 1 |
| Support Staff | 1 |
Location
| Australia | 6 |
| Canada | 5 |
| United Kingdom | 5 |
| California | 4 |
| China | 4 |
| Illinois | 3 |
| Israel | 3 |
| United States | 3 |
| Texas | 2 |
| Turkey | 2 |
| United Kingdom (Scotland) | 2 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Patrisius Istiarto Djiwandono; Daniel Ginting – Language Education & Assessment, 2025
The teaching of English as a foreign language in Indonesia has a long history, and it is always important to ask whether the assessment of the students' language skills has been valid and reliable. A screening of many articles in several prominent databases reveal that a number of evaluation studies have been done by Indonesian scholars in the…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning
Danielle R. Blazek; Jason T. Siegel – International Journal of Social Research Methodology, 2024
Social scientists have long agreed that satisficing behavior increases error and reduces the validity of survey data. There have been numerous reviews on detecting satisficing behavior, but preventing this behavior has received less attention. The current narrative review provides empirically supported guidance on preventing satisficing by…
Descriptors: Response Style (Tests), Responses, Reaction Time, Test Interpretation
Esra Sözer Boz – Education and Information Technologies, 2025
International large-scale assessments provide cross-national data on students' cognitive and non-cognitive characteristics. A critical methodological issue that often arises in comparing data from cross-national studies is ensuring measurement invariance, indicating that the construct under investigation is the same across the compared groups.…
Descriptors: Achievement Tests, International Assessment, Foreign Countries, Secondary School Students
Sasima Charubusp; Orawan Wangsombat; Napatacha Sriwichai; Chanida Phongnapharuk – PASAA: Journal of Language Teaching and Learning in Thailand, 2025
Washback refers to the impact of a test on instruction and learning, with high-stakes tests exerting both positive and negative effects. This study examined the washback of an English exit exam (EEE) on English language learning at a Thai university where English-medium instruction is used in most academic disciplines. The EEE is an in-house…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Tests
Firdissa J. Aga – Intersection: A Journal at the Intersection of Assessment and Learning, 2024
The study investigated hurdles to the quality of student learning assessment by examining issues related to assessment procedures and practices, learners and learning, learning resources and test constructs, and test admin and feedback. Quantitative and qualitative data were collected from two Ethiopian universities using two types of…
Descriptors: Foreign Countries, College Faculty, College Students, Test Construction
Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024
This research paper investigated the importance of conducting measurement invariance analysis in developing measurement tools for assessing differences between and among study variables. Most of the studies, which tended to develop an inventory to assess the existence of an attitude, behavior, belief, IQ, or an intuition in a person's…
Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures
James Dean Brown; Ali Panahi; Hassan Mohebbi – Language Teaching Research Quarterly, 2023
Panahi and Mohebbi review James Dean Brown's 50-years of research in language testing, curriculum development and research statistics with reference to an impressionistic framework for analysis containing two components with their subcomponents: Annotations (i.e., briefing and implications) and main concepts and themes (i.e., testing and teaching…
Descriptors: Second Language Learning, Second Language Instruction, Language Tests, Curriculum Development
Zita Lysaght; Michael O'Leary; Angela Mazzone; Conor Scully – Sage Research Methods Cases, 2022
Since 2018, colleagues from two research centers at Dublin City University have been collaborating to develop a measurement scale to assess individuals' ability to identify workplace bullying. Having agreed on an operational definition of the construct, an item pool of 26 workplace bullying scenarios, that is, short descriptions of…
Descriptors: Foreign Countries, Test Construction, Test Validity, Test Reliability
Paul T. von Hippel; Brendan A. Schuetze – Annenberg Institute for School Reform at Brown University, 2025
Researchers across many fields have called for greater attention to heterogeneity of treatment effects--shifting focus from the average effect to variation in effects between different treatments, studies, or subgroups. True heterogeneity is important, but many reports of heterogeneity have proved to be false, non-replicable, or exaggerated. In…
Descriptors: Educational Research, Replication (Evaluation), Generalizability Theory, Inferences
Jiayi Wang; Michael T. Kalkbrenner; Riley Schaner – Psychology in the Schools, 2025
Teaching is a stressful profession with a high turnover rate. Schools and related institutions need to take more action to support teachers and keep teacher stress at a manageable level. The continued research and practical effort require measures to examine teachers' stress in a briefer and accurate manner. The Teacher Stress Scale is a recently…
Descriptors: Elementary School Teachers, Secondary School Teachers, Preschool Teachers, Stress Variables
Mengna Zheng; Chengwu Ruan – South African Journal of Education, 2024
Comprehensive quality assessment is an assessment system that identifies and explores students' strengths. By examining the developmental progress made in pilot provinces that have implemented comprehensive quality assessment, valuable insights and guidance can be derived for other provinces preparing to adopt this assessment approach. In this…
Descriptors: Foreign Countries, High School Students, College Entrance Examinations, Pilot Projects
Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024
Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…
Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing
Alper Gülay; Emre Cumali; Damla Cumali – International Journal of Contemporary Educational Research, 2024
This qualitative phenomenological study explores the experiences of parents of children with special needs in Turkey, specifically their encounters with Guidance and Research Centers (GRCs) during the process of obtaining educational assessment reports. Through semi-structured interviews with 25 parents, the study reveals complex emotions and…
Descriptors: Foreign Countries, Special Needs Students, Parent Attitudes, Parent Participation
McGill, Ryan J.; Ward, Thomas J.; Canivez, Gary L. – School Psychology International, 2020
The Wechsler Intelligence Scale for Children (WISC) is the most widely used intelligence test in the world. Now in its fifth edition, the WISC-V has been translated and adapted for use in nearly a dozen countries. Despite its popularity, numerous concerns have been raised about some of the procedures used to develop and validate translated and…
Descriptors: Children, Intelligence Tests, Translation, Test Validity
Giraldo, Frank – HOW, 2019
The purpose of this article of reflection is to raise awareness of how poor design of language assessments may have detrimental effects, if crucial qualities and technicalities of test design are not met. The article first discusses these central qualities for useful language assessments. Then, guidelines for creating listening assessments, as an…
Descriptors: Test Construction, Consciousness Raising, Language Tests, Second Language Learning

Peer reviewed
Direct link
