Wesolowski, Brian C. – Music Educators Journal, 2020
Validity, reliability, and fairness are three prominent indicators for evaluating the quality of assessment processes. Each of the indicators is most often written about and applied in the context of large-scale assessment. As a result, the technical properties of these indicators make them limited in both their practicality and relevance for…
Descriptors: Music Education, Test Validity, Test Reliability, Student Evaluation
Chu-Yang Chang; Hsu-Chan Kuo – Education and Information Technologies, 2025
The rapid advancement of educational technologies in recent decades has underscored the increasing importance of digital literacy (DL) as a core competency for all students, as recognised in various educational policies and programs. Evaluating students' DL is crucial for providing valuable insights to guide future educational initiatives. This…
Descriptors: Digital Literacy, Questionnaires, Test Construction, Test Validity
Junho Lee; Sowon Ahn; Yeong-Houn Yi – Australian Journal of Applied Linguistics, 2025
The rise of machine translation demands a fundamental shift in both translators' roles and educational approaches. However, translation education research and practice have struggled to keep pace with the latest developments. To bridge the pedagogical gap, this study conceptualises machine translation literacy within the context of translation…
Descriptors: Test Construction, Test Validity, Test Reliability, Translation
Mansooreh Hosseinnia; Zahra Kafi – Language Testing in Asia, 2024
As testing involves various aspects of education as well as the ones who are involved like instructors, students, managers, teacher trainers, testers, and decision-makers, it comes to be highly crucial to develop ethical tests. In addition, as some methods of testing are more favored and practiced compared to others without considering the ethical…
Descriptors: Test Construction, Test Validity, Ethics, Testing
Gayle Geschwind; Michael Vignal; Marcos D. Caballero; H. J. Lewandowski – Physical Review Physics Education Research, 2024
The Survey of Physics Reasoning on Uncertainty Concepts in Experiments (SPRUCE) was designed to measure students' proficiency with measurement uncertainty concepts and practices across ten different assessment objectives to help facilitate the improvement of laboratory instruction focused on this important topic. To ensure the reliability and…
Descriptors: Measurement, Ambiguity (Context), Scientific Concepts, Physics
Emma Healy – ProQuest LLC, 2024
The shortage of autism specialists and lack of culturally sensitive autism assessment tools are helping to perpetuate racial and ethnic disparities in autism identification and treatment. Using DisCrit as a framework, this quantitative study examined the utility of one autism assessment tool, the Social Responsiveness Scale, second edition (SRS-2)…
Descriptors: Autism Spectrum Disorders, Student Evaluation, Diagnostic Tests, Disability Identification
Juliane Schlesier; Diana Raufelder; Barbara Moschner – Journal of Early Adolescence, 2024
This paper describes the development and validation of an instrument to assess how students deal with emotionally challenging classroom situations (the DECCS Questionnaire). The questionnaire is based on a vignette with one learning and one performance situation in a classroom, and is intended for students in grades 4 to 7. On a sample of N = 639…
Descriptors: Behavior Problems, Emotional Problems, Student Behavior, Elementary School Students
Kevin Ackermans; Marjoke Bakker; Pierre Gorissen; Anne-Marieke Loon; Marijke Kral; Gino Camp – Journal of Computer Assisted Learning, 2024
Background: A practical test that measures the information and communication technology (ICT) skills students need for effectively using ICT in primary education has yet to be developed (Oh et al., 2021). This paper reports on the development, validation, and reliability of a test measuring primary school students' ICT skills required for…
Descriptors: Test Construction, Test Validity, Measures (Individuals), Elementary School Students
Sujiyani Kassiavera; A. Suparmi; C. Cari; Sukarmin Sukarmin – Journal of Baltic Science Education, 2024
The challenge of accurately assessing critical thinking in physics education, particularly on topics like work and energy, remains a key issue for educators. The current study aims to address this challenge by exploring students' critical thinking abilities using two-tier test data analyzed through the Rasch model. Data were collected from…
Descriptors: Critical Thinking, Physics, Science Instruction, Foreign Countries
Bowen-Mendoza, Lorena; Pinargote-Ortega, Maricela; Meza, Jaime; Ventura, Sebastián – Journal of Computing in Higher Education, 2022
Peer evaluation consists of the evaluation of students by their peers following criteria or rubrics provided by the teacher, where the way to evaluate students is specified so that they achieve the desired competencies. The quality of the measurement instrument must meet two essential criteria: validity and reliability. In this research, we…
Descriptors: Peer Evaluation, Student Evaluation, Scoring Rubrics, Information Technology
Yousuf, Mustafa S.; Miles, Katherine; Harvey, Heather; Al-Tamimi, Mohammad; Badran, Darwish – Journal of University Teaching and Learning Practice, 2022
Exams should be valid, reliable, and discriminative. Multiple informative methods are used for exam analysis. Displaying analysis results numerically, however, may not be easily comprehended. Using graphical analysis tools could be better for the perception of analysis results. Two such methods were employed: standardized x-bar control charts with…
Descriptors: Multiple Choice Tests, Testing, Test Reliability, Test Validity
Diem Thi Ngoc Hoang; Huy Phung; Nhi Tran – Interactive Learning Environments, 2023
With the increasing significance of technology use in both daily life and education, the digital native assessment scale (DNAS) [developed by Teo, T. (2013). An initial development and validation of a Digital Natives Assessment Scale (DNAS). Computers & Education, 67, 51-57.] has been widely used as a tool to investigate the digital nativeness…
Descriptors: Digital Literacy, Preservice Teachers, Foreign Countries, Likert Scales
Georgios Zacharis; Stamatios Papadakis – Educational Process: International Journal, 2025
Background/purpose: Generative artificial intelligence (GenAI) is often promoted as a transformative tool for assessment, yet evidence of its validity compared to human raters remains limited. This study examined whether an AI-based rater could be used interchangeably with trained faculty in scoring complex coursework. Materials/methods:…
Descriptors: Artificial Intelligence, Technology Uses in Education, Computer Assisted Testing, Grading
Hong, Maxwell; Steedle, Jeffrey T.; Cheng, Ying – Educational and Psychological Measurement, 2020
Insufficient effort responding (IER) affects many forms of assessment in both educational and psychological contexts. Much research has examined different types of IER, IER's impact on the psychometric properties of test scores, and preprocessing procedures used to detect IER. However, there is a gap in the literature in terms of practical advice…
Descriptors: Responses, Psychometrics, Test Validity, Test Reliability
Ken Ardon – Pioneer Institute for Public Policy Research, 2024
This paper reviews overall student performance as well as the performance of student subgroups on the assessment system developed in response to the Massachusetts Education Reform Act of 1993 (MERA), the Massachusetts Comprehensive Assessment System (MCAS). Comparing students in Massachusetts to students in the rest of the United States or against…
Descriptors: Accuracy, Test Reliability, Elementary Secondary Education, Achievement Tests

