Publication Date
| In 2026 | 0 |
| Since 2025 | 3 |
| Since 2022 (last 5 years) | 19 |
| Since 2017 (last 10 years) | 36 |
| Since 2007 (last 20 years) | 68 |
Descriptor
| Alternative Assessment | 90 |
| Test Reliability | 90 |
| Test Validity | 64 |
| Student Evaluation | 31 |
| Evaluation Methods | 29 |
| Foreign Countries | 16 |
| Educational Assessment | 15 |
| Performance Based Assessment | 15 |
| Standardized Tests | 14 |
| Test Construction | 14 |
| Academic Achievement | 13 |
Author
| Tindal, Gerald | 3 |
| Booker, Kevin | 2 |
| Bruch, Julie | 2 |
| Crawford, Lindy | 2 |
| Gill, Brian | 2 |
| Kettler, Ryan J. | 2 |
| Kurz, Alexander | 2 |
| Plucker, Jonathan A. | 2 |
| Abedi, Jamal | 1 |
| Alcock, Lara | 1 |
| Alexander Swiderski | 1 |
Audience
| Parents | 1 |
| Practitioners | 1 |
| Researchers | 1 |
Location
| California | 3 |
| Turkey | 3 |
| Arizona | 2 |
| Malaysia | 2 |
| Pennsylvania | 2 |
| United Kingdom (England) | 2 |
| United States | 2 |
| Canada | 1 |
| China | 1 |
| Colorado | 1 |
| District of Columbia | 1 |
Laws, Policies, & Programs
| Individuals with Disabilities… | 3 |
| No Child Left Behind Act 2001 | 3 |
| Every Student Succeeds Act… | 1 |
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Marjahan Begum; Pontus Haglund; Ari Korhonen; Violetta Lonati; Mattia Monga; Filip Strömbäck; Artturi Tilanterä – Informatics in Education, 2024
There can be many reasons why students fail to answer summative tests correctly in advanced computer science courses: often the cause is a lack of prerequisites or misconceptions about topics presented in previous courses. One of the ITiCSE 2020 working groups investigated the possibility of designing assessments suitable for differentiating…
Descriptors: Foreign Countries, College Students, Prerequisites, Computer Science Education
Eunsoo Cho; Mina Son; Sarah Reiley; Eun Ha Kim – Language, Speech, and Hearing Services in Schools, 2025
Purpose: The purpose of this study was to develop and evaluate the initial reliability and validity evidence of the dynamic assessment (DA) of early reading and language as a second-stage screener in kindergarten, the first year of formal schooling. The DA comprises three subtests that capture students' ability to learn letter sounds and blending…
Descriptors: Alternative Assessment, Test Construction, Test Validity, Kindergarten
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Judy R. Wilkerson; W. Steve Lang; LaSonya Moore – Journal of Research in Education, 2025
The DAATS (Dispositions Assessments Aligned with Teacher Standards) battery is a series of five instruments of different item types that measure teachers' consistency with the critical dispositions embedded in the InTASC Standards. The purpose of this study was to continue a 20-year research project on the development and implementation of…
Descriptors: Educational Assessment, National Standards, Teacher Evaluation, Teacher Competencies
Barnali Mazumdar; Nora De la Mora; Teresa Roberts; Alexander Swiderski; Maria Kapantzoglou; Gerasimos Fergadiotis – Journal of Speech, Language, and Hearing Research, 2024
Purpose: Anomia, or word-finding difficulty, is a prevalent and persistent feature of aphasia, a neurogenic language disorder affecting millions of people in the United States. Anomia assessments are essential for measuring performance and monitoring outcomes in clinical settings. This study aims to evaluate the reliability of response time (RT)…
Descriptors: Pictorial Stimuli, Naming, Aphasia, Reaction Time
Robert Meyer; Sara Hu; Michael Christian – Society for Research on Educational Effectiveness, 2023
Background: This paper develops a new method to estimate quasi-experimental evaluation models when it is necessary to control for measurement error in predictors and individual assignment to the treatment group is based on these same fallible variables. A major methodological finding of the study is that standard methods of estimating models that…
Descriptors: Error of Measurement, Measurement Techniques, Elementary Secondary Education, Report Cards
Jennifer A. Bury – ProQuest LLC, 2024
No Child Left Behind (NCLB) cemented a standardized testing assessment culture in the United States but research has highlighted the inequities (Au, 2020; Dixon, 1978; Grodsky et al., 2008; Khan, 2020; Moses & Nanna, 2007), unreliability (Hunt et al., 2010; Pizmony-Levy & Green Saraisky, 2016) and negative impacts (Berryhill et al., 2009;…
Descriptors: Alternative Assessment, Test Validity, Test Reliability, Accountability
Minerva Bonilla; Daniel Findley – Advances in Engineering Education, 2024
Educators and institutions have considered and continue to explore alternatives to measure students' learning and track their performance. An alternative that started to gain popularity due to its effectiveness in promoting learning engagement, equity, and inclusion, and helping mitigate concerns due to mental health was "ungrading."…
Descriptors: Alternative Assessment, Undergraduate Students, Civil Engineering, Self Evaluation (Individuals)
Alkis Küçükaydin, Mensure; Esen, Seher – International Journal of Science Education, 2023
The Draw-A-Scientist Test (DAST) has been used for many years as a data collection tool in studies that attempt to determine the image of the scientist. In this test, which attempts to reveal the image of the scientist through drawings, the drawings are analyzed using a coding ruler or checklist. However, the DAST has been…
Descriptors: Personality Measures, Cognitive Tests, Freehand Drawing, Projective Measures
Mengna Zheng; Chengwu Ruan – South African Journal of Education, 2024
Comprehensive quality assessment is an assessment system that identifies and explores students' strengths. By examining the developmental progress made in pilot provinces that have implemented comprehensive quality assessment, valuable insights and guidance can be derived for other provinces preparing to adopt this assessment approach. In this…
Descriptors: Foreign Countries, High School Students, College Entrance Examinations, Pilot Projects
Hancock, Gregory R.; An, Ji – Measurement: Interdisciplinary Research and Perspectives, 2020
As an alternative to Cronbach's α for estimating scale reliability, McDonald's ω has attracted increased attention within the methodological community for its less stringent measurement assumptions. Notwithstanding, ω is still seldom used by practitioners, likely due to its unavailability in popular software packages (e.g., SPSS)…
Descriptors: Evaluation, Alternative Assessment, Reliability, Test Reliability
Steven Holtzman; Jonathan Steinberg; Jonathan Weeks; Christopher Robertson; Jessica Findley; David Klieger – ETS Research Report Series, 2024
At a time when institutions of higher education are exploring alternatives to traditional admissions testing, institutions are also seeking to better support students and prepare them for academic success. Under such an engaged model, one may seek to measure not just the accumulated knowledge and skills that students would bring to a new academic…
Descriptors: Law Schools, College Applicants, Legal Education (Professions), College Entrance Examinations
Thompson, W. Jake; Clark, Amy K.; Nash, Brooke – Applied Measurement in Education, 2019
As the use of diagnostic assessment systems transitions from research applications to large-scale assessments for accountability purposes, reliability methods that provide evidence at each level of reporting are needed. The purpose of this paper is to summarize one simulation-based method for estimating and reporting reliability for an…
Descriptors: Test Reliability, Diagnostic Tests, Classification, Computation
Willem C. van Wyk; Gary W. Collins; Maria M. Swanepoel – Transformation in Higher Education, 2024
Journalism's societal role hinges on effective communication, demanding proficient language skills. This study assesses a South African University of Technology's Department of Journalism, probing existing selection methods' efficacy in predicting success in two English modules. Concerns persist about these methods accurately identifying students…
Descriptors: Foreign Countries, Journalism Education, Undergraduate Students, English (Second Language)
Predicting Student Success in a Magnet School Setting through Intelligence and Non-Cognitive Factors
John Jeffrey McCann Jr. – ProQuest LLC, 2024
Magnet schools have been a main tool or innovation in urban education settings in the United States, originating in the early 1970's and expanding into most large urban districts today (Blank, 1989). While some magnet schools do not rely on a specific criterion to determine entry, many do. This study focuses on such a setting where students must…
Descriptors: Intelligence Tests, Magnet Schools, Urban Schools, Screening Tests

