Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 6 |
| Since 2017 (last 10 years) | 13 |
| Since 2007 (last 20 years) | 24 |
Descriptor
| Alternative Assessment | 29 |
| Evaluation Methods | 29 |
| Test Reliability | 29 |
| Test Validity | 21 |
| Student Evaluation | 14 |
| Performance Based Assessment | 7 |
| Educational Practices | 6 |
| Scores | 6 |
| Elementary Secondary Education | 5 |
| Evaluation Criteria | 5 |
| Evaluation Research | 5 |
Author
| Booker, Kevin | 2 |
| Bruch, Julie | 2 |
| Gill, Brian | 2 |
| Abedi, Jamal | 1 |
| Allen, Patricia J. | 1 |
| Anthony, Jennifer | 1 |
| Ari Korhonen | 1 |
| Artturi Tilanterä | 1 |
| Bakeman, Roger | 1 |
| Baker, Eva L. | 1 |
| Bardhoshi, Gerta | 1 |
Audience
| Researchers | 1 |
Location
| California | 2 |
| Finland | 1 |
| Israel | 1 |
| Massachusetts | 1 |
| North Carolina | 1 |
| Pennsylvania | 1 |
| Sweden | 1 |
| United Kingdom | 1 |
| Virginia | 1 |
Laws, Policies, & Programs
| Every Student Succeeds Act… | 1 |
| Individuals with Disabilities… | 1 |
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
| ACT Assessment | 2 |
| Dynamic Indicators of Basic… | 2 |
| Iowa Tests of Basic Skills | 2 |
| Preliminary Scholastic… | 2 |
| Stanford Achievement Tests | 2 |
| Early Childhood Longitudinal… | 1 |
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Marjahan Begum; Pontus Haglund; Ari Korhonen; Violetta Lonati; Mattia Monga; Filip Strömbäck; Artturi Tilanterä – Informatics in Education, 2024
There can be many reasons why students fail to answer correctly to summative tests in advanced computer science courses: often the cause is a lack of prerequisites or misconceptions about topics presented in previous courses. One of the ITiCSE 2020 working groups investigated the possibility of designing assessments suitable for differentiating…
Descriptors: Foreign Countries, College Students, Prerequisites, Computer Science Education
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Robert Meyer; Sara Hu; Michael Christian – Society for Research on Educational Effectiveness, 2023
Background: This paper develops a new method to estimate quasi-experimental evaluation models when it is necessary to control for measurement error in predictors and individual assignment to the treatment group is based on these same fallible variables. A major methodological finding of the study is that standard methods of estimating models that…
Descriptors: Error of Measurement, Measurement Techniques, Elementary Secondary Education, Report Cards
Jennifer A. Bury – ProQuest LLC, 2024
No Child Left Behind (NCLB) cemented a standardized testing assessment culture in the United States but research has highlighted the inequities (Au, 2020; Dixon, 1978; Grodsky et al., 2008; Khan, 2020; Moses & Nanna, 2007), unreliability (Hunt et al., 2010; Pizmony-Levy & Green Saraisky, 2016) and negative impacts (Berryhill et al., 2009;…
Descriptors: Alternative Assessment, Test Validity, Test Reliability, Accountability
Minerva Bonilla; Daniel Findley – Advances in Engineering Education, 2024
Educators and institutions have considered and continue to explore alternatives to measure students' learning and track their performance. An alternative that started to gain popularity due to its effectiveness in promoting learning engagement, equity, and inclusion, and helping mitigate concerns due to mental health was "ungrading."…
Descriptors: Alternative Assessment, Undergraduate Students, Civil Engineering, Self Evaluation (Individuals)
Yaniv Biton; Ester Halfon – International Journal of Education in Mathematics, Science and Technology, 2024
Evaluation in mathematics is an inherent part of the discipline. In the current study, issues in the assessment of mathematics that concern MTs and S-MTs are studied. The basic assumption for this study is that improving teachers' ability to deal with the challenges of assessment necessitates examining whether those issues are essential or…
Descriptors: Mathematics Teachers, Teacher Attitudes, Mathematics Instruction, Student Evaluation
Little, Todd D.; Chang, Rong; Gorrall, Britt K.; Waggenspack, Luke; Fukuda, Eriko; Allen, Patricia J.; Noam, Gil G. – International Journal of Behavioral Development, 2020
We revisit the merits of the retrospective pretest-posttest (RPP) design for repeated-measures research. The underutilized RPP method asks respondents to rate survey items twice during the same posttest measurement occasion from two specific frames of reference: "now" and "then." Individuals first report their current attitudes…
Descriptors: Pretesting, Alternative Assessment, Program Evaluation, Evaluation Methods
Dumas, Denis G.; McNeish, Daniel M. – Educational Researcher, 2018
Dynamic measurement modeling (DMM) has been shown to improve the consequential validity of longitudinal mathematics assessment in the Early Childhood Longitudinal Study-Kindergarten (ECLS-K) database. Here, the authors demonstrate the capability of DMM to similarly improve the consequential validity of ECLS-K reading assessment through the…
Descriptors: Measurement Techniques, Student Evaluation, Alternative Assessment, Evaluation Methods
Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017
Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, α), test-retest, alternate forms, interscorer, and…
Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests
Berliner, David C. – Education Policy Analysis Archives, 2018
The Scylla and Charybdis in this discussion of teacher evaluation are standardized achievement test data on the one hand, and classroom observational systems on the other. These are the two most common methods used to judge teachers' competency. Both have serious flaws: the former primarily with validity, the latter primarily with reliability. At…
Descriptors: Teacher Evaluation, Evaluation Problems, Standardized Tests, Achievement Tests
Castellano, Katherine E.; McCaffrey, Daniel F. – Educational Measurement: Issues and Practice, 2017
Mean or median student growth percentiles (MGPs) are a popular measure of educator performance, but they lack rigorous evaluation. This study investigates the error in MGP due to test score measurement error (ME). Using analytic derivations, we find that errors in the commonly used MGP are correlated with average prior latent achievement: Teachers…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Value Added Models, Achievement Gains
Furtak, Erin Marie; Ruiz-Primo, Maria Araceli; Bakeman, Roger – Educational Measurement: Issues and Practice, 2017
Formative assessment is a classroom practice that has received much attention in recent years for its established potential at increasing student learning. A frequent analytic approach for determining the quality of formative assessment practices is to develop a coding scheme and determine frequencies with which the codes are observed; however,…
Descriptors: Sequential Approach, Formative Evaluation, Alternative Assessment, Incidence
Matthews, Michael S.; Farmer, Jennie – Australasian Journal of Gifted Education, 2017
Dynamic assessment methods, initially developed by Feuerstein in the 1970s, have been recommended as being more equitable for identifying the academic abilities of students who may not perform well on traditional assessments due to these learners' cultural, linguistic, or economic differences from the population for whom the traditional measures…
Descriptors: Academic Achievement, Achievement Gains, Predictive Measurement, Hispanic American Students
Ngo, Federick; Kwon, William W. – Research in Higher Education, 2015
Community college students are often placed in developmental math courses based on the results of a single placement test. However, concerns about accurate placement have recently led states and colleges across the country to consider using other measures to inform placement decisions. While the relationships between college outcomes and such…
Descriptors: Access to Education, Success, Community Colleges, Mathematics Education
Parkes, Kelly A.; Powell, Sean R. – Arts Education Policy Review, 2015
The purpose of this article is to describe and analyze the edTPA, a performance assessment created by the Stanford Center for Assessment, Learning, and Equity (SCALE) and administered by Pearson, Inc., to assess the professional readiness of student teachers. We challenge claims made in support of using this assessment, specifically within the…
Descriptors: Teacher Evaluation, Performance Based Assessment, Student Teacher Evaluation, Evaluation Methods
