ERIC - Search Results

Showing 1 to 15 of 29 results

Empirical Evaluation of a Differentiated Assessment of Data Structures: The Role of Prerequisite Skills

Peer reviewed

Marjahan Begum; Pontus Haglund; Ari Korhonen; Violetta Lonati; Mattia Monga; Filip Strömbäck; Artturi Tilanterä – Informatics in Education, 2024

There can be many reasons why students fail to answer correctly to summative tests in advanced computer science courses: often the cause is a lack of prerequisites or misconceptions about topics presented in previous courses. One of the ITiCSE 2020 working groups investigated the possibility of designing assessments suitable for differentiating…

Descriptors: Foreign Countries, College Students, Prerequisites, Computer Science Education

A Note on the Use of Categorical Subscores

Peer reviewed

Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025

Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…

Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment

Controlling for Measurement Error in Evaluations When Treatment Group Assignment Is Based on Noisy Measures

Peer reviewed

Robert Meyer; Sara Hu; Michael Christian – Society for Research on Educational Effectiveness, 2023

Background: This paper develops a new method to estimate quasi-experimental evaluation models when it is necessary to control for measurement error in predictors and individual assignment to the treatment group is based on these same fallible variables. A major methodological finding of the study is that standard methods of estimating models that…

Descriptors: Error of Measurement, Measurement Techniques, Elementary Secondary Education, Report Cards

Alternate Pathways to Accountability: A Quantitative Study Evaluating the Validity and Reliability of System-Wide Alternative Assessments

Jennifer A. Bury – ProQuest LLC, 2024

No Child Left Behind (NCLB) cemented a standardized testing assessment culture in the United States but research has highlighted the inequities (Au, 2020; Dixon, 1978; Grodsky et al., 2008; Khan, 2020; Moses & Nanna, 2007), unreliability (Hunt et al., 2010; Pizmony-Levy & Green Saraisky, 2016) and negative impacts (Berryhill et al., 2009;…

Descriptors: Alternative Assessment, Test Validity, Test Reliability, Accountability

Evaluation of Self-Assessment "Ungrading" Practices in a STEM Course

Peer reviewed

Minerva Bonilla; Daniel Findley – Advances in Engineering Education, 2024

Educators and institutions have considered and continue to explore alternatives to measure students' learning and track their performance. An alternative that started to gain popularity due to its effectiveness in promoting learning engagement, equity, and inclusion, and helping mitigate concerns due to mental health was "ungrading."…

Descriptors: Alternative Assessment, Undergraduate Students, Civil Engineering, Self Evaluation (Individuals)

Focus on the Assessment Concerns of In-Service and Pre-Service Mathematics Teachers

Peer reviewed

Yaniv Biton; Ester Halfon – International Journal of Education in Mathematics, Science and Technology, 2024

Evaluation in mathematics is an inherent part of the discipline. In the current study, issues in the assessment of mathematics that concern MTs and S-MTs are studied. The basic assumption for this study is that improving teachers' ability to deal with the challenges of assessment necessitates examining whether those issues are essential or…

Descriptors: Mathematics Teachers, Teacher Attitudes, Mathematics Instruction, Student Evaluation

The Retrospective Pretest-Posttest Design Redux: On Its Validity as an Alternative to Traditional Pretest-Posttest Measurement

Peer reviewed

Little, Todd D.; Chang, Rong; Gorrall, Britt K.; Waggenspack, Luke; Fukuda, Eriko; Allen, Patricia J.; Noam, Gil G. – International Journal of Behavioral Development, 2020

We revisit the merits of the retrospective pretest-posttest (RPP) design for repeated-measures research. The underutilized RPP method asks respondents to rate survey items twice during the same posttest measurement occasion from two specific frames of reference: "now" and "then." Individuals first report their current attitudes…

Descriptors: Pretesting, Alternative Assessment, Program Evaluation, Evaluation Methods

Increasing the Consequential Validity of Reading Assessment Using Dynamic Measurement Modeling: A Comment on Dumas and McNeish (2017)

Peer reviewed

Dumas, Denis G.; McNeish, Daniel M. – Educational Researcher, 2018

Dynamic measurement modeling (DMM) has been shown to improve the consequential validity of longitudinal mathematics assessment in the Early Childhood Longitudinal Study-Kindergarten (ECLS-K) database. Here, the authors demonstrate the capability of DMM to similarly improve the consequential validity of ECLS-K reading assessment through the…

Descriptors: Measurement Techniques, Student Evaluation, Alternative Assessment, Evaluation Methods

Processes and Procedures for Estimating Score Reliability and Precision

Peer reviewed

Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, α), test-retest, alternate forms, interscorer, and…

Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests

Between Scylla and Charybdis: Reflections on and Problems Associated with the Evaluation of Teachers in an Era of Metrification

Peer reviewed

Berliner, David C. – Education Policy Analysis Archives, 2018

The Scylla and Charybdis in this discussion of teacher evaluation are standardized achievement test data on the one hand, and classroom observational systems on the other. These are the two most common methods used to judge teachers' competency. Both have serious flaws: the former primarily with validity, the latter primarily with reliability. At…

Descriptors: Teacher Evaluation, Evaluation Problems, Standardized Tests, Achievement Tests

The Accuracy of Aggregate Student Growth Percentiles as Indicators of Educator Performance

Peer reviewed

Castellano, Katherine E.; McCaffrey, Daniel F. – Educational Measurement: Issues and Practice, 2017

Mean or median student growth percentiles (MGPs) are a popular measure of educator performance, but they lack rigorous evaluation. This study investigates the error in MGP due to test score measurement error (ME). Using analytic derivations, we find that errors in the commonly used MGP are correlated with average prior latent achievement: Teachers…

Descriptors: Teacher Evaluation, Teacher Effectiveness, Value Added Models, Achievement Gains

Exploring the Utility of Sequential Analysis in Studying Informal Formative Assessment Practices

Peer reviewed

Furtak, Erin Marie; Ruiz-Primo, Maria Araceli; Bakeman, Roger – Educational Measurement: Issues and Practice, 2017

Formative assessment is a classroom practice that has received much attention in recent years for its established potential at increasing student learning. A frequent analytic approach for determining the quality of formative assessment practices is to develop a coding scheme and determine frequencies with which the codes are observed; however,…

Descriptors: Sequential Approach, Formative Evaluation, Alternative Assessment, Incidence

Predicting Academic Achievement Growth among Low-Income Mexican American Learners Using Dynamic and Static Assessments

Peer reviewed

Matthews, Michael S.; Farmer, Jennie – Australasian Journal of Gifted Education, 2017

Dynamic assessment methods, initially developed by Feuerstein in the 1970s, have been recommended as being more equitable for identifying the academic abilities of students who may not perform well on traditional assessments due to these learners' cultural, linguistic, or economic differences from the population for whom the traditional measures…

Descriptors: Academic Achievement, Achievement Gains, Predictive Measurement, Hispanic American Students

Using Multiple Measures to Make Math Placement Decisions: Implications for Access and Success in Community Colleges

Peer reviewed

Ngo, Federick; Kwon, William W. – Research in Higher Education, 2015

Community college students are often placed in developmental math courses based on the results of a single placement test. However, concerns about accurate placement have recently led states and colleges across the country to consider using other measures to inform placement decisions. While the relationships between college outcomes and such…

Descriptors: Access to Education, Success, Community Colleges, Mathematics Education

Is the EdTPA the Right Choice for Evaluating Teacher Readiness?

Peer reviewed

Parkes, Kelly A.; Powell, Sean R. – Arts Education Policy Review, 2015

The purpose of this article is to describe and analyze the edTPA, a performance assessment created by the Stanford Center for Assessment, Learning, and Equity (SCALE) and administered by Pearson, Inc., to assess the professional readiness of student teachers. We challenge claims made in support of using this assessment, specifically within the…

Descriptors: Teacher Evaluation, Performance Based Assessment, Student Teacher Evaluation, Evaluation Methods
