Showing all 6 results
Peer reviewed
Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017
Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, α), test-retest, alternate forms, interscorer, and…
Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests
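The internal-consistency estimates named in the abstract above (of which coefficient α is the most common) can be illustrated with a short sketch. This is not the authors' method, just the standard α formula, k/(k-1) · (1 - Σ item variances / total-score variance), applied to hypothetical item scores:

```python
from statistics import pvariance

def cronbach_alpha(items):
    """Coefficient alpha for k item-score columns over the same examinees.

    items: list of k lists, each holding one item's scores across n examinees.
    """
    k = len(items)
    # Each examinee's total score across all items
    totals = [sum(scores) for scores in zip(*items)]
    item_var = sum(pvariance(col) for col in items)
    return k / (k - 1) * (1 - item_var / pvariance(totals))

# Hypothetical scores: 3 items, 5 examinees (illustrative data only)
items = [[2, 4, 3, 5, 1],
         [3, 5, 3, 4, 2],
         [2, 4, 4, 5, 1]]
print(round(cronbach_alpha(items), 3))  # → 0.936
```

Split-half reliability follows the same idea, correlating totals from two halves of the test instead of pooling item variances.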
Peer reviewed
Gaches, Sonya; Hill, Diana – Journal of Curriculum and Pedagogy, 2017
The CLASS (classroom assessment scoring system) has become integrally linked with quality rating and improvement systems (QRIS) throughout the United States and other international locations. This relationship reinforces the neoliberal consumer-based perspectives of quality and devalues localized perspectives. This article challenges the notion of…
Descriptors: Classroom Techniques, Evaluation Methods, Evaluation Research, Evaluation Problems
Peer reviewed
Van Hecke, Tanja – Teaching Mathematics and Its Applications, 2015
Optimal assessment tools should measure students' knowledge accurately and without bias within a limited time. Multiple-choice scoring offers one way to automate the scoring. This article compares scoring methods from a probabilistic point of view by modelling the probability to pass: number right scoring, the initial correction (IC) and…
Descriptors: Multiple Choice Tests, Error Correction, Grading, Evaluation Methods
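Two of the scoring rules the abstract above compares can be sketched concretely. Number right scoring counts only correct answers, while the classic correction for guessing (one form of corrected scoring; the data below are hypothetical, not from the article) subtracts W/(m-1) for W wrong answers on m-option items, so a pure guesser expects a score of zero:

```python
def number_right(correct, wrong):
    """Number right scoring: wrong answers carry no penalty."""
    return correct

def formula_score(correct, wrong, options):
    """Classic correction for guessing: R - W/(m-1) for m-option items."""
    return correct - wrong / (options - 1)

# Expected outcome for a pure guesser on 20 four-option items
n, m = 20, 4
exp_correct = n / m           # guesser expects n/m items right
exp_wrong = n - exp_correct
print(number_right(exp_correct, exp_wrong))      # → 5.0
print(formula_score(exp_correct, exp_wrong, m))  # → 0.0
```

The zero expected score under guessing is what makes formula scoring "unbiased" in the sense the abstract describes, at the cost of penalizing risk-averse examinees who omit items.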
Peer reviewed
Attali, Yigal – Applied Psychological Measurement, 2011
Recently, Attali and Powers investigated the usefulness of providing immediate feedback on the correctness of answers to constructed response questions and the opportunity to revise incorrect answers. This article introduces an item response theory (IRT) model for scoring revised responses to questions when several attempts are allowed. The model…
Descriptors: Feedback (Response), Item Response Theory, Models, Error Correction
Peer reviewed
PDF available on ERIC
Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M.; Davey, Tim; Bridgeman, Brent – ETS Research Report Series, 2012
Automated scoring models for the "e-rater"® scoring engine were built and evaluated for the "GRE"® argument and issue-writing tasks. Prompt-specific, generic, and generic with prompt-specific intercept scoring models were built and evaluation statistics such as weighted kappas, Pearson correlations, standardized difference in…
Descriptors: Scoring, Test Scoring Machines, Automation, Models
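Among the evaluation statistics the abstract above names, the quadratically weighted kappa is the standard agreement measure between human and automated scores. A minimal pure-Python sketch, using hypothetical score pairs rather than any GRE data:

```python
from collections import Counter

def quadratic_weighted_kappa(human, machine, min_s, max_s):
    """Quadratically weighted kappa between two raters' integer scores."""
    n = len(human)
    cats = range(min_s, max_s + 1)
    obs = Counter(zip(human, machine))   # joint (human, machine) counts
    h_marg = Counter(human)
    m_marg = Counter(machine)
    span = (max_s - min_s) ** 2
    num = den = 0.0
    for i in cats:
        for j in cats:
            w = (i - j) ** 2 / span      # quadratic disagreement weight
            num += w * obs[(i, j)] / n                       # observed
            den += w * (h_marg[i] / n) * (m_marg[j] / n)     # chance-expected
    return 1 - num / den

# Hypothetical 1-6 essay scores from a human rater and a scoring engine
human   = [4, 3, 5, 2, 4, 3, 5, 4]
machine = [4, 3, 4, 2, 5, 3, 5, 4]
print(round(quadratic_weighted_kappa(human, machine, 1, 6), 3))  # → 0.867
```

The quadratic weights penalize large human-machine discrepancies more than adjacent ones, which is why this statistic (alongside Pearson correlations and standardized mean differences) is conventional for evaluating automated scoring models.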
Laitsch, Dan – Association for Supervision and Curriculum Development, 2005
Standardized testing plays an increasingly important role in the lives of today's students and educators. The U.S. No Child Left Behind Act (NCLB) requires assessment in math and literacy in grades 3-8 and 10 and, as of 2007-08, in science once in grades 3-5, 6-9, and 10-12. Based on National Center for Education Statistics enrollment projections,…
Descriptors: Testing, Standardized Tests, Enrollment Projections, Accountability