ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	17

Descriptor

Educational Testing	30
Evaluation Methods	30
Scoring	30
Educational Assessment	16
Student Evaluation	13
Test Construction	9
Elementary Secondary Education	8
Test Interpretation	7
Computer Assisted Testing	6
Measurement	6
Psychometrics	6
Testing Problems	5
Achievement Tests	4
Data Analysis	4
Item Response Theory	4
Program Development	4
Program Effectiveness	4
Test Items	4
Test Use	4
Writing Evaluation	4
Academic Standards	3
Comparative Analysis	3
Criterion Referenced Tests	3
Disabilities	3
Essay Tests	3
More ▼

Source

Assessing Writing	2
Computers & Education	2
Journal of Educational…	2
ProQuest LLC	2
ASCD	1
Educational Assessment	1
Educational and Psychological…	1
Evaluation in Education:…	1
Journal of Career Assessment	1
Learning, Media and Technology	1
Measurement and Evaluation in…	1
Measurement:…	1
Ministerial Council on…	1
Nebraska Department of…	1
Pennsylvania Department of…	1
More ▼

Publication Type

Journal Articles	13
Guides - Non-Classroom	7
Reports - Research	7
Reports - Descriptive	5
Reports - Evaluative	4
Information Analyses	3
Books	2
Dissertations/Theses -…	2
Numerical/Quantitative Data	2
Guides - Classroom - Teacher	1
Opinion Papers	1
Reports - General	1
Speeches/Meeting Papers	1
More ▼

Education Level

Elementary Secondary Education	8
Postsecondary Education	3
Secondary Education	3
Elementary Education	2
Higher Education	2
Grade 6	1
Junior High Schools	1
Middle Schools	1

Audience

Practitioners	2
Researchers	1
Teachers	1

Location

Australia	2
United Kingdom	2
Canada	1
China	1
Hong Kong	1
India	1
Japan	1
Kentucky	1
Nebraska	1
New Jersey	1
North Carolina	1
Pennsylvania	1
South Korea	1
Taiwan	1
United States	1
Wyoming	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

Advanced Placement…	1
Program for International…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 30 results Save | Export

Resolving and Re-Scoring Constructed Response Items in Mixed-Format Assessments: An Exploration of Three Approaches

Peer reviewed

Direct link

Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024

We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…

Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners

2023-2024 NSCAS Growth: English Language Arts, Mathematics, and Science Technical Report

Download full text

Nebraska Department of Education, 2024

The Nebraska Student-Centered Assessment System (NSCAS) is a statewide assessment system that embodies Nebraska's holistic view of students and helps them prepare for success in postsecondary education, career, and civic life. It uses multiple measures throughout the year to provide educators and decision-makers at all levels with the insights…

Descriptors: Student Evaluation, Evaluation Methods, Elementary School Students, Middle School Students

Automated Essay Scoring: Psychometric Guidelines and Practices

Peer reviewed

Direct link

Ramineni, Chaitanya; Williamson, David M. – Assessing Writing, 2013

In this paper, we provide an overview of psychometric procedures and guidelines Educational Testing Service (ETS) uses to evaluate automated essay scoring for operational use. We briefly describe the e-rater system, the procedures and criteria used to evaluate e-rater, implications for a range of potential uses of e-rater, and directions for…

Descriptors: Educational Testing, Guidelines, Scoring, Psychometrics

The Automatic Assessment of Free Text Answers Using a Modified BLEU Algorithm

Peer reviewed

Direct link

Noorbehbahani, F.; Kardan, A. A. – Computers & Education, 2011

e-Learning plays an undoubtedly important role in today's education and assessment is one of the most essential parts of any instruction-based learning process. Assessment is a common way to evaluate a student's knowledge regarding the concepts related to learning objectives. In this paper, a new method for assessing the free text answers of…

Descriptors: Evaluation Methods, Educational Assessment, Student Evaluation, Scoring

Large-Scale Assessment, Locally-Developed Measures, and Automated Scoring of Essays: Fishing for Red Herrings?

Peer reviewed

Direct link

Condon, William – Assessing Writing, 2013

Automated Essay Scoring (AES) has garnered a great deal of attention from the rhetoric and composition/writing studies community since the Educational Testing Service began using e-rater[R] and the "Criterion"[R] Online Writing Evaluation Service as products in scoring writing tests, and most of the responses have been negative. While the…

Descriptors: Measurement, Psychometrics, Evaluation Methods, Educational Testing

The Evidence for a Subscore Structure in a Test of English Language Competency for English Language Learners

Peer reviewed

Direct link

Reckase, Mark D.; Xu, Jing-Ru – Educational and Psychological Measurement, 2015

How to compute and report subscores for a test that was originally designed for reporting scores on a unidimensional scale has been a topic of interest in recent years. In the research reported here, we describe an application of multidimensional item response theory to identify a subscore structure in a test designed for reporting results using a…

Descriptors: English, Language Skills, English Language Learners, Scores

How to Assess Higher-Order Thinking Skills in Your Classroom

Direct link

Brookhart, Susan M. – ASCD, 2010

Don't settle for assessing recall and comprehension only when you can use this guide to create assessments for higher-order thinking skills. Assessment expert Susan M. Brookhart brings you up to speed on how to develop and use test questions and other assessments that reveal how well your students can analyze, reason, solve problems, and think…

Descriptors: Test Items, Performance Based Assessment, Thinking Skills, Cognitive Processes

Test-Taking Strategy Use on the Reading Section of the TOEFL iBT: A Study of Arab ESL Learners

Direct link

Assiri, Mohammed S. – ProQuest LLC, 2011

With the focus on how a sample of 25 Arab ESL learners respond to the TOEFL-iBT reading tasks, this study aimed to find out what strategies respondents tend to use, investigate if there are differences between high- and low-scorers in strategy use, and determine aspects of effective strategy use among respondents. Data were collected using a…

Descriptors: Arabs, Program Effectiveness, Scoring, Data Analysis

Score Reporting in Teacher Certification Testing: A Review, Design, and Interview/Focus Group Study

Direct link

Klesch, Heather S. – ProQuest LLC, 2010

The reporting of scores on educational tests is at times misunderstood, misinterpreted, and potentially confusing to examinees and other stakeholders who may need to interpret test scores. In reporting test results to examinees, there is a need for clarity in the message communicated. As pressure rises for students to demonstrate performance at a…

Descriptors: Feedback (Response), Test Results, Focus Groups, Educational Testing

Comparison of Examination Methods Based on Multiple-Choice Questions and Constructed-Response Questions Using Personal Computers

Peer reviewed

Direct link

Ventouras, Errikos; Triantis, Dimos; Tsiakas, Panagiotis; Stergiopoulos, Charalampos – Computers & Education, 2010

The aim of the present research was to compare the use of multiple-choice questions (MCQs) as an examination method, to the examination based on constructed-response questions (CRQs). Despite that MCQs have an advantage concerning objectivity in the grading process and speed in production of results, they also introduce an error in the final…

Descriptors: Computer Assisted Instruction, Scoring, Grading, Comparative Analysis

The 2008-2009 Pennsylvania System of School Assessment Handbook for Assessment Coordinators: Writing, Reading and Mathematics, Science

Download full text

Pennsylvania Department of Education, 2010

This handbook describes the responsibilities of district and school assessment coordinators in the administration of the Pennsylvania System of School Assessment (PSSA). This updated guidebook contains the following sections: (1) General Assessment Guidelines for All Assessments; (2) Writing Specific Guidelines; (3) Reading and Mathematics…

Descriptors: Guidelines, Guides, Educational Assessment, Writing Tests

Monitoring Rater Performance over Time: A Framework for Detecting Differential Accuracy and Differential Scale Category Use

Peer reviewed

Direct link

Myford, Carol M.; Wolfe, Edward W. – Journal of Educational Measurement, 2009

In this study, we describe a framework for monitoring rater performance over time. We present several statistical indices to identify raters whose standards drift and explain how to use those indices operationally. To illustrate the use of the framework, we analyzed rating data from the 2002 Advanced Placement English Literature and Composition…

Descriptors: English Literature, Advanced Placement, Measures (Individuals), Writing (Composition)

Judges' Use of Examinee Performance Data in an Angoff Standard-Setting Exercise for a Medical Licensing Examination: An Experimental Study

Peer reviewed

Direct link

Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009

Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…

Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel

From #2 Pencils to the World Wide Web: A History of Test Scoring

Peer reviewed

Direct link

Zytowski, Donald G. – Journal of Career Assessment, 2008

The present highly developed status of psychological and educational testing in the United States is in part the result of many efforts over the past 100 years to develop economical and reliable methods of scoring. The present article traces a number of methods, ranging from hand scoring to present-day computer applications, stimulated by the need…

Descriptors: Educational Testing, Achievement Tests, Computers, Scoring

How Much Can We Reliably Know about What Examinees Know?

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.

Descriptors: Scoring, Reliability, Validity, Classification

Previous Page | Next Page »

Pages: 1 | 2

Quenemoen, Rachel	2
Schafer, William D.	2
Thurlow, Martha	2
Agruso, Susan A.	1
Assiri, Mohammed S.	1
Baldwin, Su G.	1
Bell, Gregory	1
Brookhart, Susan M.	1
Bruno, James E.	1
Clauser, Brian E.	1
Condon, William	1
DIEDERICH, PAUL B.	1
Dillon, Gerard F.	1
Donovan, Jenny	1
Driscoll, Lydia Abell	1
Fagan, Barbara M.	1
Haberman, Shelby J.	1
Helms, Janet E.	1
Horst, Donald P.	1
Hutton, Penny	1
Johnson, Martin	1
Johnson, Robert L.	1
Kardan, A. A.	1
Klesch, Heather S.	1
More ▼