ERIC - Search Results

Showing all 8 results

The Sensitivity of Value-Added Estimates to Test Scoring Decisions. EdWorkingPaper No. 25-1226

Joshua B. Gilbert; James G. Soland; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2025

Value-Added Models (VAMs) are both common and controversial in education policy and accountability research. While the sensitivity of VAMs to model specification and covariate selection is well documented, the extent to which test scoring methods (e.g., mean scores vs. IRT-based scores) may affect VA estimates is less studied. We examine the…

Descriptors: Value Added Models, Tests, Testing, Scoring

Using Test Scores to Evaluate and Hold School Teachers Accountable in New Mexico

Peer reviewed

Geiger, Tray J.; Amrein-Beardsley, Audrey; Holloway, Jessica – Educational Assessment, Evaluation and Accountability, 2020

For this study, researchers critically reviewed documents pertaining to the highest profile of the 15 teacher evaluation lawsuits that occurred throughout the U.S. as pertaining to the use of student test scores to evaluate teachers. In New Mexico, teacher plaintiffs contested how they were being evaluated and held accountable using a homegrown…

Descriptors: Court Litigation, Teacher Responsibility, Accountability, Value Added Models

Exploration of Factors Affecting the Added Value of Test Subscores

Peer reviewed

Wang, Xiaolin; Svetina, Dubravka; Dai, Shenghai – Journal of Experimental Education, 2019

Recently, interest in test subscore reporting for diagnosis purposes has been growing rapidly. The two simulation studies here examined factors (sample size, number of subscales, correlation between subscales, and three factors affecting subscore reliability: number of items per subscale, item parameter distribution, and data generating model)…

Descriptors: Value Added Models, Scores, Sample Size, Correlation

Statistical Properties of the "GRE"® Psychology Test Subscores. ETS GRE® Board Research Report. ETS GRE®-18-02. ETS Research Report. RR-18-19

Peer reviewed

Liu, Yuming; Robin, Frédéric; Yoo, Hanwook; Manna, Venessa – ETS Research Report Series, 2018

The "GRE"® Psychology test is an achievement test that measures core knowledge in 12 content domains that represent the courses commonly offered at the undergraduate level. Currently, a total score and 2 subscores, experimental and social, are reported to test takers as well as graduate institutions. However, the American Psychological…

Descriptors: College Entrance Examinations, Graduate Study, Psychological Testing, Scores

Do the TOEFL iBT® Section Scores Provide Value-Added Information to Stakeholders?

Peer reviewed

Sawaki, Yasuyo; Sinharay, Sandip – Language Testing, 2018

The present study examined the reliability of the reading, listening, speaking, and writing section scores for the TOEFL iBT® test and their interrelationship in order to collect empirical evidence to support, respectively, the "generalization" inference and the "explanation" inference in the TOEFL iBT validity argument…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing

All Sizzle and No Steak: Value-Added Model Doesn't Add Value in Houston

Amrein-Beardsley, Audrey; Geiger, Tray – Phi Delta Kappan, 2017

Houston's experience with the Educational Value-Added Assessment System® (EVAAS) raises questions that other districts should consider before buying the software and using it for high-stakes decisions. Researchers found that teachers in Houston, all of whom were under the EVAAS gun, but who taught relatively more racial minority students,…

Descriptors: Value Added Models, School Districts, Computer Software, Educational Technology

Measuring Teachers' Effectiveness: A Report from Phase 3 of Pennsylvania's Pilot of the Framework for Teaching. Final Report

Lipscomb, Stephen; Terziev, Jeffrey; Chaplin, Duncan – Mathematica Policy Research, Inc., 2015

Like many states throughout the nation, Pennsylvania is in the midst of major reforms to its teacher evaluation system. Under the new system, the state will base annual evaluations on several measures, including supervisor observations using the Framework for Teaching (FFT) and, for many teachers, their contributions to student achievement growth…

Descriptors: Teacher Effectiveness, Pilot Projects, Correlation, Value Added Models

Investigating the Value of Section Scores for the "TOEFL iBT"® Test. "TOEFL iBT"® Research Report. TOEFL iBT-21. ETS Research Report RR-13-35

Peer reviewed

Sawaki, Yasuyo; Sinharay, Sandip – ETS Research Report Series, 2013

This study investigates the value of reporting the reading, listening, speaking, and writing section scores for the "TOEFL iBT"® test, focusing on 4 related aspects of the psychometric quality of the TOEFL iBT section scores: reliability of the section scores, dimensionality of the test, presence of distinct score profiles, and the…

Descriptors: Scores, Computer Assisted Testing, Factor Analysis, Correlation
