Showing all 7 results
Peer reviewed
Attali, Yigal; Lewis, Will; Steier, Michael – Language Testing, 2013
Automated essay scoring can produce reliable scores that are highly correlated with human scores, but is limited in its evaluation of content and other higher-order aspects of writing. The increased use of automated essay scoring in high-stakes testing underscores the need for human scoring that is focused on higher-order aspects of writing. This…
Descriptors: Scoring, Essay Tests, Reliability, High Stakes Tests
Ricker-Pedley, Kathryn L. – Educational Testing Service, 2011
A pseudo-experimental study was conducted to examine the link between rater accuracy calibration performances and subsequent accuracy during operational scoring. The study asked 45 raters to score a 75-response calibration set and then a 100-response (operational) set of responses from a retired Graduate Record Examinations® (GRE®) writing…
Descriptors: Scoring, Accuracy, College Entrance Examinations, Writing Tests
Peer reviewed
PDF on ERIC
Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015
The e-rater® automated essay scoring system is used operationally to score the argument and issue tasks that form the Analytical Writing measure of the GRE® General Test. This study explored the value added of reporting four trait scores for each of these two tasks over the total e-rater score.…
Descriptors: Scores, Computer Assisted Testing, Computer Software, Grammar
Peer reviewed
Davison, Mark L.; Semmes, Robert; Huang, Lan; Close, Catherine N. – Educational and Psychological Measurement, 2012
Data from 181 college students were used to assess whether math reasoning item response times in computerized testing can provide valid and reliable measures of a speed dimension. The alternate forms reliability of the speed dimension was .85. A two-dimensional structural equation model suggests that the speed dimension is related to the accuracy…
Descriptors: Computer Assisted Testing, Reaction Time, Reliability, Validity
Peer reviewed
Semmes, Robert; Davison, Mark L.; Close, Catherine – Applied Psychological Measurement, 2011
If numerical reasoning items are administered under time limits, will two dimensions be required to account for the responses, a numerical ability dimension and a speed dimension? A total of 182 college students answered 74 numerical reasoning items. Every item was taken with and without time limits by half the students. Three psychometric models…
Descriptors: Individual Differences, Logical Thinking, Timed Tests, College Students
Peer reviewed
PDF on ERIC
Attali, Yigal; Powers, Don; Hawthorn, John – ETS Research Report Series, 2008
Registered examinees for the GRE® General Test answered open-ended sentence-completion items. For half of the items, participants received immediate feedback on the correctness of their answers and up to two opportunities to revise their answers. A significant feedback-and-revision effect was found. Participants were able to correct many of their…
Descriptors: College Entrance Examinations, Graduate Study, Sentences, Psychometrics
Peer reviewed
Brooke, Stephanie L. – Measurement and Evaluation in Counseling and Development, 1995
Provides an evaluation of Cliffs' GRE StudyWare package (Bobrow, 1992). Discusses the educational implications of Cliffs' approach and examines software considerations. Makes recommendations concerning Cliffs' method of Graduate Record Examination (GRE) preparation. (Author/LKS)
Descriptors: Achievement Tests, Computer Assisted Instruction, Computer Software Reviews, Computer Uses in Education