ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	6

Descriptor

Computer Assisted Testing	9
Interrater Reliability	9
Statistical Analysis	9
Second Language Learning	6
English (Second Language)	5
Language Tests	5
Evaluators	4
Correlation	3
Language Proficiency	3
Oral Language	3
Scores	3
Scoring	3
Writing Evaluation	3
Chinese	2
Comparative Analysis	2
Computer Software	2
Difficulty Level	2
Educational Technology	2
Essays	2
Foreign Countries	2
Item Analysis	2
Measurement Techniques	2
Multiple Regression Analysis	2
Prediction	2
Scoring Rubrics	2
More ▼

Source

Applied Measurement in…	1
ETS Research Report Series	1
Educational Research and…	1
Journal of Applied Testing…	1
Language Assessment Quarterly	1
Language Testing	1
National Center for Education…	1
ProQuest LLC	1

Publication Type

Journal Articles	6
Reports - Research	5
Reports - Evaluative	2
Books	1
Collected Works - General	1
Dissertations/Theses -…	1
Numerical/Quantitative Data	1

Education Level

Grade 11	1
Higher Education	1
Postsecondary Education	1

Audience

Practitioners	1
Teachers	1

Location

Hong Kong	1
Israel	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Validating Human and Automated Scoring of Essays against "True" Scores

Peer reviewed

Direct link

Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018

In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to the essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…

Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing

The Influence of Training and Experience on Rater Performance in Scoring Spoken Language

Peer reviewed

Direct link

Davis, Larry – Language Testing, 2016

Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…

Descriptors: Evaluators, Oral Language, Scores, Language Tests

The Role of Lexical Properties and Cohesive Devices in Text Integration and Their Effect on Human Ratings of Speaking Proficiency

Peer reviewed

Direct link

Crossley, Scott; Clevinger, Amanda; Kim, YouJin – Language Assessment Quarterly, 2014

There has been a growing interest in the use of integrated tasks in the field of second language testing to enhance the authenticity of language tests. However, the role of text integration in test takers' performance has not been widely investigated. The purpose of the current study is to examine the effects of text-based relational (i.e.,…

Descriptors: Language Proficiency, Connected Discourse, Language Tests, English (Second Language)

Developing Analytic Rating Guides for "TOEFL iBT"® Integrated Speaking Tasks. "TOEFL iBT"® Research Report, TOEFL iBT-20. ETS Research Report. RR-13-13

Peer reviewed
PDF on ERIC

Download full text

Jamieson, Joan; Poonpon, Kornwipa – ETS Research Report Series, 2013

Research and development of a new type of scoring rubric for the integrated speaking tasks of "TOEFL iBT"® are described. These "analytic rating guides" could be helpful if tasks modeled after those in TOEFL iBT were used for formative assessment, a purpose which is different from TOEFL iBT's primary use for admission…

Descriptors: Oral Language, Language Proficiency, Scaling, Scores

A Comparison of Onscreen and Paper-Based Marking in the Hong Kong Public Examination System

Peer reviewed

Direct link

Coniam, David – Educational Research and Evaluation, 2009

This paper describes a study comparing paper-based marking (PBM) and onscreen marking (OSM) in Hong Kong utilising English language essay scripts drawn from the live 2007 Hong Kong Certificate of Education Examination (HKCEE) Year 11 English Language Writing Paper. In the study, 30 raters from the 2007 HKCEE Writing Paper marked on paper 100…

Descriptors: Student Attitudes, Foreign Countries, Essays, Comparative Analysis

Speech Recognition Software for Language Learning: Toward an Evaluation of Validity and Student Perceptions

Direct link

Cordier, Deborah – ProQuest LLC, 2009

A renewed focus on foreign language (FL) learning and speech for communication has resulted in computer-assisted language learning (CALL) software developed with Automatic Speech Recognition (ASR). ASR features for FL pronunciation (Lafford, 2004) are functional components of CALL designs used for FL teaching and learning. The ASR features…

Descriptors: Feedback (Response), Computer Assisted Instruction, Validity, Computer Software

Online Assessment in Mathematics and Writing: Reports from the NAEP Technology-Based Assessment Project, Research and Development Series. NCES 2005-457

Peer reviewed
PDF on ERIC

Download full text

Sandene, Brent; Horkay, Nancy; Bennett, Randy Elliot; Allen, Nancy; Braswell, James; Kaplan, Bruce; Oranje, Andreas – National Center for Education Statistics, 2005

This publication presents the reports from two studies, Math Online (MOL) and Writing Online (WOL), part of the National Assessment of Educational Progress (NAEP) Technology-Based Assessment (TBA) project. Funded by the National Center for Education Statistics (NCES), the Technology-Based Assessment project is intended to explore the use of new…

Descriptors: Grade 8, Statistical Analysis, Scoring, Familiarity

Evaluating Computer Automated Scoring: Issues, Methods, and an Empirical Illustration

Peer reviewed

Direct link

Yang, Yongwei; Buckendahl, Chad W.; Juszkiewicz, Piotr J.; Bhola, Dennison S. – Journal of Applied Testing Technology, 2005

With the continual progress of computer technologies, computer automated scoring (CAS) has become a popular tool for evaluating writing assessments. Research of applications of these methodologies to new types of performance assessments is still emerging. While research has generally shown a high agreement of CAS system generated scores with those…

Descriptors: Scoring, Validity, Interrater Reliability, Comparative Analysis

Technology and Language Testing. A Collection of Papers from the Annual Colloquium on Language Testing Research (7th, Princeton, New Jersey, April 6-9, 1985).

Stansfield, Charles W., Ed. – 1986

This collection of essays on measurement theory and language testing includes: "Computerized Adaptive Testing: Implications for Language Test Developers" (Peter Tung); "The Promise and Threat of Computerized Adaptive Assessment of Reading Comprehension" (Michael Canale); "Computerized Rasch Analysis of Item Bias in ESL…

Descriptors: Chinese, Cloze Procedure, Computer Assisted Testing, Computer Software

Allen, Nancy	1
Ben-Simon, Anat	1
Bennett, Randy Elliot	1
Bhola, Dennison S.	1
Braswell, James	1
Buckendahl, Chad W.	1
Clevinger, Amanda	1
Cohen, Yoav	1
Coniam, David	1
Cordier, Deborah	1
Crossley, Scott	1
Davis, Larry	1
Horkay, Nancy	1
Jamieson, Joan	1
Juszkiewicz, Piotr J.	1
Kaplan, Bruce	1
Kim, YouJin	1
Levi, Effi	1
Oranje, Andreas	1
Poonpon, Kornwipa	1
Sandene, Brent	1
Stansfield, Charles W., Ed.	1
Yang, Yongwei	1
More ▼