ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	5

Descriptor

Interrater Reliability	7
Scoring Formulas	7
Test Reliability	7
Grading	3
Higher Education	3
Accuracy	2
Error of Measurement	2
Essay Tests	2
Evaluation Methods	2
Scoring Rubrics	2
Test Validity	2
Writing Evaluation	2
Accounting	1
Achievement Tests	1
Adult Learning	1
Alignment (Education)	1
Alternative Assessment	1
Audiovisual Aids	1
Behavioral Objectives	1
Benchmarking	1
Capacity Building	1
College Entrance Examinations	1
College Freshmen	1
College Second Language…	1
Communities of Practice	1
More ▼

Source

Accounting Education	1
Educational Leadership	1
Educational Sciences: Theory…	1
Measurement and Evaluation in…	1
Working Papers in TESOL &…	1

Author

Aghbar, Ali-Asghar	1
Bardhoshi, Gerta	1
Barter, Alice K.	1
Beltrán, Jorge	1
Erdogan, Semra	1
Erford, Bradley T.	1
Guskey, Thomas R.	1
Hassall, Trevor	1
Herbert, Ian P.	1
Joyce, John	1
Jung, Lee Ann	1
Kaya, Irem Ersöz	1
Selvi, Hüseyin	1
Temel, Gülhan Orekici	1
More ▼

Publication Type

Journal Articles	5
Reports - Research	5
Tests/Questionnaires	4
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

Adult Education	1
Higher Education	1
Postsecondary Education	1

Audience

Location

New York (New York)	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 7 results Save | Export

Processes and Procedures for Estimating Score Reliability and Precision

Peer reviewed

Direct link

Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…

Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests

Grading: Why You Should Trust Your Judgment

Direct link

Guskey, Thomas R.; Jung, Lee Ann – Educational Leadership, 2016

Many educators consider grades calculated from statistical algorithms more accurate, objective, and reliable than grades they calculate themselves. But in this research, the authors first asked teachers to use their professional judgment to choose a summary grade for hypothetical students. When the researchers compared the teachers' grade with the…

Descriptors: Grading, Computer Assisted Testing, Interrater Reliability, Grades (Scholastic)

Investigation of Coefficient of Individual Agreement in Terms of Sample Size, Random and Monotone Missing Ratio, and Number of Repeated Measures

Peer reviewed
PDF on ERIC

Download full text

Temel, Gülhan Orekici; Erdogan, Semra; Selvi, Hüseyin; Kaya, Irem Ersöz – Educational Sciences: Theory and Practice, 2016

Studies based on longitudinal data focus on the change and development of the situation being investigated and allow for examining cases regarding education, individual development, cultural change, and socioeconomic improvement in time. However, as these studies require taking repeated measures in different time periods, they may include various…

Descriptors: Investigations, Sample Size, Longitudinal Studies, Interrater Reliability

Assessment in Higher Education: The Potential for a Community of Practice to Improve Inter-Marker Reliability

Peer reviewed

Direct link

Herbert, Ian P.; Joyce, John; Hassall, Trevor – Accounting Education, 2014

The design, delivery and assessment of a complete educational scheme, such as a degree programme or a professional qualification course, is a complex matter. Maintaining alignment between the stated aims of the curriculum and the scoring of student achievement is an overarching concern. The potential for drift across individual aspects of an…

Descriptors: Higher Education, Student Evaluation, Communities of Practice, Interrater Reliability

The Effects of Visual Input on Scoring a Speaking Achievement Test

Peer reviewed
PDF on ERIC

Download full text

Beltrán, Jorge – Working Papers in TESOL & Applied Linguistics, 2016

In the assessment of aural skills of second language learners, the study of the inclusion of visual stimuli has almost exclusively been conducted in the context of listening assessment. While the inclusion of contextual information in test input has been advocated for by numerous researchers (Ockey, 2010), little has been said regarding the…

Descriptors: Achievement Tests, Speech Skills, Speech Tests, Second Language Learning

A Comparison of Two Instruments for Evaluating Composition.

Barter, Alice K.; And Others – 1980

A follow-up study of two instruments for evaluating college writing was conducted. The experimental scale (E Scale) was developed in 1976 and revised for this study. The control scale (C Scale) was described in the literature in 1977. Ten English majors graded ten essays from diagnostic entrance exams. Both the E Scale and the C Scale were used,…

Descriptors: College Entrance Examinations, Comparative Testing, Essay Tests, Evaluation Criteria

Read-Comp as an Additional Measure of Writing Ability.

Aghbar, Ali-Asghar – 1986

The effectiveness of the "read-comp" technique in assessing writing ability and the usefulness of a rubric and procedure devised for scoring read-comp samples and essays were evaluated. Subjects were 100 freshman students enrolled in general and remedial English classes in a 6-week summer session at Indiana University of Pennsylvania.…

Descriptors: College Freshmen, Essay Tests, Evaluation Methods, Grading