ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	10
Since 2007 (last 20 years)	24

Descriptor

Interrater Reliability	46
Scoring	46
Test Validity	46
Test Reliability	30
Test Construction	22
Language Tests	12
Evaluation Methods	11
Psychometrics	11
Test Items	11
Student Evaluation	10
Correlation	9
English (Second Language)	9
Writing Evaluation	9
Foreign Countries	8
Computer Assisted Testing	7
Higher Education	7
Scores	7
Test Use	7
Elementary School Students	6
Evaluators	6
Performance Based Assessment	6
Testing	6
Error of Measurement	5
Language Proficiency	5
Measurement Techniques	5
More ▼

Publication Type

Reports - Research	24
Journal Articles	23
Reports - Descriptive	8
Reports - Evaluative	7
Speeches/Meeting Papers	7
Numerical/Quantitative Data	4
Tests/Questionnaires	4
Information Analyses	3
Guides - Non-Classroom	2
Dissertations/Theses -…	1
Guides - General	1
Opinion Papers	1
More ▼

Education Level

Higher Education	6
Elementary Secondary Education	4
Elementary Education	3
Middle Schools	3
Postsecondary Education	3
High Schools	2
Junior High Schools	2
Secondary Education	2
Early Childhood Education	1
Grade 1	1
Grade 2	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Intermediate Grades	1
Kindergarten	1
Primary Education	1
More ▼

Audience

Researchers	3
Practitioners	2
Administrators	1
Teachers	1

Location

New Mexico	2
Turkey	2
Australia	1
Canada	1
China	1
Colombia	1
Germany	1
India	1
Israel	1
Japan	1
Jordan	1
Mexico	1
New York	1
South Korea	1
United Kingdom (England)	1
United Kingdom (London)	1
United States	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	1
No Child Left Behind Act 2001	1
Race to the Top	1

Assessments and Surveys

Test of English as a Foreign…	5
ACT Assessment	1
Child Behavior Checklist	1
Clinical Evaluation of…	1
Graduate Record Examinations	1
National Assessment of…	1
Raven Progressive Matrices	1
SAT (College Admission Test)	1
Strengths and Difficulties…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 46 results Save | Export

Developing a Validity Argument Case for Locally Developed University English Preparedness Testing from an Ethical Perspective

Direct link

Lynsey Joohyun Lee – ProQuest LLC, 2021

Reliability and validity are two important topics that have been studied for many decades in the educational measurement field, including discussions of Writing Studies' subfield of writing assessment, since the establishment of the College Entrance Exam Board [CEEB] in 1899 (Huot et al., 2010). In recent years, scholarly conversations of fairness…

Descriptors: Writing Evaluation, Test Validity, Test Reliability, Case Studies

Development of Gazi Functional Vision Assessment Instrument

Peer reviewed
PDF on ERIC

Download full text

Safak, Pinar; Cakmak, Salih; Karakoc, Tamer; Aydin O'Dwyer, Pinar – European Journal of Educational Research, 2021

This study aimed to develop a valid and reliable instrument that measures the functional vision of students with low vision. Thus, an assessment tool and performance activities were developed for three vision skill groups (near vision skills, distance vision skills, and visual field) that include functional vision skills. The universe was 1485…

Descriptors: Foreign Countries, Vision Tests, Diagnostic Tests, Vision

Online Administration of the Test of Narrative Language--Second Edition: Psychometrics and Considerations for Remote Assessment

Peer reviewed
PDF on ERIC

Download full text

Direct link

Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Grantee Submission, 2022

Purpose: Our aim was to evaluate the psychometric properties of the online administered format of the Test of Narrative Language--Second Edition (TNL-2; Gillam & Pearson, 2017), given the importance of assessing children's narrative ability and considerable absence of psychometric studies of spoken language assessments administered online.…

Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments

Online Administration of the Test of Narrative Language--Second Edition: Psychometrics and Considerations for Remote Assessment

Peer reviewed
PDF on ERIC

Download full text

Direct link

Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Language, Speech, and Hearing Services in Schools, 2022

Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments

Applying Kane's Validity Framework to a Simulation Based Assessment of Clinical Competence

Peer reviewed

Direct link

Tavares, Walter; Brydges, Ryan; Myre, Paul; Prpic, Jason; Turner, Linda; Yelle, Richard; Huiskamp, Maud – Advances in Health Sciences Education, 2018

Assessment of clinical competence is complex and inference based. Trustworthy and defensible assessment processes must have favourable evidence of validity, particularly where decisions are considered high stakes. We aimed to organize, collect and interpret validity evidence for a high stakes simulation based assessment strategy for certifying…

Descriptors: Competence, Simulation, Allied Health Personnel, Certification

Developing an Innovative Elicited Imitation Task for Efficient English Proficiency Assessment. TOEFL® Research Report. RR-96. ETS RR-21-24

Peer reviewed
PDF on ERIC

Download full text

Davis, Larry; Norris, John – ETS Research Report Series, 2021

The elicited imitation task (EIT), in which language learners listen to a series of spoken sentences and repeat each one verbatim, is a commonly used measure of language proficiency in second language acquisition research. The "TOEFL® Essentials"™ test includes an EIT as a holistic measure of speaking proficiency, referred to as the…

Descriptors: Task Analysis, Language Proficiency, Speech Communication, Language Tests

Validating Human and Automated Scoring of Essays against "True" Scores

Peer reviewed

Direct link

Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018

In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to the essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…

Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing

Adaptation and Validation of a Test of Ethical Sensitivity in Teaching

Peer reviewed

Direct link

Maxwell, Bruce; Boon, Helen; Tanchuk, Nicolas; Rauwerda, Bryan – Journal of Moral Education, 2021

This article documents the adaptation, piloting and validation of a measure of teachers' ethical sensitivity. To create the test, we modified a measure from dentistry drawing on literature in teacher professional ethics and drew on the expertise of professional ethics scholars and practitioners. Based on the results of Rasch analysis combined with…

Descriptors: Ethics, Moral Values, Scores, Teacher Education Programs

Development and Validation of the Written Communication Assessment of the "HEIghten"® Outcomes Assessment Suite. Research Report. ETS RR-17-53

Peer reviewed
PDF on ERIC

Download full text

Rios, Joseph A.; Sparks, Jesse R.; Zhang, Mo; Liu, Ou Lydia – ETS Research Report Series, 2017

Proficiency with written communication (WC) is critical for success in college and careers. As a result, institutions face a growing challenge to accurately evaluate their students' writing skills to obtain data that can support demands of accreditation, accountability, or curricular improvement. Many current standardized measures, however, lack…

Descriptors: Test Construction, Test Validity, Writing Tests, College Outcomes Assessment

Constructing a Validity Argument for the Objective Structured Assessment of Technical Skills (OSATS): A Systematic Review of Validity Evidence

Peer reviewed

Direct link

Hatala, Rose; Cook, David A.; Brydges, Ryan; Hawkins, Richard – Advances in Health Sciences Education, 2015

In order to construct and evaluate the validity argument for the Objective Structured Assessment of Technical Skills (OSATS), based on Kane's framework, we conducted a systematic review. We searched MEDLINE, EMBASE, CINAHL, PsycINFO, ERIC, Web of Science, Scopus, and selected reference lists through February 2013. Working in duplicate, we selected…

Descriptors: Measures (Individuals), Test Validity, Surgery, Skills

ITC Guidelines for the Large-Scale Assessment of Linguistically and Culturally Diverse Populations

Peer reviewed

Direct link

International Journal of Testing, 2019

These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…

Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage

Comparison of Integrated Testlet and Constructed-Response Question Formats

Peer reviewed

Direct link

Slepkov, Aaron D.; Shiell, Ralph C. – Physical Review Special Topics - Physics Education Research, 2014

Constructed-response (CR) questions are a mainstay of introductory physics textbooks and exams. However, because of the time, cost, and scoring reliability constraints associated with this format, CR questions are being increasingly replaced by multiple-choice (MC) questions in formal exams. The integrated testlet (IT) is a recently developed…

Descriptors: Science Tests, Physics, Responses, Multiple Choice Tests

A Reliable and Valid Weighted Scoring Instrument for Use in Grading APA-Style Empirical Research Report

Peer reviewed

Direct link

Greenberg, Kathleen Puglisi – Teaching of Psychology, 2012

The scoring instrument described in this article is based on a deconstruction of the seven sections of an American Psychological Association (APA)-style empirical research report into a set of learning outcomes divided into content-, expression-, and format-related categories. A double-weighting scheme used to score the report yields a final grade…

Descriptors: Scoring, Research Reports, Grading, Outcome Measures

Testing to the Top: Everything But the Kitchen Sink?

Direct link

Dietel, Ron – Phi Delta Kappan, 2011

Two tests intended to measure student achievement of the Common Core State Standards will face intense scrutiny, but the test makers say they will include performance assessments and other items that are not multiple-choice questions. Incorporating performance items on this tests will bring up issues over scoring, costs, and validity.

Descriptors: Student Evaluation, State Standards, Test Construction, Intellectual Property

Using Calibrated Exemplars in the Teacher-Assessment of Writing: An Empirical Study

Peer reviewed

Direct link

Heldsinger, Sandra A.; Humphry, Stephen M. – Educational Research, 2013

Background: Many in education argue for the importance of incorporating teacher judgements in the assessment and reporting of student performance. Advocates of such an approach are cognisant, though, that obtaining a satisfactory level of consistency in teacher judgements poses a challenge. Purpose: This study investigates the extent to which the…

Descriptors: Evaluation Methods, Student Evaluation, Teacher Attitudes, Comparative Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

ETS Research Report Series	3
Advances in Health Sciences…	2
Applied Measurement in…	2
New Mexico Public Education…	2
Assessment	1
Educational Research	1
Educational Testing Service	1
European Journal of…	1
Grantee Submission	1
International Journal of…	1
International Journal of…	1
Journal of Chemical Education	1
Journal of Moral Education	1
Journal of Psychoeducational…	1
Language, Speech, and Hearing…	1
Modern Language Journal	1
New York State Education…	1
Phi Delta Kappan	1
Physical Educator	1
Physical Review Special…	1
ProQuest LLC	1
Research Papers in Education	1
Teaching of Psychology	1
More ▼

Anna-Maria Fall	2
Bejar, Isaac I.	2
Beula M. Magimairaj	2
Brydges, Ryan	2
Greg Roberts	2
Philip Capin	2
Ronald B. Gillam	2
Sandra L. Gillam	2
Sharon Vaughn	2
Alderson, J. Charles	1
Alverez de Santizo, Myrna…	1
Apache, R. R.	1
Aydin O'Dwyer, Pinar	1
Ben-Simon, Anat	1
Bennett, Randy Elliot	1
Boccaccini, Marcus T.	1
Boon, Helen	1
Botting, Nicola	1
Breland, Hunter M.	1
Brooks, Val	1
Brown, William L.	1
Cakmak, Salih	1
Camp, Roberta	1
Carlson, Sybil B.	1
More ▼