ERIC - Search Results

Showing 1 to 15 of 247 results

Towards a More Nuanced Conceptualisation of Differential Examiner Stringency in OSCEs

Peer reviewed

Matt Homer – Advances in Health Sciences Education, 2024

Quantitative measures of systematic differences in OSCE scoring across examiners (often termed examiner stringency) can threaten the validity of examination outcomes. Such effects are usually conceptualised and operationalised based solely on checklist/domain scores in a station, and global grades are not often used in this type of analysis. In…

Descriptors: Examiners, Scoring, Validity, Cutting Scores

Validity Arguments for AI-Based Automated Scores: Essay Scoring as an Illustration

Peer reviewed

Ferrara, Steve; Qunbar, Saed – Journal of Educational Measurement, 2022

In this article, we argue that automated scoring engines should be transparent and construct relevant--that is, as much as is currently feasible. Many current automated scoring engines cannot achieve high degrees of scoring accuracy without allowing in some features that may not be easily explained and understood and may not be obviously and…

Descriptors: Artificial Intelligence, Scoring, Essays, Automation

Psychometric Evaluation of an Alternate Scoring for the Remote Associates Test

Peer reviewed

Beisemann, Marie; Forthmann, Boris; Bürkner, Paul-Christian; Holling, Heinz – Journal of Creative Behavior, 2020

The Remote Associates Test (RAT; Mednick, 1962; Mednick & Mednick, 1967) is a commonly employed test of creative convergent thinking. The RAT is scored with a dichotomous scoring, scoring correct answers as 1 and all other answers as 0. Based on recent research into the information processing underlying RAT performance, we argued that the…

Descriptors: Psychometrics, Scoring, Tests, Semantics

A Method for Identifying Partial Test-Taking Engagement

Peer reviewed

Wise, Steven; Kuhfeld, Megan – Applied Measurement in Education, 2021

Effort-moderated (E-M) scoring is intended to estimate how well a disengaged test taker would have performed had they been fully engaged. It accomplishes this adjustment by excluding disengaged responses from scoring and estimating performance from the remaining responses. The scoring method, however, assumes that the remaining responses are not…

Descriptors: Scoring, Achievement Tests, Identification, Validity

Anchoring Validity Evidence for Automated Essay Scoring

Peer reviewed

Shermis, Mark D. – Journal of Educational Measurement, 2022

One of the challenges of discussing validity arguments for machine scoring of essays centers on the absence of a commonly held definition and theory of good writing. At best, the algorithms attempt to measure select attributes of writing and calibrate them against human ratings with the goal of accurate prediction of scores for new essays.…

Descriptors: Scoring, Essays, Validity, Writing Evaluation

Development and Validation of a Short-Form Inventory to Identify Personality Types: The Personality Identity Estimator (PIE)

Peer reviewed

Conti, Gary J. – Journal of Education and Learning, 2023

The use of personality inventories has been limited because of their cost and the length. To overcome these limitations, this study created the Personality Identity Estimator (PIE), an easy-to-use inventory to estimate personality types that can be used at no cost. PIE is a categorical inventory containing 12 items with 3 items for each of the 4…

Descriptors: Personality Measures, Personality Traits, Validity, Reliability

Reflections on the Application and Validation of Technology in Language Testing

Peer reviewed

Barry O'Sullivan – Language Assessment Quarterly, 2023

This paper highlights as issues of concern the rapid changes in technology and the tendency to report on partial validation efforts where the work is not identified as forming part of a larger validation project. With close human supervision emerging technologies can have a significant and positive impact on language testing. While technology…

Descriptors: Technology Uses in Education, Computer Assisted Testing, Language Tests, Supervision

Strengthening the Pennsylvania School Climate Survey to Inform School Decisionmaking. REL 2024-006

Peer reviewed

Alyson Burnett; Katlyn Lee Milless; Michelle Bennett; Whitney Kozakowski; Sonia Alves; Christine Ross – Regional Educational Laboratory Mid-Atlantic, 2024

This study analyzed Pennsylvania School Climate Survey data from students and staff in the 2021/22 school year to assess the validity and reliability of the elementary school student version of the survey; approaches to scoring the survey in individual schools at all grade levels; and perceptions of school climate across student, staff, and school…

Descriptors: Educational Environment, Decision Making, Surveys, Validity

On the Limitations of Human-Computer Agreement in Automated Essay Scoring

Peer reviewed

Doewes, Afrizal; Pechenizkiy, Mykola – International Educational Data Mining Society, 2021

Scoring essays is generally an exhausting and time-consuming task for teachers. Automated Essay Scoring (AES) facilitates the scoring process to be faster and more consistent. The most logical way to assess the performance of an automated scorer is by measuring the score agreement with the human raters. However, we provide empirical evidence that…

Descriptors: Man Machine Systems, Automation, Computer Assisted Testing, Scoring

Observation Studies in Special Education: A Synthesis of Validity Evidence for Observation Systems

Peer reviewed

Rodgers, Wendy J.; Morris-Mathews, Hannah; Romig, John Elwood; Bettini, Elizabeth – Review of Educational Research, 2022

Classroom observation research plays an important role in policy, practice, and scholarship for students with disabilities. When interpreting results of observation studies, it is important to consider the validity evidence provided by researchers and how that speaks to the intended use of those results. In this literature synthesis, we used…

Descriptors: Special Education, Validity, Classroom Research, Students with Disabilities

Comparative Judgement, Proof Summaries and Proof Comprehension

Peer reviewed

Davies, Ben; Alcock, Lara; Jones, Ian – Educational Studies in Mathematics, 2020

Proof is central to mathematics and has drawn substantial attention from the mathematics education community. Yet, valid and reliable measures of proof comprehension remain rare. In this article, we present a study investigating proof comprehension via students' summaries of a given proof. These summaries were evaluated by expert judges making…

Descriptors: Mathematical Logic, Mathematics Skills, Comprehension, Reliability

Modeling Writing Traits in a Formative Essay Corpus. Research Report. ETS RR-24-02

Peer reviewed

Paul Deane; Duanli Yan; Katherine Castellano; Yigal Attali; Michelle Lamar; Mo Zhang; Ian Blood; James V. Bruno; Chen Li; Wenju Cui; Chunyi Ruan; Colleen Appel; Kofi James; Rodolfo Long; Farah Qureshi – ETS Research Report Series, 2024

This paper presents a multidimensional model of variation in writing quality, register, and genre in student essays, trained and tested via confirmatory factor analysis of 1.37 million essay submissions to ETS' digital writing service, Criterion®. The model was also validated with several other corpora, which indicated that it provides a…

Descriptors: Writing (Composition), Essays, Models, Elementary School Students

Using Think-Aloud Interviews to Examine a Clinically Oriented Performance Assessment Rubric

Peer reviewed

Roduta Roberts, Mary; Gotch, Chad M.; Cook, Megan; Werther, Karin; Chao, Iris C. I. – Measurement: Interdisciplinary Research and Perspectives, 2022

Performance-based assessment is a common approach to assess the development and acquisition of practice competencies among health professions students. Judgments related to the quality of performance are typically operationalized as ratings against success criteria specified within a rubric. The extent to which the rubric is understood,…

Descriptors: Protocol Analysis, Scoring Rubrics, Interviews, Performance Based Assessment

Psychometric Models for Scoring Multiple Reporter Assessments: Applications to Integrative Data Analysis in Prevention Science and Beyond

Peer reviewed

Curran, Patrick J.; Georgeson, A. R.; Bauer, Daniel J.; Hussong, Andrea M. – International Journal of Behavioral Development, 2021

Conducting valid and reliable empirical research in the prevention sciences is an inherently difficult and challenging task. Chief among these is the need to obtain numerical scores of underlying theoretical constructs for use in subsequent analysis. This challenge is further exacerbated by the increasingly common need to consider multiple…

Descriptors: Psychometrics, Scoring, Prevention, Scores

Evaluating an Explicit Instruction Teacher Observation Protocol through a Validity Argument Approach

Peer reviewed

Johnson, Evelyn S.; Zheng, Yuzhu; Crawford, Angela R.; Moylan, Laura A. – Journal of Experimental Education, 2022

In this study, we examined the scoring and generalizability assumptions of an explicit instruction (EI) special education teacher observation protocol using many-faceted Rasch measurement (MFRM). Video observations of classroom instruction from 48 special education teachers across four states were collected. External raters (n = 20) were trained…

Descriptors: Direct Instruction, Teacher Education, Classroom Observation Techniques, Validity

