ERIC - Search Results

Publication Date

In 2026	1
Since 2025	166
Since 2022 (last 5 years)	1019
Since 2017 (last 10 years)	2334
Since 2007 (last 20 years)	6520

Descriptor

Reliability	9759
Validity	3866
Foreign Countries	2823
Measures (Individuals)	1892
Correlation	1522
Factor Analysis	1460
Statistical Analysis	1278
Questionnaires	1084
Scores	1064
Student Attitudes	1034
Psychometrics	979
Evaluation Methods	965
Higher Education	917
Comparative Analysis	880
College Students	842
Models	673
Academic Achievement	648
Teaching Methods	646
Teacher Attitudes	636
Research Methodology	593
Rating Scales	555
Factor Structure	551
Construct Validity	549
Measurement Techniques	547
Undergraduate Students	540
More ▼

Education Level

Higher Education	2103
Postsecondary Education	1630
Secondary Education	864
Elementary Education	799
High Schools	462
Middle Schools	387
Elementary Secondary Education	378
Early Childhood Education	303
Junior High Schools	256
Primary Education	145
Preschool Education	142
Grade 5	124
Grade 8	124
Intermediate Grades	116
Grade 4	115
Grade 7	105
Grade 3	100
Grade 6	100
Kindergarten	96
Adult Education	87
Grade 1	72
Grade 2	65
Grade 9	65
Grade 10	55
Grade 11	48
More ▼

Audience

Researchers	181
Practitioners	101
Teachers	61
Administrators	42
Policymakers	33
Students	21
Counselors	10
Media Staff	5
Community	1
Parents	1
Support Staff	1
More ▼

Location

Turkey	454
Australia	155
Canada	144
China	127
United States	127
Taiwan	107
United Kingdom	100
Nigeria	98
California	95
Netherlands	91
Indonesia	86
United Kingdom (England)	86
Spain	79
Florida	73
Hong Kong	70
Malaysia	69
Germany	66
Iran	62
South Korea	62
New York	56
Texas	56
Jordan	54
Pennsylvania	54
India	49
Greece	48
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	4
Does not meet standards	2

Reliability X

Showing 46 to 60 of 9,759 results Save | Export

Reliability of Measuring Constructs in Applied Linguistics Research: A Comparative Study of Domestic and International Graduate Theses

Peer reviewed

Direct link

Razavipour, Kioumars; Raji, Behnaz – Language Testing in Asia, 2022

The credibility of conclusions arrived at in quantitative research depends, to a large extent, on the quality of data collection instruments used to quantify language and non-language constructs. Despite this, research into data collection instruments used in Applied Linguistics and particularly in the thesis genre remains limited. This study…

Descriptors: Applied Linguistics, Test Reliability, Language Tests, Credibility

Superficially Plausible Outputs from a Black Box: Problematising GenAI Tools for Analysing Qualitative SoTL Data

Peer reviewed
PDF on ERIC

Download full text

Mirjam Sophia Glessmer; Rachel Forsyth – Teaching & Learning Inquiry, 2025

Generative AI tools (GenAI) are increasingly used for academic tasks, including qualitative data analysis for the Scholarship of Teaching and Learning (SoTL). In our practice as academic developers, we are frequently asked for advice on whether this use for GenAI is reliable, valid, and ethical. Since this is a new field, we have not been able to…

Descriptors: Artificial Intelligence, Research Methodology, Data Analysis, Scholarship

GPT-4 in Education: Evaluating Aptness, Reliability, and Loss of Coherence in Solving Calculus Problems and Grading Submissions

Peer reviewed

Direct link

Alberto Gandolfi – International Journal of Artificial Intelligence in Education, 2025

In this paper, we initially investigate the capabilities of GPT-3 5 and GPT-4 in solving college-level calculus problems, an essential segment of mathematics that remains under-explored so far. Although improving upon earlier versions, GPT-4 attains approximately 65% accuracy for standard problems and decreases to 20% for competition-like…

Descriptors: Artificial Intelligence, Reliability, Problem Solving, Mathematics Skills

Inter-Instrument Agreement of Three Ultrasound Systems in Measuring the Cross-Sectional Area of the Geniohyoid Muscle

Peer reviewed

Direct link

Takashi Mori; Nami Ogawa; Ichiro Fujishima; Hidetaka Wakabayashi; Keishi Okamoto; Yuto Kameyama; Ai Hirano; Fumiko Oshima; Masataka Itoda; Sumito Ogawa; Tomohisa Ohno; Minoru Yamada; Kenjiro Kunieda; Takashi Shigematsu; Shinta Nishioka; Kazuki Fukuma; Akio Shimizu; Yoichiro Sugiyama – International Journal of Language & Communication Disorders, 2025

Purpose: Measurement of swallowing muscle mass is important in determining sarcopenic dysphagia. Ultrasound equipment can measure the cross-sectional area of the swallowing muscles, but the inter-instrument reliability is unknown. In this study, the inter-instrument reliability was investigated. Methods: Three ultrasound devices were used to…

Descriptors: Human Body, Motor Reactions, Diagnostic Tests, Acoustics

Measuring a Slippery Beast: Validity Evidence for a Measure of Situational Leadership in an Outdoor Leadership Context

Peer reviewed

Direct link

Guy B. deBrun – Journal of Outdoor Recreation, Education, and Leadership, 2025

Discussions of what it means to be an effective outdoor leader are common in outdoor education literature (Martin et al., 2025; Smith, 2021). Research has identified core competencies (Martin et al., 2025), conceptual frameworks (Pomfret et al., 2023), and course curricula/qualifications for effective leadership (Baker & O'Brien, 2019; Seaman…

Descriptors: Outdoor Leadership, Leadership Effectiveness, Evaluation Methods, Scoring Rubrics

Visualizing Agreement: Bland-Altman Plots as a Supplement to Inter-Rater Reliability Indices

Peer reviewed

Direct link

Brogan L. Barr; Virginia V. W. McIntosh; Eileen F. Britt; Jennifer Jordan; Janet D. Carter – Measurement: Interdisciplinary Research and Perspectives, 2024

Even when raters demonstrate agreement in the use of a measure, limited score variability or violation of often-ignored statistical assumptions can result in lower reliability estimates than intuitively expected. This article uses data drawn from two randomized controlled trials of schema therapy and cognitive behavioral therapy for the treatment…

Descriptors: Evaluators, Interrater Reliability, Reliability, Measurement Techniques

Using Systematic Social Observations to Measure Crime Prevention through Environmental Design and Disorder: In-situ Observations, Photographs, and Google Street View Imagery

Peer reviewed

Direct link

Sas, Marlies; Snaphaan, Thom; Pauwels, Lieven J. R.; Ponnet, Koen; Hardyns, Wim – Field Methods, 2023

This study focuses on the use of systematic social observations (SSO) to measure crime prevention through environmental design (CPTED) and disorder. To improve knowledge about measurement issues in small area research, SSO is conducted by means of three different methods: in-situ, photographs, and Google Street View (GSV) imagery. By evaluating…

Descriptors: Crime Prevention, Measurement Techniques, Photography, Observation

Reliability of Quadriceps Twitch Muscle Properties and Explosive Voluntary Contractions at Different Knee Joint Angles

Peer reviewed

Direct link

Haiko Bruno Zimmermann; Debora Knihs; Raphael Sakugawa; Chris Bishop; Juliano Dal Pupo – Measurement in Physical Education and Exercise Science, 2024

Background: Measures that assess muscle strength and its development, either voluntarily or involuntarily, are important in the clinical and research context. The main aim of this study was to verify the interday reliability and the minimum detectable change (MDC) of the knee extensors muscles torque using evoked contractions and explosive…

Descriptors: Human Body, Physiology, Motor Reactions, Muscular Strength

Using Regularization to Identify Measurement Bias across Multiple Background Characteristics: A Penalized Expectation-Maximization Algorithm

Peer reviewed

Direct link

William C. M. Belzak; Daniel J. Bauer – Journal of Educational and Behavioral Statistics, 2024

Testing for differential item functioning (DIF) has undergone rapid statistical developments recently. Moderated nonlinear factor analysis (MNLFA) allows for simultaneous testing of DIF among multiple categorical and continuous covariates (e.g., sex, age, ethnicity, etc.), and regularization has shown promising results for identifying DIF among…

Descriptors: Test Bias, Algorithms, Factor Analysis, Error of Measurement

How Well Can We Diagnose Autism in Adults? Evaluating an Informant-Based Interview: The Dutch Developmental, Dimensional and Diagnostic Interview -- Adult Version (3Di-Adult)

Peer reviewed

Direct link

L.J.G. Krijnen; K. Greaves-Lord; W. Mandy; K.J.S. Mataw; P. Hartog; S. Begeer – Journal of Autism and Developmental Disorders, 2024

The current study evaluated a brief, informant-based autism interview: the Developmental, Dimensional and Diagnostic Interview -- Adult Version (3Di-Adult). Feasibility, reliability and validity of the Dutch 3Di-Adult was tested amongst autistic participants (n = 62) and a non-autistic comparison group (n = 30) in the Netherlands. The 3Di-Adult…

Descriptors: Autism Spectrum Disorders, Identification, Foreign Countries, Adults

What the Malleability of Kolb's Learning Style Preferences Reveals about Categorical Differences in Learning

Peer reviewed

Direct link

Sidney Newton; Rui Wang – Educational Studies, 2024

Notwithstanding the neuromyth controversy, the malleability of learning style preferences impacts the validity of the measurement instrument and the effectiveness of the associated model of learning. This study investigates the test-retest reliability and underlying dynamics of Kolb's Learning Style Inventory (KLSI). It surveys 245 college-level…

Descriptors: Cognitive Style, Preferences, Reliability, Validity

Estimating Reliability for Response-Time Difference Measures: Toward a Standardized, Model-Based Approach

Peer reviewed

Direct link

Bronson Hui; Zhiyi Wu – Studies in Second Language Acquisition, 2024

A slowdown or a speedup in response times across experimental conditions can be taken as evidence of online deployment of knowledge. However, response-time difference measures are rarely evaluated on their reliability, and there is no standard practice to estimate it. In this article, we used three open data sets to explore an approach to…

Descriptors: Reliability, Reaction Time, Psychometrics, Criticism

Reliable for Whom? Inferring and Reporting Reliability across Diverse Populations

Peer reviewed

Direct link

Richard S. Balkin; Quentin Hunter; Bradley T. Erford – Measurement and Evaluation in Counseling and Development, 2024

We describe best practices in reporting reliability estimates in counseling research with consideration to precision, generalization, and diverse populations. We provide a historical context to reporting reliability estimates, the limitations of past practices, and new methods to address reliability generalization. We highlight best practices…

Descriptors: Best Practices, Reliability, Counseling, Research

Evaluating the Performance of the LI3P in Latent Profile Analysis Models

Peer reviewed

Direct link

Russell P. Houpt; Kevin J. Grimm; Aaron T. McLaughlin; Daryl R. Van Tongeren – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Numerous methods exist to determine the optimal number of classes when using latent profile analysis (LPA), but none are consistently correct. Recently, the likelihood incremental percentage per parameter (LI3P) was proposed as a model effect-size measure. To evaluate the LI3P more thoroughly, we simulated 50,000 datasets, manipulating factors…

Descriptors: Structural Equation Models, Profiles, Sample Size, Evaluation Methods

Adaptation and Validation of the Academic Motivation Scale for Higher Education across Four Eastern European Countries

Peer reviewed

Direct link

Ilona Kocvarová; Jan Kalenda; Jitka Vaculíková; Zuzana Neupauer; Ruženka Šimonji Cernak; Anna Wloch – Higher Education Quarterly, 2024

The article focuses on adaptation and validation of the Academic Motivation Scale questionnaire (AMS-28) in higher education in four Eastern European countries: Czechia, Slovakia, Serbia, and Poland. The research was conducted with a total of 1711 respondents. We examined the construct validity of AMS-28 including measurement invariance and…

Descriptors: Foreign Countries, Learning Motivation, Measures (Individuals), Validity

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 651

ProQuest LLC	363
Educational and Psychological…	241
Online Submission	142
Journal of Psychoeducational…	93
Psychological Assessment	85
Educational Research and…	84
Applied Psychological…	74
Journal of Education and…	72
Measurement in Physical…	68
Research on Social Work…	65
Measurement and Evaluation in…	63
Journal of Educational…	59
Educational Sciences: Theory…	57
Journal of Speech, Language,…	57
International Education…	56
Psychology in the Schools	56
Assessment & Evaluation in…	55
Child Development	52
Journal of Autism and…	52
Alliance for Excellent…	50
Applied Measurement in…	50
Grantee Submission	47
Advances in Health Sciences…	40
Assessment	40
Developmental Psychology	40
More ▼

Raykov, Tenko	28
Thompson, Bruce	23
Brennan, Robert L.	16
Feldt, Leonard S.	16
Henson, Robin K.	15
Haberman, Shelby J.	14
Marsh, Herbert W.	14
Onwuegbuzie, Anthony J.	14
Price, Gary G.	14
Briesch, Amy M.	13
Fraser, Barry J.	13
Fan, Xitao	12
Attali, Yigal	11
Daniel, Larry G.	11
Lane, Kathleen Lynne	11
Riley-Tillman, T. Chris	11
Tindal, Gerald	11
Christ, Theodore J.	10
Kolen, Michael J.	10
Tsai, Chin-Chung	10
Zumbo, Bruno D.	10
Chafouleas, Sandra M.	9
Follman, John	9
Francis, Leslie J.	9
More ▼

Journal Articles	7535
Reports - Research	6225
Reports - Evaluative	1427
Reports - Descriptive	742
Speeches/Meeting Papers	542
Tests/Questionnaires	419
Information Analyses	371
Dissertations/Theses -…	368
Opinion Papers	258
Guides - Non-Classroom	102
Numerical/Quantitative Data	84
Books	58
Guides - Classroom - Teacher	24
Reports - General	18
Collected Works - General	17
Guides - General	16
ERIC Publications	14
Reference Materials -…	12
Collected Works - Serials	11
Collected Works - Proceedings	10
Book/Product Reviews	9
Non-Print Media	9
ERIC Digests in Full Text	8
Guides - Classroom - Learner	8
Legal/Legislative/Regulatory…	8
More ▼

No Child Left Behind Act 2001	95
Individuals with Disabilities…	21
Race to the Top	20
Every Student Succeeds Act…	10
American Recovery and…	8
Elementary and Secondary…	7
Individuals with Disabilities…	5
Americans with Disabilities…	4
Elementary and Secondary…	3
Rehabilitation Act 1973…	3
Adoption and Safe Families…	2
Comprehensive Employment and…	2
Elementary and Secondary…	2
Head Start	2
Adoption Assistance and Child…	1
Child Abuse Prevention and…	1
Debra P v Turlington	1
Education Amendments 1972	1
Education Amendments 1974	1
Education for All Handicapped…	1
Education of the Handicapped…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Emergency School Aid Act 1972	1
Family Educational Rights and…	1
More ▼

Wechsler Intelligence Scale…	30
SAT (College Admission Test)	25
Program for International…	23
Peabody Picture Vocabulary…	22
Motivated Strategies for…	21
Beck Depression Inventory	20
Child Behavior Checklist	20
Stanford Achievement Tests	17
Strengths and Difficulties…	17
ACT Assessment	16
Behavior Assessment System…	15
Test of English as a Foreign…	15
Iowa Tests of Basic Skills	14
Marlowe Crowne Social…	14
Dynamic Indicators of Basic…	13
National Assessment of…	13
Teacher Efficacy Scale	13
Trends in International…	13
Learning Style Inventory	12
Maslach Burnout Inventory	12
Minnesota Multiphasic…	12
Academic Motivation Scale	11
Early Childhood Environment…	11
Self Directed Search	11
Social Skills Rating System	11
More ▼