ERIC - Search Results

Publication Date

In 2026	0
Since 2025	16
Since 2022 (last 5 years)	74
Since 2017 (last 10 years)	149
Since 2007 (last 20 years)	283

Descriptor

Computer Software	283
Reliability	183
Foreign Countries	107
Validity	63
Statistical Analysis	60
Interrater Reliability	59
Comparative Analysis	53
Correlation	51
Test Reliability	50
Second Language Learning	44
Teaching Methods	44
Artificial Intelligence	42
Models	41
Evaluation Methods	40
Accuracy	37
English (Second Language)	36
Educational Technology	34
Computer Assisted Testing	33
Student Attitudes	32
Scores	31
Second Language Instruction	31
Undergraduate Students	30
Factor Analysis	27
Questionnaires	27
Scoring	27
More ▼

Publication Type

Journal Articles	245
Reports - Research	198
Reports - Evaluative	36
Reports - Descriptive	26
Tests/Questionnaires	22
Dissertations/Theses -…	15
Speeches/Meeting Papers	13
Information Analyses	9
Books	3
Collected Works - General	2
Opinion Papers	2
Collected Works - Proceedings	1
Guides - Classroom - Learner	1
Guides - General	1
Multilingual/Bilingual…	1
Numerical/Quantitative Data	1
Reference Materials -…	1
Reports - General	1
More ▼

Education Level

Higher Education	88
Postsecondary Education	78
Secondary Education	32
Elementary Education	29
Elementary Secondary Education	19
Middle Schools	14
Early Childhood Education	7
High Schools	7
Junior High Schools	7
Intermediate Grades	5
Primary Education	5
Grade 2	3
Grade 4	3
Grade 7	3
Grade 8	3
Kindergarten	3
Grade 5	2
Preschool Education	2
Grade 1	1
Grade 10	1
Grade 11	1
Grade 12	1
Grade 3	1
Grade 6	1
Grade 9	1
More ▼

Audience

Researchers	1
Students	1
Support Staff	1

Location

Turkey	17
Iran	6
Saudi Arabia	6
Australia	5
Canada	5
Germany	5
Japan	5
Malaysia	5
China	4
Egypt	4
Netherlands	4
Pakistan	4
Philippines	4
South Korea	4
United Kingdom	4
United States	4
Cyprus	3
India	3
Jordan	3
Nigeria	3
Pennsylvania	3
Sweden	3
Czech Republic	2
Denmark	2
Estonia	2
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

International English…	3
Test of English as a Foreign…	3
Peabody Picture Vocabulary…	2
Dale Chall Readability Formula	1
Expressive One Word Picture…	1
Flesch Kincaid Grade Level…	1
Flesch Reading Ease Formula	1
Fry Readability Formula	1
Graduate Record Examinations	1
Mean Length of Utterance	1
National Assessment of…	1
Students Evaluation of…	1
Test of English for…	1
Torrance Tests of Creative…	1
Trends in International…	1
Wechsler Adult Intelligence…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 283 results Save | Export

Scale Reliability Evaluation Using Bayesian Analysis: A Latent Variable Modeling Procedure

Peer reviewed

Direct link

Tenko Raykov; George Marcoulides; Randall Schumacker – Measurement: Interdisciplinary Research and Perspectives, 2024

An application of Bayesian factor analysis for evaluation of scale reliability is discussed, which is developed within the framework of latent variable modeling. The method permits direct point and interval estimation of the reliability coefficient of multiple-component measuring instruments using Bayesian inference. The approach allows also point…

Descriptors: Reliability, Bayesian Statistics, Measurement Techniques, Computer Software

Evaluating Cronbach's Coefficient Alpha and Testing Its Identity to Scale Reliability: A Direct Bayesian Confirmatory Factor Analysis Procedure

Peer reviewed

Direct link

Tenko Raykov; George Marcoulides; James Anthony; Natalja Menold – Measurement: Interdisciplinary Research and Perspectives, 2024

A Bayesian statistics-based approach is discussed that can be used for direct evaluation of the popular Cronbach's coefficient alpha as an internal consistency index for multiple-component measuring instruments, as well as for testing its identity to scale reliability. The method represents an application of confirmatory factor analysis within the…

Descriptors: Reliability, Factor Analysis, Bayesian Statistics, Measurement Techniques

Decision-Making Efficiency with Aided Information: The Impact of Automation Reliability and Task Difficulty

Peer reviewed

Direct link

Hanshu Zhang; Ran Zhou; Cheng-You Cheng; Sheng-Hsu Huang; Ming-Hui Cheng; Cheng-Ta Yang – Cognitive Research: Principles and Implications, 2025

Although it is commonly believed that automation aids human decision-making, conflicting evidence raises questions about whether individuals would gain greater advantages from automation in difficult tasks. Our study examines the combined influence of task difficulty and automation reliability on aided decision-making. We assessed decision…

Descriptors: Task Analysis, Difficulty Level, Decision Making, Automation

The General Attitudes towards Artificial Intelligence (GAAIS): A Meta-Analytic Reliability Generalization Study

Peer reviewed
PDF on ERIC

Download full text

Melek Gülsah Sahin; Yildiz Yildirim – International Journal of Assessment Tools in Education, 2024

This study aims to generalize the reliability of the GAAIS, which is known to perform valid and reliable measurements, is frequently used in the literature, aims to measure one of today's popular topics, and is one of the first examples developed in the field. Within the meta-analytic reliability generalization study, moderator analyses were also…

Descriptors: Generalization, Meta Analysis, Databases, Research Reports

How Reliable Is Assessment of Children's Sentence Comprehension Using a Self-Directed App? A Comparison of Supported versus Independent Use

Peer reviewed

Direct link

Pauline Frizelle; Ana Buckley; Tricia Biancone; Anna Ceroni; Darren Dahly; Paul Fletcher; Dorothy V. M. Bishop; Cristina McKean – Journal of Child Language, 2024

This study reports on the feasibility of using the Test of Complex Syntax- Electronic (TECS-E), as a self-directed app, to measure sentence comprehension in children aged 4 to 5 ½ years old; how testing apps might be adapted for effective independent use; and agreement levels between face-to-face supported computerized and independent computerized…

Descriptors: Language Processing, Computer Software, Language Tests, Syntax

Enabling Data Conversion between Micromine and Surpac -- Enhancing Efficiency in Geological Exploration

Peer reviewed

Direct link

Fumei Liu – Cogent Education, 2024

This paper details how to effectively share three-dimensional geological models using data conversion between two mainstream mining software, Micromine and Surpac. It also discusses the impact of this conversion method on geological integrated exploration decision-making guidance. The current situation primarily manifests in the fact that both…

Descriptors: Computer Software, Geology, Models, Decision Making

Modeling Directional Testlet Effects on Multiple Open-Ended Questions

Peer reviewed

Direct link

Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025

Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…

Descriptors: Models, Test Items, Educational Assessment, Scores

An Authoritative Bibliography of Technical Adequacy Research Conducted on easyCBM - 2024 (Technical Report # 2402.1)

Download full text

Denise Swanson; Gerald Tindal – Behavioral Research and Teaching, 2024

This technical report provides an authoritative bibliographic resource of all the studies conducted on "easyCBM"® and published on the main website for Behavioral Research and Teaching under Publications (https://brtprojects.org). The "easyCBM"© software is a direct descendent of "Curriculum-based Measurement" (CBM)…

Descriptors: Bibliographies, Computer Software, Test Construction, Test Reliability

Multilingual Language Models: Analysis and Algorithms

Direct link

Terra Blevins – ProQuest LLC, 2024

While large language models (LLMs) continue to grow in scale and gain new zero-shot capabilities, their performance for languages beyond English increasingly lags behind. This gap is due to the "curse of multilinguality," where multilingual language models perform worse on individual languages than a monolingual model trained on that…

Descriptors: Multilingualism, Computational Linguistics, Second Languages, Reliability

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Direct link

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

A Closed-Form Alternative for Estimating [omega] Reliability under Unidimensionality

Peer reviewed

Direct link

Hancock, Gregory R.; An, Ji – Measurement: Interdisciplinary Research and Perspectives, 2020

As an alternative to Cronbach's [alpha] for estimating scale reliability, McDonald's [omega] has attracted increased attention within the methodological community for its less stringent measurement assumptions. Notwithstanding, [omega] is still seldom used by practitioners, likely due to its unavailability in popular software packages (e.g., SPSS)…

Descriptors: Evaluation, Alternative Assessment, Reliability, Test Reliability

Transforming Assessment: The Impacts and Implications of Large Language Models and Generative AI

Peer reviewed

Direct link

Jiangang Hao; Alina A. von Davier; Victoria Yaneva; Susan Lottridge; Matthias von Davier; Deborah J. Harris – Educational Measurement: Issues and Practice, 2024

The remarkable strides in artificial intelligence (AI), exemplified by ChatGPT, have unveiled a wealth of opportunities and challenges in assessment. Applying cutting-edge large language models (LLMs) and generative AI to assessment holds great promise in boosting efficiency, mitigating bias, and facilitating customized evaluations. Conversely,…

Descriptors: Evaluation Methods, Artificial Intelligence, Educational Change, Computer Software

Detecting Test Flakiness without Rerunning Tests

Direct link

Abdulrahman Alshammari – ProQuest LLC, 2024

A critical component of modern software development practices, particularly continuous integration (CI), is the halt of development activities in response to test failures which requires further investigation and debugging. As software changes, regression testing becomes vital to verify that new code does not affect existing functionality.…

Descriptors: Computer Software, Programming, Coding, Test Reliability

Combined Logistic and Confined Exponential Growth Models: Estimation Using SEM Software

Peer reviewed

Direct link

Phillip K. Wood – Structural Equation Modeling: A Multidisciplinary Journal, 2024

The logistic and confined exponential curves are frequently used in studies of growth and learning. These models, which are nonlinear in their parameters, can be estimated using structural equation modeling software. This paper proposes a single combined model, a weighted combination of both models. Mplus, Proc Calis, and lavaan code for the model…

Descriptors: Structural Equation Models, Computation, Computer Software, Weighted Scores

Application of Model Averaging for Measurement in the Presence of Unknown Familiarization Phase or Fatigue Phase

Peer reviewed

Direct link

Steven Kim; Stephanie Lara-Sotelo; Eric Martin – Measurement in Physical Education and Exercise Science, 2024

A number of familiarization trials are needed for reliable measurement, particularly for inexperienced subjects. Researchers have studied and developed familiarization protocols that vary by exercise and study population. The pace of familiarization and fatigue may be an individual-level characteristic, so a population-level protocol may not fit…

Descriptors: Familiarity, Physical Education, Fatigue (Biology), Reliability

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 19

ProQuest LLC	14
Online Submission	8
Educational Research and…	6
Grantee Submission	6
Education and Information…	5
English Language Teaching	5
Journal of Education and…	5
Measurement:…	5
Research Synthesis Methods	5
ETS Research Report Series	4
International Educational…	4
Journal of Speech, Language,…	4
Advances in Physiology…	3
British Journal of…	3
Contemporary Educational…	3
Educational Technology &…	3
Educational and Psychological…	3
International Education…	3
Journal of Computer Assisted…	3
Journal of Educational…	3
Journal of Information…	3
Language Testing	3
Turkish Online Journal of…	3
Advances in Language and…	2
Australasian Journal of…	2
More ▼

An, Ji	2
Bahreini, Kiavash	2
Bodur, Yasar	2
Gentile, Claudia	2
George Marcoulides	2
Hancock, Gregory R.	2
Kantor, Robert	2
Kay, Robin H.	2
Knaack, Liesel	2
Lee, Yong-Won	2
Lenhard, Wolfgang	2
McNamara, Danielle S.	2
Mustafa Taktak	2
Nadolski, Rob	2
Ritzhaupt, Albert D.	2
Seifried, Eva	2
Spinath, Birgit	2
Tenko Raykov	2
Unal, Aslihan	2
Unal, Zafer	2
Wang, Wen-Chung	2
Westera, Wim	2
A. K. Somasekhar	1
Abalaka, Eneojo N.	1
Abdullah D. Alenezi	1
More ▼