ERIC - Search Results

Showing 1 to 15 of 45 results

Scale Reliability Evaluation Using Bayesian Analysis: A Latent Variable Modeling Procedure

Peer reviewed

Tenko Raykov; George Marcoulides; Randall Schumacker – Measurement: Interdisciplinary Research and Perspectives, 2024

An application of Bayesian factor analysis for evaluation of scale reliability is discussed, which is developed within the framework of latent variable modeling. The method permits direct point and interval estimation of the reliability coefficient of multiple-component measuring instruments using Bayesian inference. The approach allows also point…

Descriptors: Reliability, Bayesian Statistics, Measurement Techniques, Computer Software

Evaluating Cronbach's Coefficient Alpha and Testing Its Identity to Scale Reliability: A Direct Bayesian Confirmatory Factor Analysis Procedure

Peer reviewed

Tenko Raykov; George Marcoulides; James Anthony; Natalja Menold – Measurement: Interdisciplinary Research and Perspectives, 2024

A Bayesian statistics-based approach is discussed that can be used for direct evaluation of the popular Cronbach's coefficient alpha as an internal consistency index for multiple-component measuring instruments, as well as for testing its identity to scale reliability. The method represents an application of confirmatory factor analysis within the…

Descriptors: Reliability, Factor Analysis, Bayesian Statistics, Measurement Techniques

Enabling Data Conversion between Micromine and Surpac -- Enhancing Efficiency in Geological Exploration

Peer reviewed

Fumei Liu – Cogent Education, 2024

This paper details how to effectively share three-dimensional geological models using data conversion between two mainstream mining software, Micromine and Surpac. It also discusses the impact of this conversion method on geological integrated exploration decision-making guidance. The current situation primarily manifests in the fact that both…

Descriptors: Computer Software, Geology, Models, Decision Making

An Authoritative Bibliography of Technical Adequacy Research Conducted on easyCBM - 2024 (Technical Report # 2402.1)

Denise Swanson; Gerald Tindal – Behavioral Research and Teaching, 2024

This technical report provides an authoritative bibliographic resource of all the studies conducted on "easyCBM"® and published on the main website for Behavioral Research and Teaching under Publications (https://brtprojects.org). The "easyCBM"© software is a direct descendent of "Curriculum-based Measurement" (CBM)…

Descriptors: Bibliographies, Computer Software, Test Construction, Test Reliability

Transforming Assessment: The Impacts and Implications of Large Language Models and Generative AI

Peer reviewed

Jiangang Hao; Alina A. von Davier; Victoria Yaneva; Susan Lottridge; Matthias von Davier; Deborah J. Harris – Educational Measurement: Issues and Practice, 2024

The remarkable strides in artificial intelligence (AI), exemplified by ChatGPT, have unveiled a wealth of opportunities and challenges in assessment. Applying cutting-edge large language models (LLMs) and generative AI to assessment holds great promise in boosting efficiency, mitigating bias, and facilitating customized evaluations. Conversely,…

Descriptors: Evaluation Methods, Artificial Intelligence, Educational Change, Computer Software

Frequentist and Bayesian Factorial Invariance Using R

Peer reviewed

Teck Kiang Tan – Practical Assessment, Research & Evaluation, 2024

The procedures of carrying out factorial invariance to validate a construct were well developed to ensure the reliability of the construct that can be used across groups for comparison and analysis, yet mainly restricted to the frequentist approach. This motivates an update to incorporate the growing Bayesian approach for carrying out the Bayesian…

Descriptors: Bayesian Statistics, Factor Analysis, Programming Languages, Reliability

Disambiguating and Specifying Social Actors in Big Data: Using Wikipedia as a Data Source for Demographic Information

Peer reviewed

Poschmann, Philipp; Goldenstein, Jan – Sociological Methods & Research, 2022

Despite the recent and ongoing progress in using text-mining tools to automatically analyze large text corpora, there remains significant potential to facilitate the study of social action in social science research. In this context, particularly the disambiguation (who is referred to in a text?) and specification (which demographic…

Descriptors: Web Sites, Collaborative Writing, Reliability, Accuracy

Curating Cyberbullying Datasets: A Human-AI Collaborative Approach

Peer reviewed

Christopher E. Gomez; Marcelo O. Sztainberg; Rachel E. Trana – International Journal of Bullying Prevention, 2022

Cyberbullying is the use of digital communication tools and spaces to inflict physical, mental, or emotional distress. This serious form of aggression is frequently targeted at, but not limited to, vulnerable populations. A common problem when creating machine learning models to identify cyberbullying is the availability of accurately annotated,…

Descriptors: Video Technology, Computer Software, Computer Mediated Communication, Bullying

Generalizability Theory in R

Peer reviewed

Huebner, Alan; Lucht, Marissa – Practical Assessment, Research & Evaluation, 2019

Generalizability theory is a modern, powerful, and broad framework used to assess the reliability, or dependability, of measurements. While there exist classic works that explain the basic concepts and mathematical foundations of the method, there is currently a lack of resources addressing computational resources for those researchers wishing to…

Descriptors: Generalizability Theory, Test Reliability, Computer Software, Statistical Analysis

CTT Package in R

Peer reviewed

Sheng, Yanyan – Measurement: Interdisciplinary Research and Perspectives, 2019

The classical approach to test theory has been the foundation for educational and psychological measurement for over 90 years. This approach is concerned with measurement error and hence test reliability, which in part relies on individual test items. The CTT package, developed in light of this, provides functions for test- and item-level analyses of…

Descriptors: Item Response Theory, Test Reliability, Item Analysis, Error of Measurement

Duolingo English Test: An Alternative Online English Proficiency Test

Peer reviewed

Heng Lu – PASAA: Journal of Language Teaching and Learning in Thailand, 2023

The test review is on the Duolingo English Test (DET), an alternative online English proficiency test with a machine-driven characteristic. The review covers essential information about the DET such as test purpose, usage, score-mapping with the CEFR scale, price, and publisher. Meanwhile, the test usefulness is discussed with a focus on reliability,…

Descriptors: Computer Software, Computer Assisted Instruction, Second Language Learning, Second Language Instruction

Item Response Theory-Based Methods for Estimating Classification Accuracy and Consistency

Peer reviewed

Diao, Hongyu; Sireci, Stephen G. – Journal of Applied Testing Technology, 2018

Whenever classification decisions are made on educational tests, such as pass/fail, or basic, proficient, or advanced, the consistency and accuracy of those decisions should be estimated and reported. Methods for estimating the reliability of classification decisions made on the basis of educational tests are well-established (e.g., Rudner, 2001;…

Descriptors: Classification, Item Response Theory, Accuracy, Reliability

Developing an Online Test to Measure Writing and Speaking Skills Automatically

Peer reviewed

Bateson, Gordon – International Journal of Computer-Assisted Language Learning and Teaching, 2021

As a result of the Japanese Ministry of Education's recent edict that students' written and spoken English should be assessed in university entrance exams, there is an urgent need for tools to help teachers and students prepare for these exams. Although some commercial tools already exist, they are generally expensive and inflexible. To address…

Descriptors: Test Construction, Computer Assisted Testing, Internet, Writing Tests

Comparative Judgement: Assess Student Production without Absolute Judgements

Peer reviewed

Sumner, Josh – Research-publishing.net, 2021

Comparative Judgement (CJ) has emerged as a technique that typically makes use of holistic judgement to assess difficult-to-specify constructs such as production (speaking and writing) in Modern Foreign Languages (MFL). In traditional approaches, markers assess candidates' work one-by-one in an absolute manner, assigning scores to different…

Descriptors: Holistic Approach, Student Evaluation, Comparative Analysis, Decision Making

R Packages for Item Response Theory Analysis: Descriptions and Features

Peer reviewed

Choi, Youn-Jeng; Asilkalkan, Abdullah – Measurement: Interdisciplinary Research and Perspectives, 2019

About 45 R packages to analyze data using item response theory (IRT) have been developed over the last decade. This article introduces these 45 R packages with their descriptions and features. It also describes possible advanced IRT models using R packages, as well as dichotomous and polytomous IRT models, and R packages that contain applications…

Descriptors: Item Response Theory, Data Analysis, Computer Software, Test Bias
