ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	4
Since 2017 (last 10 years)	6
Since 2007 (last 20 years)	11

Descriptor

Computer Software	15
Rating Scales	15
Foreign Countries	6
Factor Analysis	5
Interrater Reliability	5
Reliability	5
Test Reliability	5
Artificial Intelligence	4
Evaluators	4
Scoring	4
Computer Assisted Testing	3
Correlation	3
Evaluation Methods	3
Measurement Techniques	3
Student Attitudes	3
Technology Integration	3
Test Construction	3
Test Validity	3
Validity	3
College Students	2
Comparative Analysis	2
Computer Assisted Instruction	2
Critical Thinking	2
Educational Technology	2
Electronic Learning	2
More ▼

Source

Educational Sciences: Theory…	2
Assessment in Education:…	1
British Journal of…	1
Computers & Education	1
Educational and Psychological…	1
Electronic Journal of…	1
International Educational…	1
International Journal of…	1
Online Submission	1
PASAA: Journal of Language…	1
ProQuest LLC	1
More ▼

Publication Type

Reports - Research	10
Journal Articles	9
Speeches/Meeting Papers	4
Reports - Evaluative	3
Dissertations/Theses -…	1
Information Analyses	1
Numerical/Quantitative Data	1
Reports - Descriptive	1
Tests/Questionnaires	1

Education Level

Higher Education	5
Postsecondary Education	4
Elementary Education	1

Audience

Location

Europe	2
Turkey	2
Florida	1
Poland	1
Portugal	1
Turkey (Istanbul)	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Evaluating Quadratic Weighted Kappa as the Standard Performance Metric for Automated Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Doewes, Afrizal; Kurdhi, Nughthoh Arfawi; Saxena, Akrati – International Educational Data Mining Society, 2023

Automated Essay Scoring (AES) tools aim to improve the efficiency and consistency of essay scoring by using machine learning algorithms. In the existing research work on this topic, most researchers agree that human-automated score agreement remains the benchmark for assessing the accuracy of machine-generated scores. To measure the performance of…

Descriptors: Essays, Writing Evaluation, Evaluators, Accuracy

Prompting Minds: Evaluating How Students Perceive Generative AI's Critical Thinking Dispositions

Peer reviewed
PDF on ERIC

Download full text

Luciana Oliveira; Célia Tavares; Artur Strzelecki; Manuel Silva – Electronic Journal of e-Learning, 2025

As generative artificial intelligence tools like ChatGPT become increasingly integrated into educational environments, understanding their impact on critical thinking is crucial. Despite growing concerns about AI's potential to diminish students' independent reasoning, there is a lack of research tools specifically designed to evaluate students'…

Descriptors: Critical Thinking, Artificial Intelligence, Computer Software, Technology Integration

ChatGPT Usage Scale in Education: Validity and Reliability Study

Peer reviewed
PDF on ERIC

Download full text

Mustafa Taktak; Görsev Bafrali – International Journal of Technology in Education, 2025

This study aimed to develop a valid and reliable scale to measure individuals' and organizations' attitudes toward the use of ChatGPT, emphasizing the necessity for organizations to adapt to rapidly evolving information and technology environments. The methodology consisted of three stages. In the first stage, a 13-item draft scale was…

Descriptors: Artificial Intelligence, Technology Uses in Education, Factor Analysis, Validity

Duolingo English Test: An Alternative Online English Proficiency Test

Peer reviewed
PDF on ERIC

Download full text

Heng Lu – PASAA: Journal of Language Teaching and Learning in Thailand, 2023

The test view is on the Duolingo English Test (DET), an alternative online English proficiency test with a machine-driven characteristic. The review covers essential information of the DET such as test purpose, usage, score-mapping with CEFR scale, price, and publisher. Meanwhile, the test usefulness is discussed with focuses on reliability,…

Descriptors: Computer Software, Computer Assisted Instruction, Second Language Learning, Second Language Instruction

Assessing L2 English Speaking Using Automated Scoring Technology: Examining Automarker Reliability

Peer reviewed

Direct link

Xu, Jing; Jones, Edmund; Laxton, Victoria; Galaczi, Evelina – Assessment in Education: Principles, Policy & Practice, 2021

Recent advances in machine learning have made automated scoring of learner speech widespread, and yet validation research that provides support for applying automated scoring technology to assessment is still in its infancy. Both the educational measurement and language assessment communities have called for greater transparency in describing…

Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computer Software

The Impact of Rater Variability on Relationships among Different Effect-Size Indices for Inter-Rater Agreement between Human and Automated Essay Scoring

Direct link

Yun, Jiyeo – ProQuest LLC, 2017

Since researchers investigated automatic scoring systems in writing assessments, they have dealt with relationships between human and machine scoring, and then have suggested evaluation criteria for inter-rater agreement. The main purpose of my study is to investigate the magnitudes of and relationships among indices for inter-rater agreement used…

Descriptors: Interrater Reliability, Essays, Scoring, Evaluators

Content Analysis of the Studies in Turkey on the Ability of Critical Thinking

Peer reviewed
PDF on ERIC

Download full text

Polat, Seyat – Educational Sciences: Theory and Practice, 2015

Critical thinking, along with other skills, is included as a basic skill in the constructive education program that has been in use in Turkey since 2005. Therefore, a large increase has been observed in studies on critical thinking skills since 2005. In this frame, the present study was conducted in order to systematically examine research papers…

Descriptors: Foreign Countries, Content Analysis, Critical Thinking, Predictor Variables

The Effect of Organizational Trust on the Culture of Teacher Leadership in Primary Schools

Peer reviewed
PDF on ERIC

Download full text

Demir, Kamile – Educational Sciences: Theory and Practice, 2015

The purpose of this research is to examine the effect of the level of trust of primary school teachers towards their organization in relation to their perceptions of the school having a culture of teacher leadership. Participants of the study consisted of 378 teachers working in Burdur public primary schools. The data collection tool used two…

Descriptors: Foreign Countries, Teacher Leadership, Elementary School Teachers, Elementary Schools

Evaluating the Quality of E-Learning at the Degree Level in the Student Experience of Blended Learning

Peer reviewed

Direct link

Ginns, Paul; Ellis, Rob A. – British Journal of Educational Technology, 2009

This paper reports on the development of a scale for determining the quality of the student e-learning experience at the degree level when the student learning context is predominately a campus-based experience. Rapid developments in the use of information and communication technologies (ICT) in higher education require methods for evaluating the…

Descriptors: Undergraduate Students, Quality Control, Psychometrics, Student Experience

The DAATS Model: Initial Psychometric and Statistical Findings. A Top Ten Illustration

Download full text

Lang, W. Steve – Online Submission, 2008

The INTASC Principles, when used as the basis for developing appropriate measurement instruments to assess teacher dispositions, provide a viable approach to the diagnosis and remediation of skill-related affective performance in teacher candidates and also to meeting NCATE requirements for Standard 1. In this symposium, the development and use of…

Descriptors: Computer Software, Teacher Education Programs, Rating Scales, Measurement

EGameFlow: A Scale to Measure Learners' Enjoyment of E-Learning Games

Peer reviewed

Direct link

Fu, Fong-Ling; Su, Rong-Chang; Yu, Sheng-Chin – Computers & Education, 2009

In an effective e-learning game, the learner's enjoyment acts as a catalyst to encourage his/her learning initiative. Therefore, the availability of a scale that effectively measures the enjoyment offered by e-learning games assist the game designer to understanding the strength and flaw of the game efficiently from the learner's points of view.…

Descriptors: Online Courses, Questionnaires, Educational Games, Student Attitudes

Three Coefficients for Analyzing the Reliability and Validity of Ratings.

Peer reviewed

Aiken, Lewis R. – Educational and Psychological Measurement, 1985

Three numerical coefficients for analyzing the validity and reliability of ratings are described. Each coefficient is computed as the ratio of an obtained to a maximum sum of differences in ratings. The coefficients are also applicable to the item analysis, agreement analysis, and cluster or factor analysis of rating-scale data. (Author/BW)

Descriptors: Computer Software, Data Analysis, Factor Analysis, Item Analysis

Training Emphasis Task Factor Data: Methods of Analysis.

Download full text

Jansen, Hans P. – 1985

The Air Force Occupational Measurement Center conducts task-based occupational surveys of Air Force specialties that include supervisor ratings on recommended training emphasis for entry-level airmen. Priorities are input to the Instructional System Development training model, which guides the development and revision of specialty training…

Descriptors: Cluster Analysis, Computer Software, Evaluation Methods, Factor Analysis

On-line Performance Assessment Using Rating Scales.

Download full text

Stahl, John; And Others – 1996

On-line performance assessment was developed to maximize the usefulness of performance assessment and to minimize the time and labor costs incurred. This paper reports on the development of an on-line performance assessment instrument, focusing on the establishment and validation of the scoring rubric and its implementation in the Rasch model, the…

Descriptors: Computer Software, Computer Software Development, Cost Effectiveness, Interrater Reliability

Variation among Examiners and Protocols on Oral Examinations.

Lunz, Mary E.; And Others – 1989

A method for understanding and controlling the multiple facets of an oral examination (OE) or other judge-intermediated examination is presented and illustrated. This study focused on determining the extent to which the facets model (FM) analysis constructs meaningful variables for each facet of an OE involving protocols, examiners, and…

Descriptors: Computer Software, Difficulty Level, Evaluators, Examiners

Aiken, Lewis R.	1
Artur Strzelecki	1
Célia Tavares	1
Demir, Kamile	1
Doewes, Afrizal	1
Ellis, Rob A.	1
Fu, Fong-Ling	1
Galaczi, Evelina	1
Ginns, Paul	1
Görsev Bafrali	1
Heng Lu	1
Jansen, Hans P.	1
Jones, Edmund	1
Kurdhi, Nughthoh Arfawi	1
Lang, W. Steve	1
Laxton, Victoria	1
Luciana Oliveira	1
Lunz, Mary E.	1
Manuel Silva	1
Mustafa Taktak	1
Polat, Seyat	1
Saxena, Akrati	1
Stahl, John	1
Su, Rong-Chang	1
More ▼