ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	6
Since 2017 (last 10 years)	12
Since 2007 (last 20 years)	25

Publication Type

Reports - Research	42
Journal Articles	30
Speeches/Meeting Papers	6
Tests/Questionnaires	5
Numerical/Quantitative Data	1
Reports - Evaluative	1

Education Level

Higher Education	12
Postsecondary Education	8
Secondary Education	4
Elementary Secondary Education	3
High Schools	3
Elementary Education	2
Grade 7	2
Grade 8	2
Middle Schools	2
Early Childhood Education	1
Grade 6	1
Junior High Schools	1
Preschool Education	1
More ▼

Audience

Researchers

Location

Turkey	2
Canada (London)	1
Greece	1
Indonesia	1
Iran	1
Kentucky	1
Massachusetts	1
Netherlands	1
New Jersey	1
Poland	1
Sweden	1
United States	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	3
Graduate Record Examinations	2
ACT Assessment	1
Gates MacGinitie Reading Tests	1
International English…	1
Kaufman Assessment Battery…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 42 results Save | Export

Construct Validity and Test-Retest Reliability of the Test of Advanced Movement Skills with a Dual-Outcome Scoring System

Peer reviewed

Direct link

Wu, Sz-Yan; Kang, Hyeon-Ah; Jensen, Jody L. – Measurement in Physical Education and Exercise Science, 2023

The objective was to verify the construct validity and test-retest reliability of the Test of Advanced Movement Skills (TAMS) with an innovative dual-outcome scoring system. Three statistical approaches--confirmatory factor analysis (CFA), exploratory structural equation modeling (ESEM), and item response theory analysis (IRT)--were applied to the…

Descriptors: Construct Validity, Pretests Posttests, Psychomotor Skills, Scoring

The Enhanced ACT Linking Study Report. ACT Research. Research Paper. R2515

Download full text

Dongmei Li; Shalini Kapoor; Ann Arthur; Chi-Yu Huang; YoungWoo Cho; Chen Qiu; Hongling Wang – ACT Education Corp., 2025

Starting in April 2025, ACT will introduce enhanced forms of the ACT® test for national online testing, with a full rollout to all paper and online test takers in national, state and district, and international test administrations by Spring 2026. ACT introduced major updates by changing the test lengths and testing times, providing more time per…

Descriptors: College Entrance Examinations, Testing, Change, Scoring

Can AI Grade Like a Human? Validity, Reliability, and Fairness in University Coursework Assessment

Peer reviewed
PDF on ERIC

Download full text

Georgios Zacharis; Stamatios Papadakis – Educational Process: International Journal, 2025

Background/purpose: Generative artificial intelligence (GenAI) is often promoted as a transformative tool for assessment, yet evidence of its validity compared to human raters remains limited. This study examined whether an AI-based rater could be used interchangeably with trained faculty in scoring complex coursework. Materials/methods:…

Descriptors: Artificial Intelligence, Technology Uses in Education, Computer Assisted Testing, Grading

Evaluating the Construct Validity of an Automated Writing Evaluation System with a Randomization Algorithm

Peer reviewed

Direct link

Myers, Matthew C.; Wilson, Joshua – International Journal of Artificial Intelligence in Education, 2023

This study evaluated the construct validity of six scoring traits of an automated writing evaluation (AWE) system called "MI Write." Persuasive essays (N = 100) written by students in grades 7 and 8 were randomized at the sentence-level using a script written with Python's NLTK module. Each persuasive essay was randomized 30 times (n =…

Descriptors: Construct Validity, Automation, Writing Evaluation, Algorithms

Developing the Mathematical Thinking Scale for Gifted Students

Peer reviewed
PDF on ERIC

Download full text

Er, Zübeyde; Dinç Artut, Perihan; Bal, Ayten Pinar – Pegem Journal of Education and Instruction, 2023

The aim of this research is to develop a reliable and valid scale to determine the mathematical thinking skills of gifted students. In addition, with the developed scale, thinking skills of gifted students was examined in terms of various variables. In this context, the research was carried out on two different study groups. The first stage of…

Descriptors: Measures (Individuals), Rating Scales, Test Construction, Construct Validity

Performance Assessment and Standardization in Higher Education: A Problematic Conjunction?

Peer reviewed

Direct link

Braun, Henry – British Journal of Educational Psychology, 2019

Background: There is unrealized potential in higher education for greater use of performance assessment, particularly in support of teaching and learning: Well-designed performance tasks can elicit evidence regarding what students know and can do with respect to complex learning objectives. At the same time, there is some pressure, at least in the…

Descriptors: Performance Based Assessment, Higher Education, Test Format, Standardized Tests

Technical Manual 2022: Multiple-Choice Online Causal Comprehension Assessment (MOCCA)-College. MOCCA-College Technical Report (MCTR) 2022

Peer reviewed
PDF on ERIC

Download full text

Ben Seipel; Sarah E. Carlson; Virginia Clinton-Lisell; Mark L. Davison; Patrick C. Kennedy – Grantee Submission, 2022

Originally designed for students in Grades 3 through 5, MOCCA (formerly the Multiple-choice Online Causal Comprehension Assessment), identifies students who struggle with comprehension, and helps uncover why they struggle. There are many reasons why students might not comprehend what they read. They may struggle with decoding, or reading words…

Descriptors: Multiple Choice Tests, Computer Assisted Testing, Diagnostic Tests, Reading Tests

On Designing Construct Driven Situational Judgment Tests: Some Preliminary Recommendations

Peer reviewed

Direct link

Guenole, Nigel; Chernyshenko, Oleksandr S.; Weekly, Jeff – International Journal of Testing, 2017

Situational judgment tests (SJTs) are widely agreed to be a measurement technique. It is also widely agreed that SJTs are a questionable methodological choice for measurement of psychological constructs, such as behavioral competencies, due to a lack of evidence supporting appropriate factor structures and high internal consistencies.…

Descriptors: Situational Tests, Psychological Evaluation, Test Construction, Industrial Psychology

For a Greater Good: Bias Analysis in Writing Assessment

Peer reviewed

Direct link

Ahmadi Shirazi, Masoumeh – SAGE Open, 2019

Threats to construct validity should be reduced to a minimum. If true, sources of bias, namely raters, items, tests as well as gender, age, race, language background, culture, and socio-economic status need to be spotted and removed. This study investigates raters' experience, language background, and the choice of essay prompt as potential…

Descriptors: Foreign Countries, Language Tests, Test Bias, Essay Tests

An Evaluation Paradox: The Issues of Test Validity in the Realm of Writing Test as the Final School Examination in the Indonesian Senior High School Milieu

Peer reviewed
PDF on ERIC

Download full text

Imamyartha, David; Sulistyo, Gunadi Harry – Dinamika Ilmu, 2017

Even though there are four English language skills in the Indonesia's national curriculum at upper secondary schools, each of these skills is given an unequal emphasis since only reading and listening skills are formally tested in the national examination. Although writing competence possesses a particular stake as the determinant of students'…

Descriptors: Foreign Countries, High School Students, Writing Tests, Writing Evaluation

The Resiliency Scale for Young Adults

Peer reviewed

Direct link

Prince-Embury, Sandra; Saklofske, Donald H.; Nordstokke, David W. – Journal of Psychoeducational Assessment, 2017

The Resiliency Scale for Young Adults (RSYA) is presented as an upward extension of the Resiliency Scales for Children and Adolescents (RSCA). The RSYA is based on the "three-factor model of personal resiliency" including "mastery," "relatedness," and "emotional reactivity." Several stages of scale…

Descriptors: Measures (Individuals), Resilience (Psychology), Young Adults, Factor Structure

Polish Listening SPAN: A New Tool for Measuring Verbal Working Memory

Peer reviewed
PDF on ERIC

Download full text

Zychowicz, Katarzyna; Biedron, Adriana; Pawlak, Miroslaw – Studies in Second Language Learning and Teaching, 2017

Individual differences in second language acquisition (SLA) encompass differences in working memory capacity, which is believed to be one of the most crucial factors influencing language learning. However, in Poland research on the role of working memory in SLA is scarce due to a lack of proper Polish instruments for measuring this construct. The…

Descriptors: Verbal Ability, Short Term Memory, Individual Differences, Second Language Learning

Application of Context Input Process and Product Model in Curriculum Evaluation: Case Study of a Call Centre

Peer reviewed
PDF on ERIC

Download full text

Kavgaoglu, Derya; Alci, Bülent – Educational Research and Reviews, 2016

The goal of this research which was carried out in reputable dedicated call centres within the Turkish telecommunication sector aims is to evaluate competence-based curriculums designed by means of internal funding through Stufflebeam's context, input, process, product (CIPP) model. In the research, a general scanning pattern in the scope of…

Descriptors: Foreign Countries, Evaluation Methods, Models, Curriculum Evaluation

Automated Trait Scores for "TOEFL"® Writing Tasks. Research Report. ETS RR-15-14

Peer reviewed
PDF on ERIC

Download full text

Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015

The "e-rater"® automated essay scoring system is used operationally in the scoring of "TOEFL iBT"® independent and integrated tasks. In this study we explored the psychometric added value of reporting four trait scores for each of these two tasks, beyond the total e-rater score.The four trait scores are word choice, grammatical…

Descriptors: Writing Tests, Scores, Language Tests, English (Second Language)

Writing Quality, Knowledge, and Comprehension Correlates of Human and Automated Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Roscoe, Rod D.; Crossley, Scott A.; Snow, Erica L.; Varner, Laura K.; McNamara, Danielle S. – Grantee Submission, 2014

Automated essay scoring tools are often criticized on the basis of construct validity. Specifically, it has been argued that computational scoring algorithms may be unaligned to higher-level indicators of quality writing, such as writers' demonstrated knowledge and understanding of the essay topics. In this paper, we consider how and whether the…

Descriptors: Correlation, Essays, Scoring, Writing Evaluation

Previous Page | Next Page »

Pages: 1 | 2 | 3

ETS Research Report Series	4
Educational and Psychological…	3
Grantee Submission	2
ACT Education Corp.	1
Assessment	1
Assessment & Evaluation in…	1
British Journal of…	1
Dinamika Ilmu	1
Educational Assessment	1
Educational Process:…	1
Educational Psychology in…	1
Educational Renaissance	1
Educational Research and…	1
International Journal of…	1
International Journal of…	1
Internet and Higher Education	1
Journal of Abnormal Child…	1
Journal of Applied Testing…	1
Journal of Psychoeducational…	1
Journal of Research in…	1
Journal of the Learning…	1
Language Testing	1
Measurement in Physical…	1
Online Submission	1
Pegem Journal of Education…	1
More ▼

Construct Validity	42
Scoring	42
Correlation	11
Factor Analysis	11
Computer Assisted Testing	9
Foreign Countries	9
Test Construction	9
Scores	8
English (Second Language)	7
Language Tests	7
Measures (Individuals)	7
Reliability	7
Test Reliability	7
Writing Evaluation	7
College Students	6
Essays	6
Models	6
Performance Based Assessment	6
Undergraduate Students	6
Content Validity	5
Evaluation Methods	5
Evaluators	5
Factor Structure	5
Interrater Reliability	5
Predictive Validity	5
More ▼

Attali, Yigal	4
Sinharay, Sandip	2
Ahmadi Shirazi, Masoumeh	1
Alci, Bülent	1
Andersson, Marie	1
Ann Arthur	1
Artino, Anthony R., Jr.	1
Baker, Ryan S.	1
Bal, Ayten Pinar	1
Ben Seipel	1
Biedron, Adriana	1
Boldt, Robert F.	1
Braun, Henry	1
Briller, Vladimir	1
Carifio, James	1
Carlson, Sybil B.	1
Chen Qiu	1
Chernyshenko, Oleksandr S.	1
Chi-Yu Huang	1
Crehan, Kevin D.	1
Crossley, Scott A.	1
Dejong, William	1
Deng, Hui	1
Dinç Artut, Perihan	1
More ▼