ERIC - Search Results

Showing all 13 results

Can AI Grade Like a Human? Validity, Reliability, and Fairness in University Coursework Assessment

Peer reviewed

Zacharis, Georgios; Papadakis, Stamatios – Educational Process: International Journal, 2025

Background/purpose: Generative artificial intelligence (GenAI) is often promoted as a transformative tool for assessment, yet evidence of its validity compared to human raters remains limited. This study examined whether an AI-based rater could be used interchangeably with trained faculty in scoring complex coursework. Materials/methods:…

Descriptors: Artificial Intelligence, Technology Uses in Education, Computer Assisted Testing, Grading

Interdisciplinary Science Assessment of Carbon Cycling: Construct Validity Evidence Based on Internal Structure

Peer reviewed

You, Hye Sun; Park, Sunyoung; Marshall, Jill A.; Delgado, Cesar – Research in Science Education, 2022

Growing interest in interdisciplinary (ID) understanding has led to the recent development of four ID assessments, none of which have previously been comprehensively validated. Sources of evidence for the validity of tests include construct validity, such as the internal structure of the test. ID tests may (and should) test both disciplinary (D)…

Descriptors: High School Students, College Students, Interdisciplinary Approach, Test Construction

Scientific Evidence for the Validity of the New Mexico Kindergarten Observation Tool. REL 2018-281

Peer reviewed

Dahlke, Katie; Yang, Rui; Martínez, Carmen; Chavez, Suzette; Martin, Alejandra; Hawkinson, Laura; Shields, Joseph; Garland, Marshall; Carle, Jill – Regional Educational Laboratory Southwest, 2017

The New Mexico Public Education Department developed the Kindergarten Observation Tool (KOT) as a multidimensional observational measure of students' knowledge and skills at kindergarten entry. The primary purpose of the KOT is to inform instruction, so that kindergarten teachers can use the information about their students' knowledge and skills…

Descriptors: Test Validity, Observation, Measures (Individuals), Kindergarten

Measuring Teacher Self-Report on Classroom Practices: Construct Validity and Reliability of the Classroom Strategies Scale-Teacher Form

Peer reviewed

Reddy, Linda A.; Dudek, Christopher M.; Fabiano, Gregory A.; Peters, Stephanie – School Psychology Quarterly, 2015

This article presents information about the construct validity and reliability of a new teacher self-report measure of classroom instructional and behavioral practices (the Classroom Strategies Scales-Teacher Form; CSS-T). The theoretical underpinnings and empirical basis for the instructional and behavioral management scales are presented.…

Descriptors: Measurement Techniques, Construct Validity, Test Validity, Test Reliability

Psychometric Properties of the Inventory of Student Experiences in Undergraduate Research

Peer reviewed

Cater, Melissa; Ferstel, Sarah D.; O'Neil, Carol E. – Journal of General Education, 2016

Student participation in undergraduate research (ugr) may be influenced by interest in research, future career and educational plans, perceived value of undergraduate research experiences, or perceived competence in research skills. The purpose of this study was to develop a questionnaire that could be used to validly and reliably assess students'…

Descriptors: Undergraduate Students, Student Experience, Questionnaires, Test Construction

Development and Construct Validity of the Classroom Strategies Scale-Observer Form

Peer reviewed

Reddy, Linda A.; Fabiano, Gregory; Dudek, Christopher M.; Hsu, Louis – School Psychology Quarterly, 2013

Research on progress monitoring has almost exclusively focused on student behavior and not on teacher practices. This article presents the development and validation of a new teacher observational assessment (Classroom Strategies Scale) of classroom instructional and behavioral management practices. The theoretical underpinnings and empirical…

Descriptors: Test Construction, Construct Validity, Test Validity, Observation

The Singapore-Cambridge General Certificate of Education Advanced-Level General Paper Examination

Peer reviewed

Hassan, Nurul Huda; Shih, Chih-Min – Language Assessment Quarterly, 2013

This article describes and reviews the Singapore-Cambridge General Certificate of Education Advanced Level General Paper (GP) examination. As a written test that is administered to preuniversity students, the GP examination is internationally recognised and accepted by universities and employers as proof of English competence. In this article, the…

Descriptors: Foreign Countries, College Entrance Examinations, English (Second Language), Writing Tests

Addressing the Lack of Measurement Invariance for the Measure of Acceptance of the Theory of Evolution

Peer reviewed

Wagler, Amy; Wagler, Ron – International Journal of Science Education, 2013

The Measure of Acceptance of the Theory of Evolution (MATE) was constructed to be a single-factor instrument that assesses an individual's overall acceptance of evolutionary theory. The MATE was validated and the scores resulting from the MATE were found to be reliable for the population of inservice high school biology teachers. However, many…

Descriptors: Evolution, Theories, Measures (Individuals), Preservice Teachers

A "Conditional" Sense of Fairness in Assessment

Peer reviewed

Mislevy, Robert J.; Haertel, Geneva; Cheng, Britte H.; Ructtinger, Liliana; DeBarger, Angela; Murray, Elizabeth; Rose, David; Gravel, Jenna; Colker, Alexis M.; Rutstein, Daisy; Vendlinski, Terry – Educational Research and Evaluation, 2013

Standardizing aspects of assessments has long been recognized as a tactic to help make evaluations of examinees fair. It reduces variation in irrelevant aspects of testing procedures that could advantage some examinees and disadvantage others. However, recent attention to making assessment accessible to a more diverse population of students…

Descriptors: Testing Accommodations, Access to Education, Testing, Psychometrics

The Nature of Science Instrument-Elementary (NOSI-E): Using Rasch Principles to Develop a Theoretically Grounded Scale to Measure Elementary Student Understanding of the Nature of Science

Peoples, Shelagh – ProQuest LLC, 2012

The purpose of this study was to determine which of three competing models will provide reliable, interpretable, and responsive measures of elementary students' understanding of the nature of science (NOS). The Nature of Science Instrument-Elementary (NOSI-E), a 28-item Rasch-based instrument, was used to assess students' NOS…

Descriptors: Scientific Principles, Science Tests, Elementary School Students, Item Response Theory

Psychometric Properties of the Revised Purdue Spatial Visualization Tests: Visualization of Rotations (The Revised PSVT:R)

Yoon, So Yoon – ProQuest LLC, 2011

Working under classical test theory (CTT) and item response theory (IRT) frameworks, this study investigated psychometric properties of the Revised Purdue Spatial Visualization Tests: Visualization of Rotations (Revised PSVT:R). The original version, the PSVT:R, was designed by Guay (1976) to measure spatial visualization ability in…

Descriptors: Undergraduate Students, Test Bias, Guessing (Tests), Construct Validity

Measuring the Speaking Proficiency of Advanced EFL Learners in China: The CET-SET Solution

Peer reviewed

Zhang, Ying; Elder, Catherine – Language Assessment Quarterly, 2009

The College English Test-Spoken English Test is a nationwide spoken English test designed to assess the oral communicative ability of Chinese university and college students who have undertaken compulsory English study at a Chinese university. This article describes the test and evaluates it in terms of reliability, validity, authenticity,…

Descriptors: Test Results, Language Tests, Rating Scales, Foreign Countries

Cross-Cultural Bias Analysis of Cattell Culture-Fair Intelligence Test

Nenty, H. Johnson – 1986

The Cattell Culture Fair Intelligence Test (CCFIT) was administered to a large sample of American, Nigerian, and Indian adolescents, and item data were examined for cultural bias. The CCFIT was designed to measure fluid intelligence, which is not influenced by cultural differences. Four different item analysis techniques were used to determine…

Descriptors: Construct Validity, Cross Cultural Studies, Cultural Influences, Culture Fair Tests
