ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	5
Since 2017 (last 10 years)	8
Since 2007 (last 20 years)	48

Descriptor

Classification	79
Evaluation Methods	79
Measurement Techniques	22
Computer Assisted Testing	21
Educational Assessment	20
Educational Testing	20
Psychometrics	20
Models	19
Comparative Analysis	15
Student Evaluation	15
Testing	15
Diagnostic Tests	13
Foreign Countries	13
Measurement	13
Test Items	13
Definitions	10
Hypothesis Testing	10
Item Response Theory	10
Test Construction	10
Tests	9
Evaluation Problems	8
Research Methodology	8
Test Theory	8
Testing Problems	8
Cognitive Processes	7
More ▼

Publication Type

Journal Articles	50
Reports - Research	25
Opinion Papers	17
Reports - Evaluative	12
Reports - Descriptive	10
Speeches/Meeting Papers	8
Information Analyses	4
Collected Works - Proceedings	3
Dissertations/Theses -…	3
Books	2
Collected Works - General	1
Guides - Classroom - Teacher	1
Guides - Non-Classroom	1
More ▼

Education Level

Elementary Secondary Education	12
Higher Education	5
Postsecondary Education	4
Secondary Education	4
Elementary Education	3
Early Childhood Education	2
Grade 4	2
Grade 6	2
Grade 8	2
Junior High Schools	2
Middle Schools	2
Grade 1	1
Grade 2	1
Grade 3	1
Grade 5	1
Grade 7	1
High Schools	1
Intermediate Grades	1
Kindergarten	1
Primary Education	1
More ▼

Audience

Practitioners	3
Researchers	3
Teachers	2
Administrators	1
Policymakers	1
Students	1

Location

United States	4
United Kingdom	3
United Kingdom (England)	3
Australia	2
California	2
Israel	2
Netherlands	2
Taiwan	2
United Kingdom (Wales)	2
Asia	1
Brazil	1
Canada	1
China	1
Connecticut	1
Denmark	1
Egypt	1
Estonia	1
Finland	1
Florida	1
France	1
Germany	1
Greece	1
Hawaii	1
Ireland	1
Italy	1
More ▼

Laws, Policies, & Programs

Education for All Handicapped…	1
No Child Left Behind Act 2001	1

Assessments and Surveys

Advanced Placement…	2
SAT (College Admission Test)	2
Wechsler Intelligence Scale…	2
Bem Sex Role Inventory	1
California Achievement Tests	1
Kaufman Assessment Battery…	1
Lorge Thorndike Intelligence…	1
Program for International…	1
System of Multicultural…	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 79 results Save | Export

Embeddings for Automatic Short Answer Grading: A Scoping Review

Peer reviewed

Direct link

Putnikovic, Marko; Jovanovic, Jelena – IEEE Transactions on Learning Technologies, 2023

Automatic grading of short answers is an important task in computer-assisted assessment (CAA). Recently, embeddings, as semantic-rich textual representations, have been increasingly used to represent short answers and predict the grade. Despite the recent trend of applying embeddings in automatic short answer grading (ASAG), there are no…

Descriptors: Automation, Computer Assisted Testing, Grading, Natural Language Processing

Estimating Classification Accuracy and Consistency Indices for Multiple Measures with the Simple Structure MIRT Model

Peer reviewed

Direct link

Park, Seohee; Kim, Kyung Yong; Lee, Won-Chan – Journal of Educational Measurement, 2023

Multiple measures, such as multiple content domains or multiple types of performance, are used in various testing programs to classify examinees for screening or selection. Despite the popular usages of multiple measures, there is little research on classification consistency and accuracy of multiple measures. Accordingly, this study introduces an…

Descriptors: Testing, Computation, Classification, Accuracy

Detecting Compromised Items with Response Times Using a Bayesian Change-Point Approach

Peer reviewed

Direct link

Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025

Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…

Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment

Evaluating Methods for Assessing Model Fit in Diagnostic Classification Models

Peer reviewed
PDF on ERIC

Download full text

W. Jake Thompson – Grantee Submission, 2024

Diagnostic classification models (DCMs) are psychometric models that can be used to estimate the presence or absence of psychological traits, or proficiency on fine-grained skills. Critical to the use of any psychometric model in practice, including DCMs, is an evaluation of model fit. Traditionally, DCMs have been estimated with maximum…

Descriptors: Bayesian Statistics, Classification, Psychometrics, Goodness of Fit

Three Metrics for Monitoring Educational Progress When Tested Populations Change

Peer reviewed

Direct link

Andrew Ho – Teachers College Record, 2025

Background/Context: Public monitoring of educational progress and inequality often involves tracking changes in the percentage of "proficient" students across groups and over time. These trends are important signals of state and district provision of educational opportunity. I show how known flaws of this percentage metric, sometimes…

Descriptors: Educational Assessment, Progress Monitoring, Educational Trends, Educational Opportunities

A Set-Theoretic Approach to Bayesian Process Tracing

Peer reviewed

Direct link

Barrenechea, Rodrigo; Mahoney, James – Sociological Methods & Research, 2019

This article develops a set-theoretic approach to Bayes's theorem and Bayesian process tracing. In the approach, hypothesis testing is the procedure whereby one updates beliefs by narrowing the range of states of the world that are regarded as possible, thus diminishing the domain in which the actual world can reside. By explicitly connecting…

Descriptors: Bayesian Statistics, Hypothesis Testing, Qualitative Research, Research Methodology

Document Level Assessment of Document Retrieval Systems in a Pairwise System Evaluation

Peer reviewed
PDF on ERIC

Download full text

Rajagopal, Prabha; Ravana, Sri Devi – Information Research: An International Electronic Journal, 2017

Introduction: The use of averaged topic-level scores can result in the loss of valuable data and can cause misinterpretation of the effectiveness of system performance. This study aims to use the scores of each document to evaluate document retrieval systems in a pairwise system evaluation. Method: The chosen evaluation metrics are document-level…

Descriptors: Information Retrieval, Documentation, Scores, Information Systems

A Monte Carlo Simulation Comparing the Statistical Precision of Two High-Stakes Teacher Evaluation Methods: A Value-Added Model and a Composite Measure

Direct link

Spencer, Bryden – ProQuest LLC, 2016

Value-added models are a class of growth models used in education to assign responsibility for student growth to teachers or schools. For value-added models to be used fairly, sufficient statistical precision is necessary for accurate teacher classification. Previous research indicated precision below practical limits. An alternative approach has…

Descriptors: Monte Carlo Methods, Comparative Analysis, Accuracy, High Stakes Tests

Using the Clinical Interview and Curriculum Based Measurement to Examine Risk Levels

Peer reviewed

Direct link

Ginsburg, Herbert P.; Lee, Young-Sun; Pappas, Sandra – ZDM: The International Journal on Mathematics Education, 2016

This paper investigates the power of the computer guided clinical interview (CI) and new curriculum based measurement (CBM) measures to identify and help children at risk of low mathematics achievement. We use data from large numbers of children in Kindergarten through Grade 3 to investigate the construct validity of CBM risk categories. The basic…

Descriptors: Interviews, Curriculum Based Assessment, Evaluation Methods, At Risk Students

A Comparison of Computer-Based Classification Testing Approaches Using Mixed-Format Tests with the Generalized Partial Credit Model

Direct link

Kim, Jiseon – ProQuest LLC, 2010

Classification testing has been widely used to make categorical decisions by determining whether an examinee has a certain degree of ability required by established standards. As computer technologies have developed, classification testing has become more computerized. Several approaches have been proposed and investigated in the context of…

Descriptors: Test Length, Computer Assisted Testing, Classification, Probability

Comparison between Dichotomous and Polytomous Scoring of Innovative Items in a Large-Scale Computerized Adaptive Test

Peer reviewed

Direct link

Jiao, Hong; Liu, Junhui; Haynie, Kathleen; Woo, Ada; Gorham, Jerry – Educational and Psychological Measurement, 2012

This study explored the impact of partial credit scoring of one type of innovative items (multiple-response items) in a computerized adaptive version of a large-scale licensure pretest and operational test settings. The impacts of partial credit scoring on the estimation of the ability parameters and classification decisions in operational test…

Descriptors: Test Items, Computer Assisted Testing, Measures (Individuals), Scoring

The Versatility of SpAM: A Fast, Efficient, Spatial Method of Data Collection for Multidimensional Scaling

Peer reviewed

Direct link

Hout, Michael C.; Goldinger, Stephen D.; Ferguson, Ryan W. – Journal of Experimental Psychology: General, 2013

Although traditional methods to collect similarity data (for multidimensional scaling [MDS]) are robust, they share a key shortcoming. Specifically, the possible pairwise comparisons in any set of objects grow rapidly as a function of set size. This leads to lengthy experimental protocols, or procedures that involve scaling stimulus subsets. We…

Descriptors: Visual Stimuli, Research Methodology, Problem Solving, Multidimensional Scaling

Relationships between Lower Limb Muscle Strength and Locomotor Capacity in Children and Adolescents with Cerebral Palsy Who Walk Independently

Peer reviewed

Direct link

Ferland, Chantale; Lepage, Celine; Moffet, Helene; Maltais, Desiree B. – Physical & Occupational Therapy in Pediatrics, 2012

This study aimed to quantify relationships between lower limb muscle strength and locomotor capacity for children and adolescents with cerebral palsy (CP) to identify key muscle groups for strength training. Fifty 6- to 16-year-olds with CP (Gross Motor Function Classification System level I or II) participated. Isometric muscle strength of hip…

Descriptors: Muscular Strength, Physical Fitness, Cerebral Palsy, Classification

In School Settings, Are All RCTs (Randomized Control Trials) Exploratory?

Direct link

Newman, Denis; Jaciw, Andrew P. – Empirical Education Inc., 2012

The motivation for this paper is the authors' recent work on several randomized control trials in which they found the primary result, which averaged across subgroups or sites, to be moderated by demographic or site characteristics. They are led to examine a distinction that the Institute of Education Sciences (IES) makes between "confirmatory"…

Descriptors: Educational Research, Research Methodology, Research Design, Classification

A Proposed Framework of Test Administration Methods

Peer reviewed

Direct link

Thompson, Nathan A. – Journal of Applied Testing Technology, 2008

The widespread application of personal computers to educational and psychological testing has substantially increased the number of test administration methodologies available to testing programs. Many of these mediums are referred to by their acronyms, such as CAT, CBT, CCT, and LOFT. The similarities between the acronyms and the methods…

Descriptors: Testing Programs, Psychological Testing, Classification, Educational Testing

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Measurement:…	15
Educational and Psychological…	3
ProQuest LLC	3
International Educational…	2
Journal of Applied Testing…	2
Journal of Child Sexual Abuse	2
Applied Measurement in…	1
Assessment in Education:…	1
Audiovisual Instruction	1
Corwin Press	1
EDUCAUSE Quarterly	1
ETS Research Report Series	1
Education Next	1
Educational Research and…	1
Educational Technology &…	1
Empirical Education Inc.	1
Eye on Education	1
Grantee Submission	1
Higher Education Research and…	1
IEEE Transactions on Learning…	1
Information Research: An…	1
International Association for…	1
International Journal of…	1
Journal of Child Psychology…	1
Journal of Educational…	1
More ▼

Jiao, Hong	2
Rupp, Andre A.	2
Wilhelm, Oliver	2
Abedi, Jamal	1
Andrew Ho	1
Bagnato, Stephen J.	1
Baker, Russell K.	1
Barnes, Tiffany, Ed.	1
Barrenechea, Rodrigo	1
Batinic, Bernad	1
Bechger, Timo	1
Beguin, A. A.	1
Bejar, Isaac I.	1
Ben-David, Arie	1
Ben-Shalom, Uri	1
Bolen, Jacqueline M.	1
Boulton-Lewis, Gillian M.	1
Boyle, Michael H.	1
Buckendahl, Chad W.	1
COX, RICHARD C.	1
CROMWELL, RUE L.	1
Carstensen, Claus H.	1
Chang, Wen-Chih	1
Chao, Louis R.	1
More ▼