ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	3
Since 2007 (last 20 years)	9

Descriptor

Comparative Analysis	9
Computation	9
Nonparametric Statistics	9
Simulation	4
Item Response Theory	3
Scores	3
Statistical Analysis	3
Test Length	3
Error of Measurement	2
Test Items	2
Academic Standards	1
Affective Behavior	1
Anxiety	1
Audiovisual Instruction	1
Bias	1
Change	1
Cognitive Processes	1
Computer Games	1
Computer Software	1
Computer Uses in Education	1
Correlation	1
Data	1
Educational Games	1
Effect Size	1
Evaluation Methods	1
More ▼

Source

Applied Psychological…	3
Applied Measurement in…	1
Educational Assessment	1
Educational Testing Service	1
IEEE Transactions on Learning…	1
Journal of Behavioral…	1
Journal of Educational and…	1

Author

Sinharay, Sandip	2
Codding, Robin S.	1
Cowell, Ryan	1
Cui, Zhongmin	1
Gloster, Andrew T.	1
Gould, Kaitlin	1
Guo, Hongwen	1
Hooper, Jay	1
Ketamo, Harri	1
Kiili, Kristian	1
Kleinert, Whitney L.	1
Klotsche, Jens	1
Kolen, Michael J.	1
Minami, Takuya	1
Nandakumar, Ratna	1
Penfield, Randall D.	1
Yu, Feng	1
Zhang, Yanwei	1
More ▼

Publication Type

Journal Articles	8
Reports - Research	7
Reports - Evaluative	2
Speeches/Meeting Papers	1

Education Level

Grade 6	1
Higher Education	1
Postsecondary Education	1

Audience

Location

Finland

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Are the Nonparametric Person-Fit Statistics More Powerful than Their Parametric Counterparts? Revisiting the Simulations in Karabatsos (2003)

Peer reviewed

Direct link

Sinharay, Sandip – Applied Measurement in Education, 2017

Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristics curves and found the "H[superscript T]" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…

Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis

A Meta-Analysis of the Taped Problems Intervention

Peer reviewed

Direct link

Kleinert, Whitney L.; Codding, Robin S.; Minami, Takuya; Gould, Kaitlin – Journal of Behavioral Education, 2018

Taped problems is an intervention strategy for addressing mathematics fluency that has been evaluated in multiple single-case design studies. Although its efficacy has been supported in individual studies, no comprehensive quantitative synthesis has been conducted on taped problems. The purpose of this study was to synthesize the literature that…

Descriptors: Meta Analysis, Intervention, Statistical Analysis, Literature Reviews

Evaluating Cognitive and Affective Outcomes of a Digital Game-Based Math Test

Peer reviewed

Direct link

Kiili, Kristian; Ketamo, Harri – IEEE Transactions on Learning Technologies, 2018

Even though digital learning games have become common in education, relatively little is known about the usefulness of game-based assessment. This paper aims to explore if a game-based math test can provide added value to math education with respect to cognitive and affective outcomes. We used in-game measures, embedded in the game called Semideus…

Descriptors: Mathematics Tests, Outcomes of Education, Fractions, Grade 6

Standards-Based Grading: History Adjusted True Score

Peer reviewed

Direct link

Hooper, Jay; Cowell, Ryan – Educational Assessment, 2014

There has been much research and discussion on the principles of standards-based grading, and there is a growing consensus of best practice. Even so, the actual process of implementing standards-based grading at a school or district level can be a significant challenge. There are very practical questions that remain unclear, such as how the grades…

Descriptors: True Scores, Grading, Academic Standards, Computation

Measurement Error in Nonparametric Item Response Curve Estimation. Research Report. ETS RR-11-28

Download full text

Guo, Hongwen; Sinharay, Sandip – Educational Testing Service, 2011

Nonparametric, or kernel, estimation of item response curve (IRC) is a concern theoretically and operationally. Accuracy of this estimation, often used in item analysis in testing programs, is biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. In this study, we investigate…

Descriptors: Error of Measurement, Nonparametric Statistics, Item Response Theory, Computation

Estimating a Meaningful Point of Change: A Comparison of Exploratory Techniques Based on Nonparametric Regression

Peer reviewed

Direct link

Klotsche, Jens; Gloster, Andrew T. – Journal of Educational and Behavioral Statistics, 2012

Longitudinal studies are increasingly common in psychological research. Characterized by repeated measurements, longitudinal designs aim to observe phenomena that change over time. One important question involves identification of the exact point in time when the observed phenomena begin to meaningfully change above and beyond baseline…

Descriptors: Longitudinal Studies, Psychological Studies, Nonparametric Statistics, Regression (Statistics)

A Comparison of Bias Correction Adjustments for the DETECT Procedure

Peer reviewed

Direct link

Nandakumar, Ratna; Yu, Feng; Zhang, Yanwei – Applied Psychological Measurement, 2011

DETECT is a nonparametric methodology to identify the dimensional structure underlying test data. The associated DETECT index, "D[subscript max]," denotes the degree of multidimensionality in data. Conditional covariances (CCOV) are the building blocks of this index. In specifying population CCOVs, the latent test composite [theta][subscript TT]…

Descriptors: Nonparametric Statistics, Statistical Analysis, Tests, Data

Three Classes of Nonparametric Differential Step Functioning Effect Estimators

Peer reviewed

Direct link

Penfield, Randall D. – Applied Psychological Measurement, 2008

The examination of measurement invariance in polytomous items is complicated by the possibility that the magnitude and sign of lack of invariance may vary across the steps underlying the set of polytomous response options, a concept referred to as differential step functioning (DSF). This article describes three classes of nonparametric DSF effect…

Descriptors: Simulation, Nonparametric Statistics, Item Response Theory, Computation

Comparison of Parametric and Nonparametric Bootstrap Methods for Estimating Random Error in Equipercentile Equating

Peer reviewed

Direct link

Cui, Zhongmin; Kolen, Michael J. – Applied Psychological Measurement, 2008

This article considers two methods of estimating standard errors of equipercentile equating: the parametric bootstrap method and the nonparametric bootstrap method. Using a simulation study, these two methods are compared under three sample sizes (300, 1,000, and 3,000), for two test content areas (the Iowa Tests of Basic Skills Maps and Diagrams…

Descriptors: Test Length, Test Content, Simulation, Computation