Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 3 |
| Since 2007 (last 20 years) | 9 |
Descriptor
| Comparative Analysis | 9 |
| Computation | 9 |
| Nonparametric Statistics | 9 |
| Simulation | 4 |
| Item Response Theory | 3 |
| Scores | 3 |
| Statistical Analysis | 3 |
| Test Length | 3 |
| Error of Measurement | 2 |
| Test Items | 2 |
| Academic Standards | 1 |
| More ▼ | |
Source
| Applied Psychological… | 3 |
| Applied Measurement in… | 1 |
| Educational Assessment | 1 |
| Educational Testing Service | 1 |
| IEEE Transactions on Learning… | 1 |
| Journal of Behavioral… | 1 |
| Journal of Educational and… | 1 |
Author
| Sinharay, Sandip | 2 |
| Codding, Robin S. | 1 |
| Cowell, Ryan | 1 |
| Cui, Zhongmin | 1 |
| Gloster, Andrew T. | 1 |
| Gould, Kaitlin | 1 |
| Guo, Hongwen | 1 |
| Hooper, Jay | 1 |
| Ketamo, Harri | 1 |
| Kiili, Kristian | 1 |
| Kleinert, Whitney L. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 8 |
| Reports - Research | 7 |
| Reports - Evaluative | 2 |
| Speeches/Meeting Papers | 1 |
Education Level
| Grade 6 | 1 |
| Higher Education | 1 |
| Postsecondary Education | 1 |
Audience
Location
| Finland | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Sinharay, Sandip – Applied Measurement in Education, 2017
Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristics curves and found the "H[superscript T]" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…
Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis
Kleinert, Whitney L.; Codding, Robin S.; Minami, Takuya; Gould, Kaitlin – Journal of Behavioral Education, 2018
Taped problems is an intervention strategy for addressing mathematics fluency that has been evaluated in multiple single-case design studies. Although its efficacy has been supported in individual studies, no comprehensive quantitative synthesis has been conducted on taped problems. The purpose of this study was to synthesize the literature that…
Descriptors: Meta Analysis, Intervention, Statistical Analysis, Literature Reviews
Kiili, Kristian; Ketamo, Harri – IEEE Transactions on Learning Technologies, 2018
Even though digital learning games have become common in education, relatively little is known about the usefulness of game-based assessment. This paper aims to explore if a game-based math test can provide added value to math education with respect to cognitive and affective outcomes. We used in-game measures, embedded in the game called Semideus…
Descriptors: Mathematics Tests, Outcomes of Education, Fractions, Grade 6
Hooper, Jay; Cowell, Ryan – Educational Assessment, 2014
There has been much research and discussion on the principles of standards-based grading, and there is a growing consensus of best practice. Even so, the actual process of implementing standards-based grading at a school or district level can be a significant challenge. There are very practical questions that remain unclear, such as how the grades…
Descriptors: True Scores, Grading, Academic Standards, Computation
Guo, Hongwen; Sinharay, Sandip – Educational Testing Service, 2011
Nonparametric, or kernel, estimation of item response curve (IRC) is a concern theoretically and operationally. Accuracy of this estimation, often used in item analysis in testing programs, is biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. In this study, we investigate…
Descriptors: Error of Measurement, Nonparametric Statistics, Item Response Theory, Computation
Klotsche, Jens; Gloster, Andrew T. – Journal of Educational and Behavioral Statistics, 2012
Longitudinal studies are increasingly common in psychological research. Characterized by repeated measurements, longitudinal designs aim to observe phenomena that change over time. One important question involves identification of the exact point in time when the observed phenomena begin to meaningfully change above and beyond baseline…
Descriptors: Longitudinal Studies, Psychological Studies, Nonparametric Statistics, Regression (Statistics)
Nandakumar, Ratna; Yu, Feng; Zhang, Yanwei – Applied Psychological Measurement, 2011
DETECT is a nonparametric methodology to identify the dimensional structure underlying test data. The associated DETECT index, "D[subscript max]," denotes the degree of multidimensionality in data. Conditional covariances (CCOV) are the building blocks of this index. In specifying population CCOVs, the latent test composite [theta][subscript TT]…
Descriptors: Nonparametric Statistics, Statistical Analysis, Tests, Data
Penfield, Randall D. – Applied Psychological Measurement, 2008
The examination of measurement invariance in polytomous items is complicated by the possibility that the magnitude and sign of lack of invariance may vary across the steps underlying the set of polytomous response options, a concept referred to as differential step functioning (DSF). This article describes three classes of nonparametric DSF effect…
Descriptors: Simulation, Nonparametric Statistics, Item Response Theory, Computation
Cui, Zhongmin; Kolen, Michael J. – Applied Psychological Measurement, 2008
This article considers two methods of estimating standard errors of equipercentile equating: the parametric bootstrap method and the nonparametric bootstrap method. Using a simulation study, these two methods are compared under three sample sizes (300, 1,000, and 3,000), for two test content areas (the Iowa Tests of Basic Skills Maps and Diagrams…
Descriptors: Test Length, Test Content, Simulation, Computation

Peer reviewed
Direct link
