Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 5 |
| Since 2017 (last 10 years) | 42 |
| Since 2007 (last 20 years) | 105 |
Descriptor
| Accuracy | 106 |
| Statistical Analysis | 106 |
| Feedback (Response) | 47 |
| Item Response Theory | 37 |
| Foreign Countries | 36 |
| English (Second Language) | 28 |
| Second Language Learning | 28 |
| Second Language Instruction | 24 |
| Error Correction | 21 |
| Pretests Posttests | 21 |
| Test Items | 21 |
Publication Type
| Reports - Research | 98 |
| Journal Articles | 95 |
| Tests/Questionnaires | 8 |
| Dissertations/Theses -… | 6 |
| Speeches/Meeting Papers | 2 |
| Collected Works - Proceedings | 1 |
| Dissertations/Theses -… | 1 |
| Information Analyses | 1 |
Education Level
| Higher Education | 35 |
| Postsecondary Education | 23 |
| Secondary Education | 7 |
| Elementary Education | 6 |
| Early Childhood Education | 5 |
| High Schools | 4 |
| Primary Education | 4 |
| Adult Education | 3 |
| Grade 1 | 2 |
| Grade 7 | 2 |
| Junior High Schools | 2 |
Location
| Iran | 9 |
| Netherlands | 4 |
| China | 3 |
| Australia | 2 |
| Belgium | 2 |
| Indonesia | 2 |
| Iowa | 2 |
| Spain | 2 |
| Arizona | 1 |
| California | 1 |
| California (Santa Barbara) | 1 |
Wu, Tong; Kim, Stella Y.; Westine, Carl – Educational and Psychological Measurement, 2023
For large-scale assessments, data are often collected with missing responses. Despite the wide use of item response theory (IRT) in many testing programs, however, the existing literature offers little insight into the effectiveness of various approaches to handling missing responses in the context of scale linking. Scale linking is commonly used…
Descriptors: Data Analysis, Responses, Statistical Analysis, Measurement
Wind, Stefanie A.; Schumacker, Randall E. – Educational and Psychological Measurement, 2021
Researchers frequently use Rasch models to analyze survey responses because these models provide accurate parameter estimates for items and examinees when there are missing data. However, researchers have not fully considered how missing data affect the accuracy of dimensionality assessment in Rasch analyses such as principal components analysis…
Descriptors: Item Response Theory, Data, Factor Analysis, Accuracy
Benton, Tom; Williamson, Joanna – Research Matters, 2022
Equating methods are designed to adjust between alternate versions of assessments targeting the same content at the same level, with the aim that scores from the different versions can be used interchangeably. The statistical processes used in equating have, however, been extended to statistically "link" assessments that differ, such as…
Descriptors: Statistical Analysis, Equated Scores, Definitions, Alternative Assessment
Alahmadi, Sarah; Jones, Andrew T.; Barry, Carol L.; Ibáñez, Beatriz – Applied Measurement in Education, 2023
Rasch common-item equating is often used in high-stakes testing to maintain equivalent passing standards across test administrations. If unaddressed, item parameter drift poses a major threat to the accuracy of Rasch common-item equating. We compared the performance of well-established and newly developed drift detection methods in small and large…
Descriptors: Equated Scores, Item Response Theory, Sample Size, Test Items
Kalkan, Ömür Kaya – Measurement: Interdisciplinary Research and Perspectives, 2022
The four-parameter logistic (4PL) Item Response Theory (IRT) model has recently been reconsidered in the literature due to the advances in the statistical modeling software and the recent developments in the estimation of the 4PL IRT model parameters. The current simulation study evaluated the performance of expectation-maximization (EM),…
Descriptors: Comparative Analysis, Sample Size, Test Length, Algorithms
Luo, Jiaorong; Yang, Mingcheng; Wang, Ling – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2023
The increased Simon effect with increasing the ratio of congruent trials may be interpreted by both attention modulation and irrelevant stimulus-response (S-R) associations learning accounts, although the reversed Simon effect with increasing the ratio of incongruent trials provides evidence supporting the latter account. To investigate if…
Descriptors: Foreign Countries, Responses, Reaction Time, Accuracy
Xiao, Jiaying; Bulut, Okan – Educational and Psychological Measurement, 2020
Large amounts of missing data could distort item parameter estimation and lead to biased ability estimates in educational assessments. Therefore, missing responses should be handled properly before estimating any parameters. In this study, two Monte Carlo simulation studies were conducted to compare the performance of four methods in handling…
Descriptors: Data, Computation, Ability, Maximum Likelihood Statistics
Zhang, Zhonghua – Applied Measurement in Education, 2020
The characteristic curve methods have been applied to estimate the equating coefficients in test equating under the graded response model (GRM). However, the approaches for obtaining the standard errors for the estimates of these coefficients have not been developed and examined. In this study, the delta method was applied to derive the…
Descriptors: Error of Measurement, Computation, Equated Scores, True Scores
Castellano, Katherine E.; McCaffrey, Daniel F. – Journal of Educational Measurement, 2020
The residual gain score has been of historical interest, and its percentile rank has been of interest more recently given its close correspondence to the popular Student Growth Percentile. However, these estimators suffer from low accuracy and systematic bias (bias conditional on prior latent achievement). This article explores three…
Descriptors: Accuracy, Student Evaluation, Measurement Techniques, Evaluation Methods
Chou, Winston; Imai, Kosuke; Rosenfeld, Bryn – Sociological Methods & Research, 2020
Scholars increasingly rely on indirect questioning techniques to reduce social desirability bias and item nonresponse for sensitive survey questions. The major drawback of these approaches, however, is their inefficiency relative to direct questioning. We show how to improve the statistical analysis of the list experiment, randomized response…
Descriptors: Surveys, Test Items, Questioning Techniques, Statistical Analysis
Qiu, Yuxi; Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2019
This study aimed to assess the accuracy of the empirical item characteristic curve (EICC) preequating method given the presence of test speededness. The simulation design of this study considered the proportion of speededness, speededness point, speededness rate, proportion of missing on speeded items, sample size, and test length. After crossing…
Descriptors: Accuracy, Equated Scores, Test Items, Nonparametric Statistics
Bashkov, Bozhidar M.; Clauser, Jerome C. – Practical Assessment, Research & Evaluation, 2019
Successful testing programs rely on high-quality test items to produce reliable scores and defensible exams. However, determining what statistical screening criteria are most appropriate to support these goals can be daunting. This study describes and demonstrates cost-benefit analysis as an empirical approach to determining appropriate screening…
Descriptors: Test Items, Test Reliability, Evaluation Criteria, Accuracy
Freedberg, Michael; Schacherer, Jonathan; Hazeltine, Eliot – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2016
Reward has been shown to change behavior as a result of incentive learning (by motivating the individual to increase their effort) and instrumental learning (by increasing the frequency of a particular behavior). However, Palminteri et al. (2011) demonstrated that reward can also improve the incidental learning of a motor skill even when…
Descriptors: Incidental Learning, Associative Learning, Rewards, Incentives
Peterson, Dwight J.; Naveh-Benjamin, Moshe – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2017
An important yet unresolved question regarding visual working memory (VWM) relates to whether or not binding processes within VWM require additional attentional resources compared with processing solely the individual components comprising these bindings. Previous findings indicate that binding of surface features (e.g., colored shapes) within VWM…
Descriptors: Attention, Short Term Memory, Visual Perception, Cognitive Processes
Kang, Hyeon-Ah; Lu, Ying; Chang, Hua-Hua – Applied Measurement in Education, 2017
Increasing use of item pools in large-scale educational assessments calls for an appropriate scaling procedure to achieve a common metric among field-tested items. The present study examines scaling procedures for developing a new item pool under a spiraled block linking design. The three scaling procedures are considered: (a) concurrent…
Descriptors: Item Response Theory, Accuracy, Educational Assessment, Test Items

