Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 3 |
| Since 2017 (last 10 years) | 14 |
| Since 2007 (last 20 years) | 31 |
Descriptor
| Probability | 48 |
| Test Items | 48 |
| Item Response Theory | 19 |
| Statistics | 15 |
| Bayesian Statistics | 13 |
| Maximum Likelihood Statistics | 13 |
| Scores | 12 |
| Difficulty Level | 11 |
| Foreign Countries | 11 |
| Models | 11 |
| Simulation | 11 |
Author
| Mislevy, Robert J. | 2 |
| Wilson, Mark | 2 |
| Yuan, Ke-Hai | 2 |
| Abdel-fattah, Abdel-fattah A. | 1 |
| Abu-Ghazalah, Rashid M. | 1 |
| Agus, Mirian | 1 |
| Andreas Kurz | 1 |
| Atar, Burcu | 1 |
| Baker, Thomas A., III. | 1 |
| Byon, Kevin K. | 1 |
| Can Gürer | 1 |
Education Level
| Higher Education | 9 |
| Postsecondary Education | 7 |
| Elementary Education | 4 |
| Elementary Secondary Education | 4 |
| Secondary Education | 3 |
| Junior High Schools | 2 |
| Middle Schools | 2 |
| Grade 12 | 1 |
| Grade 4 | 1 |
| Grade 8 | 1 |
| High Schools | 1 |
Audience
| Teachers | 2 |
| Practitioners | 1 |
| Researchers | 1 |
Location
| Australia | 2 |
| Canada | 2 |
| Belgium | 1 |
| China | 1 |
| Cyprus | 1 |
| Indonesia | 1 |
| Iran | 1 |
| Israel | 1 |
| Italy (Milan) | 1 |
| Japan | 1 |
| Mexico | 1 |
Assessments and Surveys
| National Assessment of… | 3 |
| Program for International… | 2 |
| Graduate Record Examinations | 1 |
| Trends in International… | 1 |
Clemens Draxler; Andreas Kurz; Can Gürer; Jan Philipp Nolte – Journal of Educational and Behavioral Statistics, 2024
A modified and improved inductive inferential approach to evaluate item discriminations in a conditional maximum likelihood and Rasch modeling framework is suggested. The new approach involves the derivation of four hypothesis tests. It implies a linear restriction of the assumed set of probability distributions in the classical approach that…
Descriptors: Inferences, Test Items, Item Analysis, Maximum Likelihood Statistics
Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023
This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…
Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation
Abu-Ghazalah, Rashid M.; Dubins, David N.; Poon, Gregory M. K. – Applied Measurement in Education, 2023
Multiple choice results are inherently probabilistic outcomes, as correct responses reflect a combination of knowledge and guessing, while incorrect responses additionally reflect blunder, a confidently committed mistake. To objectively resolve knowledge from responses in an MC test structure, we evaluated probabilistic models that explicitly…
Descriptors: Guessing (Tests), Multiple Choice Tests, Probability, Models
Tingir, Seyfullah – ProQuest LLC, 2019
Educators use various statistical techniques to explain relationships between latent and observable variables. One way to model these relationships is to use Bayesian networks as a scoring model. However, adjusting the conditional probability tables (CPT-parameters) to fit a set of observations is still a challenge when using Bayesian networks. A…
Descriptors: Bayesian Statistics, Statistical Analysis, Scoring, Probability
Marcoulides, Katerina M. – Measurement: Interdisciplinary Research and Perspectives, 2018
This study examined the use of Bayesian analysis methods for the estimation of item parameters in a two-parameter logistic item response theory model. Using simulated data under various design conditions with both informative and non-informative priors, the parameter recovery of Bayesian analysis methods was examined. Overall results showed that…
Descriptors: Bayesian Statistics, Item Response Theory, Probability, Difficulty Level
Sandoval-Bravo, Salvador; Celso-Arellano, Pedro Luis; Gualajara, Victor; Coronado, Semei – European Journal of Contemporary Education, 2019
The objective of this study is to analyze the ability of students of the University Center for the Economic Administrative Sciences which forms part of the University of Guadalajara from different economic-administrative undergraduate programs, to solve distinct problems in the area of probability, applying a multiple-choice instrument aligned to…
Descriptors: Probability, Undergraduate Students, Economics Education, Problem Solving
Tsubaki, Michiko; Ogawara, Wataru; Tanaka, Kenta – International Electronic Journal of Mathematics Education, 2020
This study proposes and examines an analytical method with the aim of improving the quality of education and learning by situating the answers to full descriptive questions in probability and statistics to make variables of learners' comprehension of learned content as answer characteristics, based on actual student mistakes. First, we proposed…
Descriptors: Probability, Statistics, Comprehension, Learning Strategies
Wang, Chao; Lu, Hong – Educational Technology & Society, 2018
This study focused on the effect of examinees' ability levels on the relationship between Reflective-Impulsive (RI) cognitive style and item response time in computerized adaptive testing (CAT). A total of 56 students majoring in Educational Technology from Shandong Normal University participated in this study, and their RI cognitive styles were…
Descriptors: Item Response Theory, Computer Assisted Testing, Cognitive Style, Correlation
Agus, Mirian; Peró-Cebollero, Maribel; Guàrdia-Olmos, Joan; Portoghese, Igor; Mascia, Maria Lidia; Penna, Maria Pietronilla – EURASIA Journal of Mathematics, Science and Technology Education, 2020
This paper reports some experiments on probabilistic reasoning designed to investigate the impact of the probabilistic problem presentation format (verbal-numerical and graphical-pictorial) on subjects' confidence in the correctness of their performance, other than the calibration between confidence and accuracy. To understand the potential effect…
Descriptors: Accuracy, Self Efficacy, Context Effect, Statistics
Hauser, Carl; Thum, Yeow Meng; He, Wei; Ma, Lingling – Educational and Psychological Measurement, 2015
When conducting item reviews, analysts evaluate an array of statistical and graphical information to assess the fit of a field test (FT) item to an item response theory model. The process can be tedious, particularly when the number of human reviews (HR) to be completed is large. Furthermore, such a process leads to decisions that are susceptible…
Descriptors: Test Items, Item Response Theory, Research Methodology, Decision Making
Ting, Mu Yu – EURASIA Journal of Mathematics, Science & Technology Education, 2017
Using the capabilities of expert knowledge structures, the researcher prepared test questions on the university calculus topic of "finding the area by integration." The quiz is divided into two types of multiple choice items (one out of four and one out of many). After the calculus course was taught and tested, the results revealed that…
Descriptors: Calculus, Mathematics Instruction, College Mathematics, Multiple Choice Tests
Maeda, Hotaka; Zhang, Bo – International Journal of Testing, 2017
The omega (ω) statistic is reputed to be one of the best indices for detecting answer copying on multiple choice tests, but its performance relies on the accurate estimation of copier ability, which is challenging because responses from the copiers may have been contaminated. We propose an algorithm that aims to identify and delete the suspected…
Descriptors: Cheating, Test Items, Mathematics, Statistics
Dardick, William R.; Mislevy, Robert J. – Educational and Psychological Measurement, 2016
A new variant of the iterative "data = fit + residual" data-analytical approach described by Mosteller and Tukey is proposed and implemented in the context of item response theory psychometric models. Posterior probabilities from a Bayesian mixture model of a Rasch item response theory model and an unscalable latent class are expressed…
Descriptors: Bayesian Statistics, Probability, Data Analysis, Item Response Theory
Mohr, Doris, Ed.; Walcott, Crystal, Ed.; Kloosterman, Peter, Ed. – National Council of Teachers of Mathematics, 2019
"Mathematical Thinking: From Assessment Items to Challenging Tasks" is a compilation of 36 problem-based lessons that encourage students to engage in productive struggle and deep thinking. Its 36 full-length lessons for grades 2-8 are each inspired by an actual test item from the National Assessment of Educational Progress (NAEP).…
Descriptors: Problem Based Learning, Test Items, Elementary School Mathematics, Middle School Mathematics
Liu, Yan; Zumbo, Bruno D.; Gustafson, Paul; Huang, Yi; Kroc, Edward; Wu, Amery D. – Practical Assessment, Research & Evaluation, 2016
A variety of differential item functioning (DIF) methods have been proposed and used for ensuring that a test is fair to all test takers in a target population in the situations of, for example, a test being translated to other languages. However, once a method flags an item as DIF, it is difficult to conclude that the grouping variable (e.g.,…
Descriptors: Test Items, Test Bias, Probability, Scores