Carmen Batanero; Luis A. Hernandez-Solis; Maria M. Gea – Statistics Education Research Journal, 2023
We present an exploratory study of Costa Rican and Spanish students' (11-16-year-olds) competence to compare probabilities in urns and compare ratios in mixture problems. A sample of 704 students in Grades 6 through to Grade 10, 292 from Costa Rica and 412 from Spain, were given one of two forms of a questionnaire with three probability comparison…
Descriptors: Statistics Education, Comparative Analysis, Foreign Countries, Probability
Kolarec, Biserka; Nincevic, Marina – International Society for Technology, Education, and Science, 2022
The object of research is a statistics exam that contains problem tasks. One examiner performed two exam evaluation methods to repeatedly evaluate the exam. The goal was to compare the methods for objectivity. One of the two exam evaluation methods we call a serial evaluation method. The serial evaluation method assumes evaluation of all exam…
Descriptors: Statistics Education, Mathematics Tests, Evaluation Methods, Test Construction
Chu, Wei; Pavlik, Philip I., Jr. – International Educational Data Mining Society, 2023
In adaptive learning systems, various models are employed to obtain the optimal learning schedule and review for a specific learner. Models of learning are used to estimate the learner's current recall probability by incorporating features or predictors proposed by psychological theory or empirically relevant to learners' performance. Logistic…
Descriptors: Reaction Time, Accuracy, Models, Predictor Variables
National Assessment Governing Board, 2017
Since 1973, the National Assessment of Educational Progress (NAEP) has gathered information about student achievement in mathematics. Results of these periodic assessments, produced in print and web-based formats, provide valuable information to a wide variety of audiences. They inform citizens about the nature of students' comprehension of the…
Descriptors: Mathematics Tests, Mathematics Achievement, Mathematics Instruction, Grade 4
Eggen, Theo J. H. M. – Educational Research and Evaluation, 2011
If classification in a limited number of categories is the purpose of testing, computerized adaptive tests (CATs) with algorithms based on sequential statistical testing perform better than estimation-based CATs (e.g., Eggen & Straetmans, 2000). In these computerized classification tests (CCTs), the Sequential Probability Ratio Test (SPRT) (Wald,…
Descriptors: Test Length, Adaptive Testing, Classification, Item Analysis
Wang, Wen-Chung; Huang, Sheng-Yun – Educational and Psychological Measurement, 2011
The one-parameter logistic model with ability-based guessing (1PL-AG) has been recently developed to account for effect of ability on guessing behavior in multiple-choice items. In this study, the authors developed algorithms for computerized classification testing under the 1PL-AG and conducted a series of simulations to evaluate their…
Descriptors: Computer Assisted Testing, Classification, Item Analysis, Probability
Beretvas, S. Natasha; Williams, Natasha J. – Journal of Educational Measurement, 2004
To assess item dimensionality, the following two approaches are described and compared: hierarchical generalized linear model (HGLM) and multidimensional item response theory (MIRT) model. Two generating models are used to simulate dichotomous responses to a 17-item test: the unidimensional and compensatory two-dimensional (C2D) models. For C2D…
Descriptors: Item Response Theory, Test Items, Mathematics Tests, Reading Ability
Dinero, Thomas E.; Haertel, Edward – 1976
This paper will discuss the results of a series of computer simulations comparing the Rasch logistic model to a series of models departing to various degrees from its assumption of equal discrimination power for all items. The results have implications for test construction and test scoring, indicating how closely the conventional raw score…
Descriptors: Comparative Analysis, Computer Programs, Goodness of Fit, Individual Differences
Ree, Malcolm James – 1978
Item characteristic curve (ICC) theory describes the relationship between the ability of individuals and the probability of their answering a test question correctly; it is useful in estimating test scores, equating the scores of various tests, and scoring responses during adaptive testing. A simulation study of the effectiveness of the following…
Descriptors: Ability, Comparative Analysis, Computer Programs, Item Analysis
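As an illustration of the relationship the abstract above describes, here is a minimal sketch of a logistic item characteristic curve. It is not taken from the report: the function name `icc` and the parameters `a` (discrimination), `b` (difficulty), and `c` (lower asymptote for guessing) follow common item response theory notation and are assumptions made for this example.

```python
import math


def icc(theta, b, a=1.0, c=0.0):
    """Probability of a correct answer for an examinee of ability theta.

    Hypothetical helper for illustration only:
    b is the item difficulty, a the discrimination, and c the lower
    asymptote (the chance of answering correctly by guessing).
    With a=1 and c=0 this reduces to a Rasch-type curve.
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


if __name__ == "__main__":
    # Probability rises with ability; the guessing parameter lifts the floor.
    for theta in (-2.0, 0.0, 2.0):
        no_guessing = icc(theta, b=0.0)
        with_guessing = icc(theta, b=0.0, c=0.25)
        print(f"theta={theta:+.1f}  c=0: {no_guessing:.3f}  c=0.25: {with_guessing:.3f}")
```

When ability equals difficulty and there is no guessing, the curve passes through 0.5, which is why `b` is usually read as the item's location on the ability scale.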
Bart, William M.; Airasian, Peter W. – 1976
The question of whether test factor structure is indicative of the test item hierarchy was examined. Data from 1,000 subjects on two sets of five bivalued Law School Admission Test items, which were analyzed with latent trait methods of Bock and Lieberman and of Christoffersson in Psychometrika, were analyzed with an ordering-theoretic method to…
Descriptors: Comparative Analysis, Correlation, Factor Analysis, Factor Structure
Lord, Frederic M. – 1971
A flexilevel test is found to be inferior to a peaked conventional test for measuring examinees in the middle of the ability range, superior for examinees at the extremes. Throughout the entire range of ability, a flexilevel test is much superior to any conventional test that attempts to provide accurate measurement at both extremes. See also ED…
Descriptors: Ability, Comparative Analysis, Difficulty Level, Guessing (Tests)
Brennan, Robert L.; Lockwood, Robert E. – 1979
Procedures for determining cutting scores have been proposed by Angoff and by Nedelsky. Nedelsky's approach requires that a rater examine each distractor within a test item to determine the probability of a minimally competent examinee answering correctly; whereas Angoff uses a judgment based on the whole item, rather than each of its components.…
Descriptors: Achievement Tests, Comparative Analysis, Cutting Scores, Guessing (Tests)
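The contrast between the two procedures can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' procedure: it uses a single rater, assumes the minimally competent examinee guesses uniformly among the options they cannot rule out, and the function names and sample ratings are hypothetical.

```python
def nedelsky_cut_score(remaining_options):
    """Cut score from distractor-level judgments (single rater).

    remaining_options[i] is the number of options on item i, including
    the correct one, that the rater believes a minimally competent
    examinee could NOT eliminate. The item probability is taken to be
    1 / remaining_options[i] (an assumption of uniform guessing).
    """
    if any(k < 1 for k in remaining_options):
        raise ValueError("each item must keep at least the correct option")
    return sum(1.0 / k for k in remaining_options)


def angoff_cut_score(item_probabilities):
    """Cut score from whole-item judgments (single rater).

    item_probabilities[i] is the rater's direct estimate of the chance
    that a minimally competent examinee answers item i correctly.
    """
    if any(not 0.0 <= p <= 1.0 for p in item_probabilities):
        raise ValueError("probabilities must lie in [0, 1]")
    return sum(item_probabilities)


if __name__ == "__main__":
    # Hypothetical ratings for a three-item test.
    print(nedelsky_cut_score([2, 4, 1]))
    print(angoff_cut_score([0.6, 0.3, 0.9]))
```

Because the distractor-level approach only yields probabilities of the form 1/k, its item values are coarser than the whole-item judgments, which is one reason the two methods can produce different cut scores for the same test.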
Rentz, R. Robert; Bashaw, W. L. – 1975
In order to determine if Rasch Model procedures have any utility for equating pre-existing tests, this study reanalyzed the data from the equating phase of the Anchor Test Study which used a variety of equipercentile and linear model methods. The tests involved included seven reading test batteries, each having from one to three levels and two…
Descriptors: Comparative Analysis, Elementary Education, Equated Scores, Error of Measurement
Rentz, R. Robert; Bashaw, W. L. – 1975
This volume contains tables of item analysis results obtained by following procedures associated with the Rasch Model for those reading tests used in the Anchor Test Study. Appendix I gives the test names and their corresponding analysis code numbers. Section I (Basic Item Analyses) presents data for the item analysis of each test in a two part…
Descriptors: Comparative Analysis, Elementary Education, Equated Scores, Error of Measurement

