Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 2 |
Descriptor
Source
Journal of Applied Testing… | 1 |
Journal of Educational… | 1 |
Journal of Educational and… | 1 |
Psychometrika | 1 |
Teaching Mathematics and Its… | 1 |
US Department of Education | 1 |
Author
Bergstrom, Betty | 1 |
Cobern, William W. | 1 |
Crouch, Rosalind | 1 |
De Boeck, Paul | 1 |
Dickison, Philip | 1 |
Geisinger, Kurt F. | 1 |
Haines, Christopher | 1 |
Hill, Richard K. | 1 |
Howe, Roger | 1 |
Kim, Doyoung | 1 |
Kim, Seock-Ho | 1 |
More ▼ |
Publication Type
Reports - Descriptive | 12 |
Journal Articles | 5 |
Speeches/Meeting Papers | 3 |
Computer Programs | 1 |
Guides - Non-Classroom | 1 |
Reports - Research | 1 |
Audience
Location
Belgium | 1 |
California | 1 |
Laws, Policies, & Programs
Assessments and Surveys
SAT (College Admission Test) | 1 |
What Works Clearinghouse Rating
Dickison, Philip; Luo, Xiao; Kim, Doyoung; Woo, Ada; Muntean, William; Bergstrom, Betty – Journal of Applied Testing Technology, 2016
Designing a theory-based assessment with sound psychometric qualities to measure a higher-order cognitive construct is a highly desired yet challenging task for many practitioners. This paper proposes a framework for designing a theory-based assessment to measure a higher-order cognitive construct. This framework results in a modularized yet…
Descriptors: Thinking Skills, Cognitive Tests, Test Construction, Nursing
Kim, Seock-Ho – 2002
Continuation ratio logits are used to model the possibilities of obtaining ordered categories in a polytomously scored item. This model is an alternative to other models for ordered category items such as the graded response model and the generalized partial credit model. The discussion includes a theoretical development of the model, a…
Descriptors: Ability, Classification, Item Response Theory, Mathematical Models

Thissen, David; Steinberg, Lynne – Psychometrika, 1986
This article organizes models for categorical item response data into three distinct classes. "Difference models" are appropriate for ordered responses, "divide-by-total" models for either ordered or nominal responses, and "left-side added" models for multiple-choice responses with guessing. Details of the taxonomy…
Descriptors: Classification, Item Analysis, Latent Trait Theory, Mathematical Models
Haines, Christopher; Crouch, Rosalind – Teaching Mathematics and Its Applications: An International Journal of the IMA, 2005
In this research paper we discuss how some multiple-choice questions may be used to improve understanding, to develop and to assess modelling capabilities and as an aid to teaching.
Descriptors: Test Items, Multiple Choice Tests, Mathematical Applications, Mathematical Models
Monte Carlo Based Null Distribution for an Alternative Goodness-of-Fit Test Statistic in IRT Models.

Stone, Clement A. – Journal of Educational Measurement, 2000
Describes a goodness-of-fit statistic that considers the imprecision with which ability is estimated and involves constructing item fit tables based on each examinee's posterior distribution of ability, given the likelihood of the response pattern and an assumed marginal ability distribution. Also describes a Monte Carlo resampling procedure to…
Descriptors: Goodness of Fit, Item Response Theory, Mathematical Models, Monte Carlo Methods
Samejima, Fumiko – 1981
In defense of retaining the "latent trait theory" term, instead of replacing it with "item response theory" as some recent research would have it, the following objectives are outlined: (1) investigation of theory and method for estimating the operating characteristics of discrete item responses using a minimum number of…
Descriptors: Adaptive Testing, Computer Assisted Testing, Factor Analysis, Latent Trait Theory

Cobern, William W. – 1986
This computer program, written in BASIC, performs three different calculations of test reliability: (1) the Kuder-Richardson method; (2); the "common split-half" method; and (3) the Rulon-Guttman split-half method. The program reads sequential access data files for microcomputers that have been set up by statistical packages such as…
Descriptors: Computer Software, Difficulty Level, Educational Research, Equations (Mathematics)
Merz, William R. – 1980
Several methods of assessing test item bias are described, and the concept of fair use of tests is examined. A test item is biased if individuals of equal ability have different probabilities of attaining the item correct. The following seven general procedures used to examine test items for bias are summarized and discussed: (1) analysis of…
Descriptors: Comparative Analysis, Evaluation Methods, Factor Analysis, Mathematical Models
Van den Noortgate, Wim; De Boeck, Paul – Journal of Educational and Behavioral Statistics, 2005
Although differential item functioning (DIF) theory traditionally focuses on the behavior of individual items in two (or a few) specific groups, in educational measurement contexts, it is often plausible to regard the set of items as a random sample from a broader category. This article presents logistic mixed models that can be used to model…
Descriptors: Test Bias, Item Response Theory, Educational Assessment, Mathematical Models
Howe, Roger; Scheaffer, Richard; Lindquist, Mary – US Department of Education, 2006
This document contains the framework and a set of recommendations for the NAEP 2007 mathematics assessment, which will assess student achievement nationally and state-by-state, as well as in select urban districts, in grades 4 and 8. It includes descriptions of the mathematical content of the test, the types of test questions, and recommendations…
Descriptors: Grade 4, Mathematical Models, National Competency Tests, Mathematics Instruction
Schultz, Matthew T.; Geisinger, Kurt F. – 1992
Research efforts have established that the Mantel-Haenszel procedure (MHP) is an effective method for detecting the presence of test items exhibiting differential item functioning (DIF). While the MHP has been advocated for situations where item response theory based methods may not be usable, recent findings have suggested that the performance of…
Descriptors: College Entrance Examinations, Comparative Analysis, Control Groups, Equations (Mathematics)
Hill, Richard K. – 1979
Four problems faced by the staff of the California Assessment Program (CAP) were solved by applying Rasch scaling techniques: (1) item cultural bias in the Entry Level Test (ELT) given to all first grade pupils; (2) nonlinear regression analysis of the third grade Reading Test scores; (3) comparison of school growth from grades two to three, using…
Descriptors: Black Students, Cultural Differences, Data Analysis, Difficulty Level