Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 1 |
Descriptor
Source
| Applied Measurement in… | 15 |
Author
| Bart, William M. | 1 |
| Brennan, Robert L. | 1 |
| Cohen, Allan S. | 1 |
| Crouse, Jill D. | 1 |
| Davey, Beth | 1 |
| De Ayala, R. J. | 1 |
| Dorans, Neil J. | 1 |
| Du, Yi | 1 |
| Engelhard, George, Jr. | 1 |
| Feldt, Leonard S. | 1 |
| Harris, Deborah J. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 15 |
| Reports - Evaluative | 11 |
| Reports - Research | 3 |
| Speeches/Meeting Papers | 2 |
| Information Analyses | 1 |
| Reports - Descriptive | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
| National Assessment of… | 1 |
| SAT (College Admission Test) | 1 |
What Works Clearinghouse Rating
Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Applied Measurement in Education, 2024
A process is proposed to create the one-dimensional expected item characteristic curve (ICC) and test characteristic curve (TCC) for each trait in multidimensional forced-choice questionnaires based on the Rank-2PL (two-parameter logistic) item response theory models for forced-choice items with two or three statements. Some examples of ICC and…
Descriptors: Item Response Theory, Questionnaires, Measurement Techniques, Statistics
Peer reviewedBart, William M.; Williams-Morris, Ruth – Applied Measurement in Education, 1990
Refined item digraph analysis (RIDA) is a way of studying diagnostic and prescriptive testing. It permits assessment of a test item's diagnostic value by examining the extent to which the item has properties of ideal items. RIDA is illustrated with the Orange Juice Test, which assesses the proportionality concept. (TJH)
Descriptors: Diagnostic Tests, Evaluation Methods, Item Analysis, Mathematical Models
Peer reviewedDu, Yi; And Others – Applied Measurement in Education, 1993
A new computerized mastery test is described that builds on the Lewis and Sheehan procedure (sequential testlets) (1990), but uses fuzzy set decision theory to determine stopping rules and the Rasch model to calibrate items and estimate abilities. Differences between fuzzy set and Bayesian methods are illustrated through an example. (SLD)
Descriptors: Bayesian Statistics, Comparative Analysis, Computer Assisted Testing, Estimation (Mathematics)
Peer reviewedDorans, Neil J. – Applied Measurement in Education, 1990
The equating methods and sampling designs used in the empirical studies in this special issue on the use of matched samples for test equating are described. Four requisites for equating are listed, and the invariance of equating functions requisite is identified as the focus of this issue. (SLD)
Descriptors: Equated Scores, Equations (Mathematics), Evaluation Methods, Item Response Theory
Peer reviewedBrennan, Robert L. – Applied Measurement in Education, 1992
A conceptual framework and heuristic model for considering the existence, magnitude, and consequences of context effects are presented through an extension of some generalizability theory concepts. Context effects are often misunderstood, and current measurement models have serious limitations for examining them. Their importance needs to be…
Descriptors: Adaptive Testing, Context Effect, Equated Scores, Equations (Mathematics)
Peer reviewedHarris, Deborah J.; Crouse, Jill D. – Applied Measurement in Education, 1993
Criteria used in the equating process proposed in the literature are reviewed. The discussion begins by examining how equating is defined. The controversy over the best criterion, the utility of some, and whether a criterion is needed at all means that much work needs to be done in this area. (SLD)
Descriptors: Data Collection, Definitions, Equated Scores, Evaluation Criteria
Peer reviewedFeldt, Leonard S. – Applied Measurement in Education, 1993
The recommendation that the reliability of multiple-choice tests will be enhanced if the distribution of item difficulties is concentrated at approximately 0.50 is reinforced and extended in this article by viewing the 0/1 item scoring as a dichotomization of an underlying normally distributed ability score. (SLD)
Descriptors: Ability, Difficulty Level, Guessing (Tests), Mathematical Models
Peer reviewedDe Ayala, R. J.; And Others – Applied Measurement in Education, 1992
A study involving 1,000 simulated examinees compared the partial credit and graded response models in computerized adaptive testing (CAT). The graded response model fit the data well and provided slightly more accurate ability estimates than those of the partial credit model. Benefits of polytomous model-based CATs are discussed. (SLD)
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Computer Simulation
Peer reviewedWelch, Catherine; Hoover, H. D. – Applied Measurement in Education, 1993
Methodology is suggested for several statistical procedures to detect polytomously scored items that function differently for two subgroups of examinees. The 3 methods are alternative ways of combining the data from 2 x "k" tables. Simulation results demonstrate the superiority of two of the methods, designated HW1 and HW3. (SLD)
Descriptors: Computer Simulation, Effect Size, Equations (Mathematics), Estimation (Mathematics)
Peer reviewedHolland, Paul W.; Wainer, Howard – Applied Measurement in Education, 1990
Two attempts to adjust state mean Scholastic Aptitude Test (SAT) scores for differential participation rates are examined. Both attempts are rejected, and five rules for performing adjustments are outlined to foster follow-up checks on untested assumptions. National Assessment of Educational Progress state data are determined to be more accurate.…
Descriptors: College Applicants, College Entrance Examinations, Estimation (Mathematics), Item Bias
Peer reviewedSchiel, Jeffrey L.; Shaw, Dale G. – Applied Measurement in Education, 1992
Changes in information retention resulting from changes in reliability and number of intervals in scale construction were studied to provide quantitative information to help in decisions about choosing intervals. Information retention reached a maximum when the number of intervals was about 8 or more and reliability was near 1.0. (SLD)
Descriptors: Decision Making, Knowledge Level, Mathematical Models, Monte Carlo Methods
Peer reviewedCohen, Allan S.; Kim, Seock-Ho – Applied Measurement in Education, 1992
Studied effects of students' use of calculators with 2 experimental forms of a university mathematics test taken by 765 and 725 college students, respectively. Calculator effects are not found for overall scores but are seen for some individual items. Analysis at the item level makes the actual impact apparent. (SLD)
Descriptors: Calculators, College Students, Educational Technology, Equations (Mathematics)
Peer reviewedMiller, Timothy R.; Hirsch, Thomas M. – Applied Measurement in Education, 1992
A procedure for interpreting multiple-discrimination indices from a multidimensional item-response theory analysis is described and demonstrated with responses of 1,635 high school students to a multiple-choice test. The procedure consists of converting discrimination parameter estimates to direction cosines and analyzing the angular distances…
Descriptors: Ability, Cluster Analysis, Comparative Analysis, Estimation (Mathematics)
Peer reviewedEngelhard, George, Jr. – Applied Measurement in Education, 1992
A Many-Faceted Rasch Model (FACETS) for measurement of writing ability is described, and its use in solving measurement problems in large-scale assessment is illustrated with a random sample of 1,000 students from Georgia's Eighth Grade Writing Test. It is a promising approach to assessment through written compositions. (SLD)
Descriptors: Educational Assessment, Essays, Evaluation Problems, Grade 8
Peer reviewedDavey, Beth; Macready, George B. – Applied Measurement in Education, 1990
The usefulness of latent class modeling in addressing several measurement issues is demonstrated via a study of 74 good and 74 poor readers in grades 5 and 6. Procedures were particularly useful for assessing the hierarchical relation among skills and for exploring issues related to item domains. (SLD)
Descriptors: Comparative Testing, Elementary School Students, Grade 5, Grade 6

Direct link
