Showing 1 to 15 of 25 results
Peer reviewed
Sun, Ting; Kim, Stella Yun – Measurement: Interdisciplinary Research and Perspectives, 2021
In many large testing programs, equipercentile equating has been widely used under a random groups design to adjust test difficulty between forms. However, one thorny issue occurs with equipercentile equating when a particular score has no observed frequency. The purpose of this study is to suggest and evaluate six potential methods in…
Descriptors: Equated Scores, Test Length, Sample Size, Methods
Peer reviewed
Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Measurement: Interdisciplinary Research and Perspectives, 2021
This study offers an approach to test equating under the latent D-scoring method (DSM-L) using the nonequivalent groups with anchor tests (NEAT) design. The accuracy of the test equating was examined via a simulation study under a 3 × 3 design by two conditions: group ability at three levels and test difficulty at three levels. The results for…
Descriptors: Equated Scores, Scoring, Test Items, Accuracy
Peer reviewed
Luo, Yong – Measurement: Interdisciplinary Research and Perspectives, 2021
To date, only frequentist model-selection methods have been studied with mixed-format data in the context of IRT model-selection, and it is unknown how popular Bayesian model-selection methods such as DIC, WAIC, and LOO perform. In this study, we present the results of a comprehensive simulation study that compared the performances of eight…
Descriptors: Item Response Theory, Test Format, Selection, Methods
Peer reviewed
Fisher, William P., Jr. – Measurement: Interdisciplinary Research and Perspectives, 2017
In this commentary on "Rethinking Traditional Methods of Survey Validation," found in this issue of "Measurement: Interdisciplinary Research and Perspectives," William Fisher writes that Maul's paper raises issues of validity in survey-based measurement that deserve far wider consideration and scrutiny than they typically…
Descriptors: Surveys, Validity, Measurement Techniques, Methods
Peer reviewed
Wyse, Adam E. – Measurement: Interdisciplinary Research and Perspectives, 2018
A key part of determining cut-scores when performing Angoff standard setting is utilizing equating methods to place standard-setting ratings onto the scale used to report scores to examinees. This article describes three equating methods that can be employed to place Angoff ratings onto the scale used to report scores to examinees when applying…
Descriptors: Standard Setting (Scoring), Equated Scores, Probability, Regression (Statistics)
Peer reviewed
Kane, Mike – Measurement: Interdisciplinary Research and Perspectives, 2017
In the article "Rethinking Traditional Methods of Survey Validation," Andrew Maul describes a minimalist validation methodology for survey instruments, which he suggests is widely used in some areas of psychology, and then critiques this methodology empirically and conceptually. He provides a reductio ad absurdum argument by showing that…
Descriptors: Surveys, Validity, Psychological Characteristics, Methods
Peer reviewed
Mari, Luca – Measurement: Interdisciplinary Research and Perspectives, 2017
In his focus article, "Rethinking Traditional Methods of Survey Validation," published in this issue of "Measurement: Interdisciplinary Research and Perspectives," Andrew Maul introduces and discusses several foundational issues and concludes that self-report measures may be particularly difficult to validate and may fall short…
Descriptors: Surveys, Validity, Measurement Techniques, Methods
Peer reviewed
Cappaert, Kevin J.; Wen, Yao; Chang, Yu-Feng – Measurement: Interdisciplinary Research and Perspectives, 2018
Events such as curriculum changes or practice effects can lead to item parameter drift (IPD) in computer adaptive testing (CAT). The current investigation introduced a point- and weight-adjusted D² method for IPD detection for use in a CAT environment when items are suspected of drifting across test administrations. Type I error and…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Identification
Peer reviewed
Duckor, Brent – Measurement: Interdisciplinary Research and Perspectives, 2017
In Andrew Maul's focus paper, "Rethinking Traditional Methods of Survey Validation," published in this issue of "Measurement: Interdisciplinary Research and Perspectives," Maul contends that self-report measures may be particularly difficult to validate. He cautions that such techniques may fall short of providing the kinds of…
Descriptors: Surveys, Validity, Measurement Techniques, Psychological Testing
Peer reviewed
Bao, Yu; Bradshaw, Laine – Measurement: Interdisciplinary Research and Perspectives, 2018
Diagnostic classification models (DCMs) can provide multidimensional diagnostic feedback about students' mastery levels of knowledge components or attributes. One advantage of using DCMs is the ability to accurately and reliably classify students into mastery levels with a relatively small number of items per attribute. Combining DCMs with…
Descriptors: Test Items, Selection, Adaptive Testing, Computer Assisted Testing
Peer reviewed
Torres Irribarra, David – Measurement: Interdisciplinary Research and Perspectives, 2017
Maul's paper, "Rethinking Traditional Methods of Survey Validation," is a clever and pointed indictment of a set of specific but widespread practices in psychological measurement and the social sciences at large. Through it, Maul highlights central issues in the way to approach theory building and theory testing, bringing to mind the…
Descriptors: Surveys, Validity, Methods, Psychological Characteristics
Peer reviewed
Maul, Andrew – Measurement: Interdisciplinary Research and Perspectives, 2017
It is commonly believed that self-report, survey-based instruments can be used to measure a wide range of psychological attributes, such as self-control, growth mindsets, and grit. Increasingly, such instruments are being used not only for basic research but also for supporting decisions regarding educational policy and accountability. The…
Descriptors: Surveys, Validity, Methods, Psychological Characteristics
Peer reviewed
Briggs, Derek C.; Peck, Frederick A. – Measurement: Interdisciplinary Research and Perspectives, 2015
The concept of growth is at the foundation of the policy and practice around systems of educational accountability. It is also at the foundation of what teachers concern themselves with on a daily basis as they help children learn. Yet there is a disconnect between the criterion-referenced intuitions that parents and teachers have for what it…
Descriptors: Achievement Gains, Scaling, Scores, Inferences
Peer reviewed
Schulz, E. Matthew – Measurement: Interdisciplinary Research and Perspectives, 2013
In this article, E. Matthew Schulz responds to Adam Wyse's article, "Construct Maps as a Foundation for Standard Setting." In doing so, he asserts that one of the most important ideas in Wyse's work is that information used in standard setting needs to be better represented through the use of graphics. However, he's not…
Descriptors: Standard Setting (Scoring), Maps, Item Response Theory, Test Items
Peer reviewed
Kingston, Neal M.; Tiemann, Gail C.; Loughran, Jessica T. – Measurement: Interdisciplinary Research and Perspectives, 2013
The authors of this article comment on "Construct Maps as a Foundation for Standard Setting," by Adam E. Wyse (this issue) in which Wyse presents construct maps, a visual display of a variety of sources of evidence that support standard-setting decisions, and shows how this approach could be used with a variety of existing…
Descriptors: Standard Setting (Scoring), Maps, Methods, Misconceptions