ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	12
Since 2006 (last 20 years)	21

Source

Measurement:…

Publication Type

Journal Articles	25
Opinion Papers	10
Reports - Evaluative	10
Reports - Research	10
Reports - Descriptive	1

Education Level

Elementary Secondary Education	3
Elementary Education	1
Grade 5	1
Higher Education	1
Intermediate Grades	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…

What Works Clearinghouse Rating

Showing 1 to 15 of 25 results Save | Export

Evaluating Six Approaches to Handling Zero-Frequency Scores under Equipercentile Equating

Peer reviewed

Direct link

Sun, Ting; Kim, Stella Yun – Measurement: Interdisciplinary Research and Perspectives, 2021

In many large testing programs, equipercentile equating has been widely used under a random groups design to adjust test difficulty between forms. However, one thorny issue occurs with equipercentile equating when a particular score has no observed frequency. The purpose of this study is to suggest and evaluate six potential methods in…

Descriptors: Equated Scores, Test Length, Sample Size, Methods

An Approach to Test Equating under the Latent "D"-Scoring Method

Peer reviewed

Direct link

Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Measurement: Interdisciplinary Research and Perspectives, 2021

This study offers an approach to test equating under the latent D-scoring method (DSM-L) using the nonequivalent groups with anchor tests (NEAT) design. The accuracy of the test equating was examined via a simulation study under a 3 × 3 design by two conditions: group ability at three levels and test difficulty at three levels. The results for…

Descriptors: Equated Scores, Scoring, Test Items, Accuracy

A Comparison of Common IRT Model-Selection Methods with Mixed-Format Tests

Peer reviewed

Direct link

Luo, Yong – Measurement: Interdisciplinary Research and Perspectives, 2021

To date, only frequentist model-selection methods have been studied with mixed-format data in the context of IRT model-selection, and it is unknown how popular Bayesian model-selection methods such as DIC, WAIC, and LOO perform. In this study, we present the results of a comprehensive simulation study that compared the performances of eight…

Descriptors: Item Response Theory, Test Format, Selection, Methods

Suggestions for Rethinking Validation

Peer reviewed

Direct link

Fisher, William P., Jr. – Measurement: Interdisciplinary Research and Perspectives, 2017

In this commentary on "Rethinking Traditional Methods of Survey Validation," found in this issue of "Measurement: Interdisciplinary Research and Perspectives," William Fisher writes that Maul's paper raises issues of validity in survey-based measurement that deserve far wider consideration and scrutiny than they typically…

Descriptors: Surveys, Validity, Measurement Techniques, Methods

Equating Angoff Standard-Setting Ratings with the Rasch Model

Peer reviewed

Direct link

Wyse, Adam E. – Measurement: Interdisciplinary Research and Perspectives, 2018

A key part of determining cut-scores when performing Angoff standard setting is utilizing equating methods to place standard-setting ratings onto the scale used to report scores to examinees. This article describes three equating methods that can be employed to place Angoff ratings onto the scale used to report scores to examinees when applying…

Descriptors: Standard Setting (Scoring), Equated Scores, Probability, Regression (Statistics)

Causal Interpretations of Psychological Attributes

Peer reviewed

Direct link

Kane, Mike – Measurement: Interdisciplinary Research and Perspectives, 2017

In the article "Rethinking Traditional Methods of Survey Validation" Andrew Maul describes a minimalist validation methodology for survey instruments, which he suggests is widely used in some areas of psychology and then critiques this methodology empirically and conceptually. He provides a reduction ad absurdum argument by showing that…

Descriptors: Surveys, Validity, Psychological Characteristics, Methods

Can Formal Methods Provide (Necessary and) Sufficient Conditions for Measurement?

Peer reviewed

Direct link

Mari, Luca – Measurement: Interdisciplinary Research and Perspectives, 2017

In his focus article, "Rethinking Traditional Methods of Survey Validation," published in this issue of "Measurement: Interdisciplinary Research and Perspectives," Andrew Maul introduces and discusses several foundational issues and concludes that self-report measures may be particularly difficult to validate and may fall short…

Descriptors: Surveys, Validity, Measurement Techniques, Methods

Evaluating CAT-Adjusted Approaches for Suspected Item Parameter Drift Detection

Peer reviewed

Direct link

Cappaert, Kevin J.; Wen, Yao; Chang, Yu-Feng – Measurement: Interdisciplinary Research and Perspectives, 2018

Events such as curriculum changes or practice effects can lead to item parameter drift (IPD) in computer adaptive testing (CAT). The current investigation introduced a point- and weight-adjusted D[superscript 2] method for IPD detection for use in a CAT environment when items are suspected of drifting across test administrations. Type I error and…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Identification

Beyond Immanent and Transcendent Critique: Exploring Maul's Argument within the NRC (2001) Framework

Peer reviewed

Direct link

Duckor, Brent – Measurement: Interdisciplinary Research and Perspectives, 2017

In Andrew Maul's focus paper "Rethinking Traditional Methods of Survey Validation'" published in this issue of "Measurement: Interdisciplinary Research and Perspectives," Maul contends that self-report measures may be particularly difficult to validate. He cautions that such techniques may fall short of providing the kinds of…

Descriptors: Surveys, Validity, Measurement Techniques, Psychological Testing

Attribute-Level Item Selection Method for DCM-CAT

Peer reviewed

Direct link

Bao, Yu; Bradshaw, Laine – Measurement: Interdisciplinary Research and Perspectives, 2018

Diagnostic classification models (DCMs) can provide multidimensional diagnostic feedback about students' mastery levels of knowledge components or attributes. One advantage of using DCMs is the ability to accurately and reliably classify students into mastery levels with a relatively small number of items per attribute. Combining DCMs with…

Descriptors: Test Items, Selection, Adaptive Testing, Computer Assisted Testing

Measuring Well What is Ill Defined?

Peer reviewed

Direct link

Torres Irribarra, David – Measurement: Interdisciplinary Research and Perspectives, 2017

Maul's paper, "Rethinking Traditional Methods of Survey Validation," is a clever and pointed indictment of a set of specific but widespread practices in psychological measurement and the social sciences at large. Through it, Maul highlights central issues in the way to approach theory building and theory testing, bringing to mind the…

Descriptors: Surveys, Validity, Methods, Psychological Characteristics

Rethinking Traditional Methods of Survey Validation

Peer reviewed

Direct link

Maul, Andrew – Measurement: Interdisciplinary Research and Perspectives, 2017

It is commonly believed that self-report, survey-based instruments can be used to measure a wide range of psychological attributes, such as self-control, growth mindsets, and grit. Increasingly, such instruments are being used not only for basic research but also for supporting decisions regarding educational policy and accountability. The…

Descriptors: Surveys, Validity, Methods, Psychological Characteristics

Using Learning Progressions to Design Vertical Scales That Support Coherent Inferences about Student Growth

Peer reviewed

Direct link

Briggs, Derek C.; Peck, Frederick A. – Measurement: Interdisciplinary Research and Perspectives, 2015

The concept of growth is at the foundation of the policy and practice around systems of educational accountability. It is also at the foundation of what teachers concern themselves with on a daily basis as they help children learn. Yet there is a disconnect between the criterion-referenced intuitions that parents and teachers have for what it…

Descriptors: Achievement Gains, Scaling, Scores, Inferences

What Is Essential in Standard Setting and Construct Maps? Commentary on Adam E. Wyse's "Construct Maps as a Foundation for Standard Setting"

Peer reviewed

Direct link

Schulz, E. Matthew – Measurement: Interdisciplinary Research and Perspectives, 2013

In this article, E. Matthew Schulz responds to Adam Wyse's article, "Construct Maps as a Foundation for Standard Setting." In doing so, he asserts that one of the most important ideas in Wyse's work is that information used in standard setting needs to be better represented through the use of graphics. However, he's not…

Descriptors: Standard Setting (Scoring), Maps, Item Response Theory, Test Items

Commentary on "Construct Maps as a Foundation for Standard Setting"

Peer reviewed

Direct link

Kingston, Neal M.; Tiemann, Gail C.; Loughran, Jessica T. – Measurement: Interdisciplinary Research and Perspectives, 2013

The authors of this article comment on "Construct Maps as a Foundation for Standard Setting," by Adam E. Wyse (this issue) in which Wyse presents construct maps, a visual display of a variety of sources of evidence that support standard-setting decisions, and shows how this approach could be used with a variety of existing…

Descriptors: Standard Setting (Scoring), Maps, Methods, Misconceptions

Previous Page | Next Page »

Pages: 1 | 2

Methods	25
Test Items	7
Equated Scores	6
Surveys	6
Validity	6
Item Response Theory	5
Measurement Techniques	5
Standard Setting (Scoring)	5
Test Construction	5
Psychological Testing	4
Standards	4
Accuracy	3
Cutting Scores	3
Maps	3
Measurement	3
Psychological Characteristics	3
Regression (Statistics)	3
Reliability	3
Statistical Inference	3
Test Length	3
Accountability	2
Adaptive Testing	2
Bibliometrics	2
Citation Analysis	2
Computer Assisted Testing	2
More ▼

Kane, Michael T.	3
Mroch, Andrew A.	3
Ripkey, Douglas R.	3
Suh, Youngsuk	3
Kane, Michael	2
Wyse, Adam E.	2
Atanasov, Dimitar V.	1
Bao, Yu	1
Bradshaw, Laine	1
Briggs, Derek C.	1
Cappaert, Kevin J.	1
Chang, Yu-Feng	1
Dimitrov, Dimiter M.	1
Duckor, Brent	1
Fisher, William P., Jr.	1
Huff, Kristen	1
Kane, Mike	1
Kim, Stella Yun	1
Kingston, Neal M.	1
Lewison, Grant	1
Loughran, Jessica T.	1
Luo, Yong	1
Mari, Luca	1
Maul, Andrew	1
Mislevy, Robert J.	1
More ▼