ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	9
Since 2017 (last 10 years)	18

Source

Educational and Psychological…

Publication Type

Journal Articles	18
Reports - Research	14
Reports - Evaluative	3
Reports - Descriptive	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 18 results Save | Export

Detecting Rating Scale Malfunctioning with the Partial Credit Model and Generalized Partial Credit Model

Peer reviewed

Direct link

Wind, Stefanie A. – Educational and Psychological Measurement, 2023

Rating scale analysis techniques provide researchers with practical tools for examining the degree to which ordinal rating scales (e.g., Likert-type scales or performance assessment rating scales) function in psychometrically useful ways. When rating scales function as expected, researchers can interpret ratings in the intended direction (i.e.,…

Descriptors: Rating Scales, Testing Problems, Item Response Theory, Models

The Impact of Insufficient Effort Responses on the Order of Category Thresholds in the Polytomous Rasch Model

Peer reviewed

Direct link

Kuan-Yu Jin; Thomas Eckes – Educational and Psychological Measurement, 2024

Insufficient effort responding (IER) refers to a lack of effort when answering survey or questionnaire items. Such items typically offer more than two ordered response categories, with Likert-type scales as the most prominent example. The underlying assumption is that the successive categories reflect increasing levels of the latent variable…

Descriptors: Item Response Theory, Test Items, Test Wiseness, Surveys

On the Pitfalls of Estimating and Using Standardized Reliability Coefficients

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2021

The population discrepancy between unstandardized and standardized reliability of homogeneous multicomponent measuring instruments is examined. Within a latent variable modeling framework, it is shown that the standardized reliability coefficient for unidimensional scales can be markedly higher than the corresponding unstandardized reliability…

Descriptors: Test Reliability, Computation, Measures (Individuals), Research Problems

Two-Method Measurement Planned Missing Data with Purposefully Selected Samples

Peer reviewed

Direct link

Menglin Xu; Jessica A. R. Logan – Educational and Psychological Measurement, 2024

Research designs that include planned missing data are gaining popularity in applied education research. These methods have traditionally relied on introducing missingness into data collections using the missing completely at random (MCAR) mechanism. This study assesses whether planned missingness can also be implemented when data are instead…

Descriptors: Research Design, Research Methodology, Monte Carlo Methods, Statistical Analysis

Using Multiple Imputation to Account for the Uncertainty Due to Missing Data in the Context of Factor Retention

Peer reviewed

Direct link

Yan Xia; Selim Havan – Educational and Psychological Measurement, 2024

Although parallel analysis has been found to be an accurate method for determining the number of factors in many conditions with complete data, its application under missing data is limited. The existing literature recommends that, after using an appropriate multiple imputation method, researchers either apply parallel analysis to every imputed…

Descriptors: Data Interpretation, Factor Analysis, Statistical Inference, Research Problems

Factor Retention in Exploratory Factor Analysis with Missing Data

Peer reviewed

Direct link

Goretzko, David – Educational and Psychological Measurement, 2022

Determining the number of factors in exploratory factor analysis is arguably the most crucial decision a researcher faces when conducting the analysis. While several simulation studies exist that compare various so-called factor retention criteria under different data conditions, little is known about the impact of missing data on this process.…

Descriptors: Factor Analysis, Research Problems, Data, Prediction

A Robust Method for Detecting Item Misfit in Large-Scale Assessments

Peer reviewed

Direct link

von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023

Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…

Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit

Estimating Probabilities of Passing for Examinees with Incomplete Data in Mastery Tests

Peer reviewed

Direct link

Sinharay, Sandip – Educational and Psychological Measurement, 2022

Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores and hence to incomplete data on mastery tests such as the AP and U.S. Medical Licensing examinations. Investigators are often interested in estimating the probabilities of passing of the examinees with incomplete data on mastery tests.…

Descriptors: Mastery Tests, Computer Assisted Testing, Probability, Test Wiseness

Using the Coefficient of Confidence to Make the Philosophical Switch from a Posteriori to a Priori Inferential Statistics

Peer reviewed

Direct link

Trafimow, David – Educational and Psychological Measurement, 2017

There has been much controversy over the null hypothesis significance testing procedure, with much of the criticism centered on the problem of inverse inference. Specifically, p gives the probability of the finding (or one more extreme) given the null hypothesis, whereas the null hypothesis significance testing procedure involves drawing a…

Descriptors: Statistical Inference, Hypothesis Testing, Probability, Intervals

The Effects of Sample Size on the Estimation of Regression Mixture Models

Peer reviewed

Direct link

Jaki, Thomas; Kim, Minjung; Lamont, Andrea; George, Melissa; Chang, Chi; Feaster, Daniel; Van Horn, M. Lee – Educational and Psychological Measurement, 2019

Regression mixture models are a statistical approach used for estimating heterogeneity in effects. This study investigates the impact of sample size on regression mixture's ability to produce "stable" results. Monte Carlo simulations and analysis of resamples from an application data set were used to illustrate the types of problems that…

Descriptors: Sample Size, Computation, Regression (Statistics), Reliability

Hybrid Threshold-Based Sequential Procedures for Detecting Compromised Items in a Computerized Adaptive Testing Licensure Exam

Peer reviewed

Direct link

Lee, Chansoon; Qian, Hong – Educational and Psychological Measurement, 2022

Using classical test theory and item response theory, this study applied sequential procedures to a real operational item pool in a variable-length computerized adaptive testing (CAT) to detect items whose security may be compromised. Moreover, this study proposed a hybrid threshold approach to improve the detection power of the sequential…

Descriptors: Computer Assisted Testing, Adaptive Testing, Licensing Examinations (Professions), Item Response Theory

Assessing Ability Recovery of the Sequential IRT Model with Unstructured Multiple-Attempt Data

Peer reviewed
PDF on ERIC

Download full text

Direct link

Ziying Li; A. Corinne Huggins-Manley; Walter L. Leite; M. David Miller; Eric A. Wright – Educational and Psychological Measurement, 2022

The unstructured multiple-attempt (MA) item response data in virtual learning environments (VLEs) are often from student-selected assessment data sets, which include missing data, single-attempt responses, multiple-attempt responses, and unknown growth ability across attempts, leading to a complex and complicated scenario for using this kind of…

Descriptors: Sequential Approach, Item Response Theory, Data, Simulation

Does the Effect of a Time Limit for Testing Impair Structural Investigations by Means of Confirmatory Factor Models?

Peer reviewed

Direct link

Schweizer, Karl; Reiß, Siegbert; Troche, Stefan – Educational and Psychological Measurement, 2019

The article reports three simulation studies conducted to find out whether the effect of a time limit for testing impairs model fit in investigations of structural validity, whether the representation of the assumed source of the effect prevents impairment of model fit and whether it is possible to identify and discriminate this method effect from…

Descriptors: Timed Tests, Testing, Barriers, Testing Problems

Imputation Methods to Deal with Missing Responses in Computerized Adaptive Multistage Testing

Peer reviewed

Direct link

Cetin-Berber, Dee Duygu; Sari, Halil Ibrahim; Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2019

Routing examinees to modules based on their ability level is a very important aspect in computerized adaptive multistage testing. However, the presence of missing responses may complicate estimation of examinee ability, which may result in misrouting of individuals. Therefore, missing responses should be handled carefully. This study investigated…

Descriptors: Computer Assisted Testing, Adaptive Testing, Error of Measurement, Research Problems

Hypothesis Testing in the Real World

Peer reviewed

Direct link

Miller, Jeff – Educational and Psychological Measurement, 2017

Critics of null hypothesis significance testing suggest that (a) its basic logic is invalid and (b) it addresses a question that is of no interest. In contrast to (a), I argue that the underlying logic of hypothesis testing is actually extremely straightforward and compelling. To substantiate that, I present examples showing that hypothesis…

Descriptors: Hypothesis Testing, Testing Problems, Test Validity, Relevance (Education)

Previous Page | Next Page »

Pages: 1 | 2

Research Problems	10
Item Response Theory	7
Testing Problems	7
Error of Measurement	6
Sample Size	5
Computation	4
Hypothesis Testing	4
Simulation	4
Test Items	4
Accuracy	3
Bayesian Statistics	3
Computer Assisted Testing	3
Factor Analysis	3
Goodness of Fit	3
Models	3
Probability	3
Statistical Analysis	3
Statistical Inference	3
Adaptive Testing	2
Classification	2
Data	2
Data Analysis	2
Data Interpretation	2
Effect Size	2
Identification	2
More ▼

Sinharay, Sandip	2
A. Corinne Huggins-Manley	1
Bezirhan, Ummugul	1
Cetin-Berber, Dee Duygu	1
Chang, Chi	1
Eric A. Wright	1
Feaster, Daniel	1
García-Pérez, Miguel A.	1
George, Melissa	1
Goretzko, David	1
Huggins-Manley, Anne Corinne	1
Jaki, Thomas	1
Jamil, Tahira	1
Jessica A. R. Logan	1
Johnson, Matthew S.	1
Kim, Minjung	1
Kuan-Yu Jin	1
Lamont, Andrea	1
Lee, Chansoon	1
Ly, Alexander	1
M. David Miller	1
Marcoulides, George A.	1
Marsman, Maarten	1
Menglin Xu	1
Miller, Jeff	1
More ▼