Publication Date
| In 2026 | 0 |
| Since 2025 | 52 |
| Since 2022 (last 5 years) | 410 |
| Since 2017 (last 10 years) | 913 |
| Since 2007 (last 20 years) | 1964 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Höhne, Jan Karem; Krebs, Dagmar – International Journal of Social Research Methodology, 2021
Measuring respondents' attitudes is a crucial task in numerous social science disciplines. A popular way to measure attitudes is to use survey questions with rating scales. However, research has shown that especially the design of rating scales can have a profound impact on respondents' answer behavior. While some scale design aspects, such as…
Descriptors: Attitude Measures, Rating Scales, Telephone Surveys, Response Style (Tests)
Rank-Normalization, Folding, and Localization: An Improved [R-Hat] for Assessing Convergence of MCMC
Aki Vehtari; Andrew Gelman; Daniel Simpson; Bob Carpenter; Paul-Christian Burkner – Grantee Submission, 2021
Markov chain Monte Carlo is a key computational tool in Bayesian statistics, but it can be challenging to monitor the convergence of an iterative stochastic algorithm. In this paper we show that the convergence diagnostic [R-hat] of Gelman and Rubin (1992) has serious flaws. Traditional [R-hat] will fail to correctly diagnose convergence failures…
Descriptors: Markov Processes, Monte Carlo Methods, Bayesian Statistics, Efficiency
Sophie Lilit Litschwartz – ProQuest LLC, 2021
In education research test scores are a common object of analysis. Across studies test scores can be an important outcome, a highly predictive covariate, or a means of assigning treatment. However, test scores are a measure of an underlying proficiency we can't observe directly and so contain error. This measurement error has implications for how…
Descriptors: Scores, Inferences, Educational Research, Evaluation Methods
Rüttenauer, Tobias – Sociological Methods & Research, 2022
Spatial regression models provide the opportunity to analyze spatial data and spatial processes. Yet, several model specifications can be used, all assuming different types of spatial dependence. This study summarizes the most commonly used spatial regression models and offers a comparison of their performance by using Monte Carlo experiments. In…
Descriptors: Models, Monte Carlo Methods, Social Science Research, Data Analysis
Cooperman, Allison W.; Weiss, David J.; Wang, Chun – Educational and Psychological Measurement, 2022
Adaptive measurement of change (AMC) is a psychometric method for measuring intra-individual change on one or more latent traits across testing occasions. Three hypothesis tests--a Z test, likelihood ratio test, and score ratio index--have demonstrated desirable statistical properties in this context, including low false positive rates and high…
Descriptors: Error of Measurement, Psychometrics, Hypothesis Testing, Simulation
Warne, Russell T. – Journal of Advanced Academics, 2022
Recently, Picho-Kiroga (2021) published a meta-analysis on the effect of stereotype threat on females. Their conclusion was that the average effect size for stereotype threat studies was d = .28, but that effects are overstated because the majority of studies on stereotype threat in females include methodological characteristics that inflate the…
Descriptors: Sex Stereotypes, Females, Meta Analysis, Effect Size
Fu, Yuanshu; Wen, Zhonglin; Wang, Yang – Educational and Psychological Measurement, 2022
Composite reliability, or coefficient omega, can be estimated using structural equation modeling. Composite reliability is usually estimated under the basic independent clusters model of confirmatory factor analysis (ICM-CFA). However, due to the existence of cross-loadings, the model fit of the exploratory structural equation model (ESEM) is…
Descriptors: Comparative Analysis, Structural Equation Models, Factor Analysis, Reliability
Tucci, Alexander; Plante, Elena; Heilmann, John J.; Miller, Jon F. – Journal of Speech, Language, and Hearing Research, 2022
Purpose: This exploratory study sought to establish the psychometric stability of a dynamic norming system using the Systematic Analysis of Language Transcripts (SALT) databases. Dynamic norming is the process by which clinicians select a subset of the normative database sample matched to their individual client's demographic characteristics.…
Descriptors: Norms, Psychometrics, Databases, Error of Measurement
Silber, Henning; Roßmann, Joss; Gummer, Tobias – Field Methods, 2022
Attention checks detect inattentiveness by instructing respondents to perform a specific task. However, while respondents may correctly process the task, they may choose to not comply with the instructions. We investigated the issue of noncompliance in attention checks in two web surveys. In Study 1, we measured respondents' attitudes toward…
Descriptors: Compliance (Psychology), Attention, Task Analysis, Online Surveys
Miyazaki, Yasuo; Kamata, Akihito; Uekawa, Kazuaki; Sun, Yizhi – Educational and Psychological Measurement, 2022
This paper investigated consequences of measurement error in the pretest on the estimate of the treatment effect in a pretest-posttest design with the analysis of covariance (ANCOVA) model, focusing on both the direction and magnitude of its bias. Some prior studies have examined the magnitude of the bias due to measurement error and suggested…
Descriptors: Error of Measurement, Pretesting, Pretests Posttests, Statistical Bias
An Analysis of Differential Bundle Functioning in Multidimensional Tests Using the SIBTEST Procedure
Özdogan, Didem; Kelecioglu, Hülya – International Journal of Assessment Tools in Education, 2022
This study aims to analyze the differential bundle functioning in multidimensional tests with a specific purpose to detect this effect through differentiating the location of the item with DIF in the test, the correlation between the dimensions, the sample size, and the ratio of reference to focal group size. The first 10 items of the test that is…
Descriptors: Correlation, Sample Size, Test Items, Item Analysis
Erik-Jan van Kesteren; Daniel L. Oberski – Structural Equation Modeling: A Multidisciplinary Journal, 2022
Structural equation modeling (SEM) is being applied to ever more complex data types and questions, often requiring extensions such as regularization or novel fitting functions. To extend SEM, researchers currently need to completely reformulate SEM and its optimization algorithm -- a challenging and time-consuming task. In this paper, we introduce…
Descriptors: Structural Equation Models, Computation, Graphs, Algorithms
von Hippel, Paul T. – Sociological Methods & Research, 2020
When using multiple imputation, users often want to know how many imputations they need. An old answer is that 2-10 imputations usually suffice, but this recommendation only addresses the efficiency of point estimates. You may need more imputations if, in addition to efficient point estimates, you also want standard error (SE) estimates that would…
Descriptors: Computation, Error of Measurement, Data Analysis, Children
Grant, Chris; Beach, Tyson A. C.; Hogg-Johnson, Sheilah; Chivers, Michael; Howarth, Samuel J. – Measurement in Physical Education and Exercise Science, 2020
This study evaluated whether real-time applied load feedback and a predefined applied load limit improved inter-session reliability and measurement error of passive glenohumeral rotation range-of-motion measurements. Twenty-one male recreational overhead athletes completed two data collection sessions, approximately 1-week apart. Measurements of…
Descriptors: Reliability, Measurement, Males, Athletes
Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023
This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…
Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation

Peer reviewed
Direct link
