Showing 1 to 15 of 23 results
Peer reviewed
Ranger, Jochen; Schmidt, Nico; Wolgast, Anett – Educational and Psychological Measurement, 2023
Recent approaches to the detection of cheaters in tests employ detectors from the field of machine learning. Detectors based on supervised learning algorithms achieve high accuracy but require labeled data sets with identified cheaters for training. Labeled data sets are usually not available at an early stage of the assessment period. In this…
Descriptors: Identification, Cheating, Information Retrieval, Tests
Peer reviewed
Wind, Stefanie A.; Ge, Yuan – Educational and Psychological Measurement, 2021
Practical constraints in rater-mediated assessments limit the availability of complete data. Instead, most scoring procedures include one or two ratings for each performance, with overlapping performances across raters or linking sets of multiple-choice items to facilitate model estimation. These incomplete scoring designs present challenges for…
Descriptors: Evaluators, Scoring, Data Collection, Design
Peer reviewed
Agley, Jon; Tidd, David; Jun, Mikyoung; Eldridge, Lori; Xiao, Yunyu; Sussman, Steve; Jayawardene, Wasantha; Agley, Daniel; Gassman, Ruth; Dickinson, Stephanie L. – Educational and Psychological Measurement, 2021
Prospective longitudinal data collection is an important way for researchers and evaluators to assess change. In school-based settings, for low-risk and/or likely-beneficial interventions or surveys, data quality and ethical standards are both arguably stronger when using a waiver of parental consent--but doing so often requires the use of…
Descriptors: Data Analysis, Longitudinal Studies, Data Collection, Intervention
Peer reviewed
Wind, Stefanie A.; Guo, Wenjing – Educational and Psychological Measurement, 2019
Rater effects, or raters' tendencies to assign ratings to performances that are different from the ratings that the performances warranted, are well documented in rater-mediated assessments across a variety of disciplines. In many real-data studies of rater effects, researchers have reported that raters exhibit more than one effect, such as a…
Descriptors: Evaluators, Bias, Scoring, Data Collection
Peer reviewed
Audette, Lillian M.; Hammond, Marie S.; Rochester, Natalie K. – Educational and Psychological Measurement, 2020
Longitudinal studies are commonly used in the social and behavioral sciences to answer a wide variety of research questions. Longitudinal researchers often collect data anonymously from participants when studying sensitive topics to ensure that accurate information is provided. One difficulty gathering longitudinal anonymous data is that of…
Descriptors: Research Methodology, Longitudinal Studies, Research Design, Social Science Research
Peer reviewed
Trafimow, David; MacDonald, Justin A. – Educational and Psychological Measurement, 2017
Typically, in education and psychology research, the investigator collects data and subsequently performs descriptive and inferential statistics. For example, a researcher might compute group means and use the null hypothesis significance testing procedure to draw conclusions about the populations from which the groups were drawn. We propose an…
Descriptors: Statistical Inference, Statistics, Data Collection, Equations (Mathematics)
Peer reviewed
Park, Jungkyu; Yu, Hsiu-Ting – Educational and Psychological Measurement, 2016
The multilevel latent class model (MLCM) is a multilevel extension of a latent class model (LCM) that is used to analyze data with a nested structure. The nonparametric version of an MLCM assumes a discrete latent variable at a higher-level nesting structure to account for the dependency among observations nested within a higher-level unit. In…
Descriptors: Hierarchical Linear Modeling, Nonparametric Statistics, Data Analysis, Simulation
Peer reviewed
Wind, Stefanie A.; Jones, Eli – Educational and Psychological Measurement, 2018
Previous research includes frequent admonitions regarding the importance of establishing connectivity in data collection designs prior to the application of Rasch models. However, details regarding the influence of characteristics of the linking sets used to establish connections among facets, such as locations on the latent variable, model-data…
Descriptors: Data Collection, Goodness of Fit, Computation, Networks
Peer reviewed
Marcoulides, Katerina M.; Grimm, Kevin J. – Educational and Psychological Measurement, 2017
Synthesizing results from multiple studies is a daunting task during which researchers must tackle a variety of challenges. The task is even more demanding when studying developmental processes longitudinally and when different instruments are used to measure constructs. Data integration methodology is an emerging field that enables researchers to…
Descriptors: Growth Models, Longitudinal Studies, Mathematics Skills, Achievement Tests
Peer reviewed
Peng, Chao-Ying Joanne; Zhu, Jin – Educational and Psychological Measurement, 2008
For the past 25 years, methodological advances have been made in missing data treatment. Most published work has focused on missing data in dependent variables under various conditions. The present study seeks to fill the void by comparing two approaches for handling missing data in categorical covariates in logistic regression: the…
Descriptors: Regression (Statistics), Comparative Analysis, Evaluation Methods, Equations (Mathematics)
Peer reviewed
Christ, Theodore J.; Riley-Tillman, T. Chris; Chafouleas, Sandra M.; Boice, Christina H. – Educational and Psychological Measurement, 2010
Generalizability theory was used to examine the generalizability and dependability of outcomes from two single-item Direct Behavior Rating (DBR) scales: DBR of actively manipulating and DBR of visually distracted. DBR is a behavioral assessment tool with specific instrumentation and procedures that can be used by a variety of service delivery…
Descriptors: Generalizability Theory, Student Behavior, Data Collection, Student Evaluation
Peer reviewed
Gardner, Donald G.; Pierce, Jon L. – Educational and Psychological Measurement, 2010
The authors empirically examined two operationalizations of the core self-evaluation construct: (a) the Judge, Erez, Bono, and Thoresen 12-item scale and (b) a composite measure of self-esteem, self-efficacy, locus of control, and neuroticism. The study found that the composite scale relates more strongly than the shorter scale to performance,…
Descriptors: Locus of Control, Self Efficacy, Construct Validity, Measures (Individuals)
Peer reviewed
Harwell, Michael – Educational and Psychological Measurement, 1999
Suggests two complementary frameworks for providing validity evidence for rating data. One conceptualizes raters as data collection instruments that should be subject to traditional procedures for establishing validity evidence; the other evaluates studies using raters from an experimental design perspective allowing the assessment of the study's…
Descriptors: Data Analysis, Data Collection, Educational Research, Models
Peer reviewed
Daniel, Wayne W.; And Others – Educational and Psychological Measurement, 1982
To test the use of Bayes's theorem to adjust for nonresponse bias, 600 hospitals were used in a simulated sample survey. On the basis of known information on five variables, Bayes's formula correctly predicted the status of 92 of the 100 "nonrespondents" relative to a sixth variable. (Author/BW)
Descriptors: Bayesian Statistics, Data Analysis, Data Collection, Hospitals
Peer reviewed
von Davier, Alina A.; Wilson, Christine – Educational and Psychological Measurement, 2007
This article discusses the assumptions required by the item response theory (IRT) true-score equating method (with Stocking & Lord, 1983; scaling approach), which is commonly used in the nonequivalent groups with an anchor data-collection design. More precisely, this article investigates the assumptions made at each step by the IRT approach to…
Descriptors: Calculus, Item Response Theory, Scores, Data Collection