ERIC - Search Results

Showing 1 to 15 of 23 results

Detecting Cheating in Large-Scale Assessment: The Transfer of Detectors to New Tests

Peer reviewed

Ranger, Jochen; Schmidt, Nico; Wolgast, Anett – Educational and Psychological Measurement, 2023

Recent approaches to the detection of cheaters in tests employ detectors from the field of machine learning. Detectors based on supervised learning algorithms achieve high accuracy but require labeled data sets with identified cheaters for training. Labeled data sets are usually not available at an early stage of the assessment period. In this…

Descriptors: Identification, Cheating, Information Retrieval, Tests

Detecting Rater Biases in Sparse Rater-Mediated Assessment Networks

Peer reviewed

Wind, Stefanie A.; Ge, Yuan – Educational and Psychological Measurement, 2021

Practical constraints in rater-mediated assessments limit the availability of complete data. Instead, most scoring procedures include one or two ratings for each performance, with overlapping performances across raters or linking sets of multiple-choice items to facilitate model estimation. These incomplete scoring designs present challenges for…

Descriptors: Evaluators, Scoring, Data Collection, Design

Developing and Validating a Novel Anonymous Method for Matching Longitudinal School-Based Data

Peer reviewed

Agley, Jon; Tidd, David; Jun, Mikyoung; Eldridge, Lori; Xiao, Yunyu; Sussman, Steve; Jayawardene, Wasantha; Agley, Daniel; Gassman, Ruth; Dickinson, Stephanie L. – Educational and Psychological Measurement, 2021

Prospective longitudinal data collection is an important way for researchers and evaluators to assess change. In school-based settings, for low-risk and/or likely-beneficial interventions or surveys, data quality and ethical standards are both arguably stronger when using a waiver of parental consent--but doing so often requires the use of…

Descriptors: Data Analysis, Longitudinal Studies, Data Collection, Intervention

Exploring the Combined Effects of Rater Misfit and Differential Rater Functioning in Performance Assessments

Peer reviewed

Wind, Stefanie A.; Guo, Wenjing – Educational and Psychological Measurement, 2019

Rater effects, or raters' tendencies to assign ratings to performances that are different from the ratings that the performances warranted, are well documented in rater-mediated assessments across a variety of disciplines. In many real-data studies of rater effects, researchers have reported that raters exhibit more than one effect, such as a…

Descriptors: Evaluators, Bias, Scoring, Data Collection

Methodological Issues with Coding Participants in Anonymous Psychological Longitudinal Studies

Peer reviewed

Audette, Lillian M.; Hammond, Marie S.; Rochester, Natalie K. – Educational and Psychological Measurement, 2020

Longitudinal studies are commonly used in the social and behavioral sciences to answer a wide variety of research questions. Longitudinal researchers often collect data anonymously from participants when studying sensitive topics to ensure that accurate information is provided. One difficulty gathering longitudinal anonymous data is that of…

Descriptors: Research Methodology, Longitudinal Studies, Research Design, Social Science Research

Performing Inferential Statistics Prior to Data Collection

Peer reviewed

Trafimow, David; MacDonald, Justin A. – Educational and Psychological Measurement, 2017

Typically, in education and psychology research, the investigator collects data and subsequently performs descriptive and inferential statistics. For example, a researcher might compute group means and use the null hypothesis significance testing procedure to draw conclusions about the populations from which the groups were drawn. We propose an…

Descriptors: Statistical Inference, Statistics, Data Collection, Equations (Mathematics)

The Impact of Ignoring the Level of Nesting Structure in Nonparametric Multilevel Latent Class Models

Peer reviewed

Park, Jungkyu; Yu, Hsiu-Ting – Educational and Psychological Measurement, 2016

The multilevel latent class model (MLCM) is a multilevel extension of a latent class model (LCM) that is used to analyze data with a nested structure. The nonparametric version of an MLCM assumes a discrete latent variable at a higher-level nesting structure to account for the dependency among observations nested within a higher-level unit. In…

Descriptors: Hierarchical Linear Modeling, Nonparametric Statistics, Data Analysis, Simulation

The Stabilizing Influences of Linking Set Size and Model-Data Fit in Sparse Rater-Mediated Assessment Networks

Peer reviewed

Wind, Stefanie A.; Jones, Eli – Educational and Psychological Measurement, 2018

Previous research includes frequent admonitions regarding the importance of establishing connectivity in data collection designs prior to the application of Rasch models. However, details regarding the influence of characteristics of the linking sets used to establish connections among facets, such as locations on the latent variable, model-data…

Descriptors: Data Collection, Goodness of Fit, Computation, Networks

Data Integration Approaches to Longitudinal Growth Modeling

Peer reviewed

Marcoulides, Katerina M.; Grimm, Kevin J. – Educational and Psychological Measurement, 2017

Synthesizing results from multiple studies is a daunting task during which researchers must tackle a variety of challenges. The task is even more demanding when studying developmental processes longitudinally and when different instruments are used to measure constructs. Data integration methodology is an emerging field that enables researchers to…

Descriptors: Growth Models, Longitudinal Studies, Mathematics Skills, Achievement Tests

Comparison of Two Approaches for Handling Missing Covariates in Logistic Regression

Peer reviewed

Peng, Chao-Ying Joanne; Zhu, Jin – Educational and Psychological Measurement, 2008

For the past 25 years, methodological advances have been made in missing data treatment. Most published work has focused on missing data in dependent variables under various conditions. The present study seeks to fill the void by comparing two approaches for handling missing data in categorical covariates in logistic regression: the…

Descriptors: Regression (Statistics), Comparative Analysis, Evaluation Methods, Equations (Mathematics)

Direct Behavior Rating (DBR): Generalizability and Dependability across Raters and Observations

Peer reviewed

Christ, Theodore J.; Riley-Tillman, T. Chris; Chafouleas, Sandra M.; Boice, Christina H. – Educational and Psychological Measurement, 2010

Generalizability theory was used to examine the generalizability and dependability of outcomes from two single-item Direct Behavior Rating (DBR) scales: DBR of actively manipulating and DBR of visually distracted. DBR is a behavioral assessment tool with specific instrumentation and procedures that can be used by a variety of service delivery…

Descriptors: Generalizability Theory, Student Behavior, Data Collection, Student Evaluation

The Core Self-Evaluation Scale: Further Construct Validation Evidence

Peer reviewed

Gardner, Donald G.; Pierce, Jon L. – Educational and Psychological Measurement, 2010

The authors empirically examined two operationalizations of the core self-evaluation construct: (a) the Judge, Erez, Bono, and Thoresen 12-item scale and (b) a composite measure of self-esteem, self-efficacy, locus of control, and neuroticism. The study found that the composite scale relates more strongly than the shorter scale to performance,…

Descriptors: Locus of Control, Self Efficacy, Construct Validity, Measures (Individuals)

Evaluating the Validity of Educational Rating Data.

Peer reviewed

Harwell, Michael – Educational and Psychological Measurement, 1999

Suggests two complementary frameworks for providing validity evidence for rating data. One conceptualizes raters as data collection instruments that should be subject to traditional procedures for establishing validity evidence; the other evaluates studies using raters from an experimental design perspective allowing the assessment of the study's…

Descriptors: Data Analysis, Data Collection, Educational Research, Models

An Adjustment for Nonresponse in Sample Surveys.

Peer reviewed

Daniel, Wayne W.; And Others – Educational and Psychological Measurement, 1982

To test the use of Bayes's theorem to adjust for nonresponse bias, 600 hospitals were used in a simulated sample survey. On the basis of known information on five variables, Bayes's formula correctly predicted the status of 92 of the 100 "nonrespondents" relative to a sixth variable. (Author/BW)

Descriptors: Bayesian Statistics, Data Analysis, Data Collection, Hospitals

IRT True-Score Test Equating: A Guide through Assumptions and Applications

Peer reviewed

von Davier, Alina A.; Wilson, Christine – Educational and Psychological Measurement, 2007

This article discusses the assumptions required by the item response theory (IRT) true-score equating method (with Stocking & Lord, 1983; scaling approach), which is commonly used in the nonequivalent groups with an anchor data-collection design. More precisely, this article investigates the assumptions made at each step by the IRT approach to…

Descriptors: Calculus, Item Response Theory, Scores, Data Collection
