Showing 1 to 15 of 19 results
Peer reviewed
Han, Yuting; Wilson, Mark – Applied Measurement in Education, 2022
A technology-based problem-solving test can automatically capture all the actions of students when they complete tasks and save them as process data. Response sequences are the external manifestations of students' latent intellectual activities, and they contain rich information about students' abilities and different problem-solving…
Descriptors: Technology Uses in Education, Problem Solving, 21st Century Skills, Evaluation Methods
Peer reviewed
Stefanie A. Wind; Benjamin Lugu – Applied Measurement in Education, 2024
Researchers who use measurement models for evaluation purposes often select models with stringent requirements, such as Rasch models, which are parametric. Mokken Scale Analysis (MSA) offers a theory-driven nonparametric modeling approach that may be more appropriate for some measurement applications. Researchers have discussed using MSA as a…
Descriptors: Item Response Theory, Data Analysis, Simulation, Nonparametric Statistics
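The Wind and Lugu entry above centers on Mokken Scale Analysis (MSA). As a point of reference, here is a minimal Python sketch of MSA's core statistic, Loevinger's scalability coefficient H, computed on simulated dichotomous responses. The simulation, item count, and the remark about the 0.3 cutoff are illustrative assumptions; applied MSA work typically uses dedicated software such as the R package mokken rather than hand-rolled code.

```python
# Minimal illustration of Loevinger's scalability coefficient H for dichotomous items,
# the core statistic of Mokken Scale Analysis. Data are simulated; this is not the
# procedure used in the article.
import numpy as np

rng = np.random.default_rng(1)

# Simulate 500 respondents on 6 dichotomous items from a simple monotone model.
theta = rng.normal(size=(500, 1))                  # latent trait
difficulty = np.linspace(-1.5, 1.5, 6)             # item locations
prob = 1 / (1 + np.exp(-(theta - difficulty)))     # Rasch-type probabilities
X = (rng.uniform(size=prob.shape) < prob).astype(int)

def loevinger_H(X):
    """Scale-level H = 1 - (observed Guttman errors) / (expected errors under independence)."""
    n, k = X.shape
    p = X.mean(axis=0)                             # item popularities
    order = np.argsort(-p)                         # easiest item first
    X, p = X[:, order], p[order]
    observed = expected = 0.0
    for i in range(k):
        for j in range(i + 1, k):                  # item i is easier than item j
            # Guttman error: fail the easier item i while passing the harder item j
            observed += np.sum((X[:, i] == 0) & (X[:, j] == 1))
            expected += n * (1 - p[i]) * p[j]
    return 1 - observed / expected

print(f"Scale H = {loevinger_H(X):.3f}")           # values >= 0.3 are conventionally called scalable
```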
Peer reviewed
Sinharay, Sandip; Zhang, Mo; Deane, Paul – Applied Measurement in Education, 2019
Analysis of keystroke logging data is of increasing interest, as evident from a substantial amount of recent research on the topic. Some of the research on keystroke logging data has focused on the prediction of essay scores from keystroke logging features, but linear regression is the only prediction method that has been used in this research.…
Descriptors: Scores, Prediction, Writing Processes, Data Analysis
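The Sinharay, Zhang, and Deane entry compares prediction methods for essay scores built from keystroke-logging features. The sketch below is only a generic illustration of such a comparison on simulated data: the four "keystroke features," the random-forest alternative, and the cross-validation setup are assumptions, not the article's features or models.

```python
# Compare linear regression with one nonlinear alternative for predicting essay scores
# from (simulated) keystroke-logging features, using cross-validated R^2.
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
n = 400
# Hypothetical keystroke features: total time, pause count, mean burst length, deletions
X = rng.normal(size=(n, 4))
scores = 3.0 + X @ np.array([0.8, -0.5, 0.6, -0.3]) + rng.normal(scale=1.0, size=n)

for name, model in [("linear regression", LinearRegression()),
                    ("random forest", RandomForestRegressor(n_estimators=200, random_state=0))]:
    r2 = cross_val_score(model, X, scores, cv=5, scoring="r2").mean()
    print(f"{name}: mean cross-validated R^2 = {r2:.3f}")
```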
Peer reviewed
van Alphen, Thijmen; Jak, Suzanne; Jansen in de Wal, Joost; Schuitema, Jaap; Peetsma, Thea – Applied Measurement in Education, 2022
Intensive longitudinal data is increasingly used to study state-like processes such as changes in daily stress. Measures aimed at collecting such data require the same level of scrutiny regarding scale reliability as traditional questionnaires. The most prevalent methods used to assess reliability of intensive longitudinal measures are based on…
Descriptors: Test Reliability, Measures (Individuals), Anxiety, Data Collection
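The van Alphen et al. entry concerns reliability for intensive longitudinal measures. The sketch below shows only the multilevel intuition, assuming simulated daily measurements, a plain random-intercept model, and a Spearman-Brown-style index for a k-occasion person mean; the article's methods may differ.

```python
# Estimate between- and within-person variance components for an intensive longitudinal
# measure with a random-intercept multilevel model, then form a reliability-like index
# for the mean of k occasions. Illustrative only.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(2)
n_persons, k = 100, 14                                # 100 people, 14 daily measurements
person_mean = rng.normal(scale=1.0, size=n_persons)   # between-person differences
data = pd.DataFrame({
    "person": np.repeat(np.arange(n_persons), k),
    "y": np.repeat(person_mean, k) + rng.normal(scale=1.5, size=n_persons * k),
})

fit = smf.mixedlm("y ~ 1", data, groups=data["person"]).fit()
var_between = fit.cov_re.iloc[0, 0]    # between-person variance
var_within = fit.scale                 # within-person (occasion-level) variance

# Reliability of a k-occasion person mean (Spearman-Brown-style aggregation)
reliability = var_between / (var_between + var_within / k)
print(f"between = {var_between:.2f}, within = {var_within:.2f}, reliability ~ {reliability:.2f}")
```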
Peer reviewed
Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024
Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…
Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests
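The Albano, French, and Vo entry contrasts traditional and intersectional DIF. The sketch below illustrates that contrast in a logistic-regression DIF framing on simulated data; the grouping variables, matching variable, and DIF procedure are assumptions for illustration, not the article's analyses of state testing data.

```python
# Traditional vs. intersectional grouping in a logistic-regression DIF analysis.
# DIF is simulated to appear only for one intersectional subgroup (gender x group_b).
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(3)
n = 2000
d = pd.DataFrame({
    "gender": rng.integers(0, 2, n),
    "group_b": rng.integers(0, 2, n),      # hypothetical second grouping variable
    "total": rng.normal(size=n),           # matching variable (e.g., rest score)
})
logit = -0.3 + 1.2 * d["total"] - 0.6 * (d["gender"] * d["group_b"])
d["item"] = (rng.uniform(size=n) < 1 / (1 + np.exp(-logit))).astype(int)

# Traditional DIF: one grouping variable at a time
trad = smf.logit("item ~ total + gender", d).fit(disp=False)
# Intersectional DIF: the interaction of the grouping variables defines the groups
inter = smf.logit("item ~ total + gender * group_b", d).fit(disp=False)

print(trad.params)
print(inter.params)    # the gender:group_b term captures the intersectional effect
```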
Peer reviewed
Bostic, Jonathan David; Sondergeld, Toni A.; Matney, Gabriel; Stone, Gregory; Hicks, Tiara – Applied Measurement in Education, 2021
Response process validity evidence provides a window into a respondent's cognitive processing. The purpose of this study is to describe a new data collection tool called a whole-class think aloud (WCTA). This work is performed as part of test development for a series of problem-solving measures to be used in elementary and middle grades. Data from…
Descriptors: Data Collection, Protocol Analysis, Problem Solving, Cognitive Processes
Peer reviewed
Pan, Tianshu; Yin, Yue – Applied Measurement in Education, 2017
In this article, we propose using the Bayes factors (BF) to evaluate person fit in item response theory models under the framework of Bayesian evaluation of an informative diagnostic hypothesis. We first discuss the theoretical foundation for this application and how to analyze person fit using BF. To demonstrate the feasibility of this approach,…
Descriptors: Bayesian Statistics, Goodness of Fit, Item Response Theory, Monte Carlo Methods
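The Pan and Yin entry uses Bayes factors for person fit in IRT. The following is a heavily simplified sketch of the Bayes-factor idea only: it compares the marginal likelihood of one response pattern under a Rasch model with a standard-normal prior against a pure-guessing model, via Monte Carlo integration. The item difficulties, the pair of models compared, and the prior are assumptions; this is not the informative-hypothesis framework developed in the article.

```python
# Monte Carlo Bayes factor for person fit: Rasch model (theta ~ N(0,1)) vs. guessing.
import numpy as np

rng = np.random.default_rng(4)
b = np.linspace(-2, 2, 20)                        # known item difficulties (assumed)

def marginal_likelihood_rasch(x, b, n_draws=100_000):
    """Monte Carlo integral of the Rasch likelihood over a N(0,1) prior on theta."""
    theta = rng.normal(size=(n_draws, 1))
    p = 1 / (1 + np.exp(-(theta - b)))
    lik = np.prod(np.where(x == 1, p, 1 - p), axis=1)
    return lik.mean()

x_fitting = (rng.uniform(size=20) < 1 / (1 + np.exp(-(0.5 - b)))).astype(int)  # model-consistent
x_random = rng.integers(0, 2, 20)                                              # random responder

for label, x in [("model-consistent pattern", x_fitting), ("random pattern", x_random)]:
    m_rasch = marginal_likelihood_rasch(x, b)
    m_guess = 0.5 ** len(x)                       # likelihood under pure guessing
    print(f"{label}: BF(Rasch vs. guessing) = {m_rasch / m_guess:.2f}")
```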
Peer reviewed
Yannakoudakis, Helen; Andersen, Øistein E.; Geranpayeh, Ardeshir; Briscoe, Ted; Nicholls, Diane – Applied Measurement in Education, 2018
The development of an automated writing placement model for non-native English learners poses quite a few challenges, among them the difficulty of designing exams that encompass the full range of language proficiency exhibited at different stages of learning. However, acquisition of appropriate training data that are relevant to the…
Descriptors: Automation, Data Processing, Student Placement, English Language Learners
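The Yannakoudakis et al. entry concerns automated writing placement for non-native English learners. As a toy illustration of the general task (mapping learner texts to proficiency levels with a supervised text model), here is a sketch with placeholder texts, TF-IDF features, and a logistic-regression classifier; none of this reflects the authors' system or their training data.

```python
# Toy writing-placement classifier: TF-IDF features + logistic regression.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

texts = [
    "I like school very much",                    # hypothetical beginner writing
    "My weekend was fun and I play football",
    "Although the weather was poor, we decided to proceed with the excursion",
    "The committee's deliberations reflected a nuanced understanding of the policy",
]
levels = ["beginner", "beginner", "advanced", "advanced"]

model = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), LogisticRegression(max_iter=1000))
model.fit(texts, levels)
print(model.predict(["We considered several alternatives before reaching a conclusion"]))
```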
Peer reviewed
DeMars, Christine – Applied Measurement in Education, 2015
In generalizability theory studies in large-scale testing contexts, sometimes a facet is very sparsely crossed with the object of measurement. For example, when assessments are scored by human raters, it may not be practical to have every rater score all students. Sometimes the scoring is systematically designed such that the raters are…
Descriptors: Educational Assessment, Measurement, Data, Generalizability Theory
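The DeMars entry deals with generalizability theory when a facet such as raters is sparsely crossed with persons. The sketch below estimates variance components for the fully crossed persons-by-raters baseline from ANOVA mean squares on simulated scores; the sparse, systematically missing designs the article addresses require more elaborate estimation and are not shown.

```python
# ANOVA-based G-study for a fully crossed persons-by-raters (p x r) design.
import numpy as np

rng = np.random.default_rng(5)
n_p, n_r = 200, 5
person = rng.normal(scale=1.0, size=(n_p, 1))      # true person effects
rater = rng.normal(scale=0.5, size=(1, n_r))       # rater severity effects
scores = person + rater + rng.normal(scale=0.8, size=(n_p, n_r))

grand = scores.mean()
person_means = scores.mean(axis=1, keepdims=True)
rater_means = scores.mean(axis=0, keepdims=True)

ms_p = n_r * np.sum((person_means - grand) ** 2) / (n_p - 1)
ms_r = n_p * np.sum((rater_means - grand) ** 2) / (n_r - 1)
residual = scores - person_means - rater_means + grand
ms_pr = np.sum(residual ** 2) / ((n_p - 1) * (n_r - 1))

var_pr = ms_pr                                     # person-by-rater interaction + error
var_p = (ms_p - ms_pr) / n_r                       # person (universe-score) variance
var_r = (ms_r - ms_pr) / n_p                       # rater variance
g_coef = var_p / (var_p + var_pr / n_r)            # generalizability of a 5-rater mean
print(f"var_p={var_p:.2f}, var_r={var_r:.2f}, var_pr,e={var_pr:.2f}, G={g_coef:.2f}")
```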
Peer reviewed
Lee, Guemin; Lee, Won-Chan – Applied Measurement in Education, 2016
The main purposes of this study were to develop bi-factor multidimensional item response theory (BF-MIRT) observed-score equating procedures for mixed-format tests and to investigate relative appropriateness of the proposed procedures. Using data from a large-scale testing program, three types of pseudo data sets were formulated: matched samples,…
Descriptors: Test Format, Multidimensional Scaling, Item Response Theory, Equated Scores
Peer reviewed
Dadey, Nathan; Lyons, Susan; DePascale, Charles – Applied Measurement in Education, 2018
Evidence of comparability is generally needed whenever there are variations in the conditions of an assessment administration, including variations introduced by the administration of an assessment on multiple digital devices (e.g., tablet, laptop, desktop). This article is meant to provide a comprehensive examination of issues relevant to the…
Descriptors: Evaluation Methods, Computer Assisted Testing, Educational Technology, Technology Uses in Education
Peer reviewed
Fagginger Auer, Marije F.; Hickendorff, Marian; Van Putten, Cornelis M.; Béguin, Anton A.; Heiser, Willem J. – Applied Measurement in Education, 2016
A first application of multilevel latent class analysis (MLCA) to educational large-scale assessment data is demonstrated. This statistical technique addresses several of the challenges posed by assessment data. Importantly, MLCA allows modeling of the often ignored teacher effects and of the joint influence of teacher and student variables.…
Descriptors: Educational Assessment, Multivariate Analysis, Classification, Data
Peer reviewed
Allen, Jeff – Applied Measurement in Education, 2017
Using a sample of schools testing annually in grades 9-11 with a vertically linked series of assessments, a latent growth curve model is used to model test scores with student intercepts and slopes nested within school. Missed assessments can occur because of student mobility, student dropout, absenteeism, and other reasons. Missing data…
Descriptors: Achievement Gains, Academic Achievement, Growth Models, Scores
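The Allen entry fits a latent growth curve model to vertically linked grade 9-11 scores with students nested in schools. As a simplified stand-in, the sketch below fits a linear mixed model with random student intercepts and slopes to simulated scores; the school level, the latent-variable formulation, and the missing-data handling discussed in the article are omitted.

```python
# Simplified growth model: random student intercepts and slopes across grades 9-11.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(6)
n_students = 300
grades = np.array([9, 10, 11])
intercepts = rng.normal(200, 15, n_students)       # hypothetical vertical scale scores
slopes = rng.normal(10, 3, n_students)             # average growth per grade

rows = []
for s in range(n_students):
    for g in grades:
        rows.append({"student": s, "grade_c": g - 9,
                     "score": intercepts[s] + slopes[s] * (g - 9) + rng.normal(0, 5)})
data = pd.DataFrame(rows)

fit = smf.mixedlm("score ~ grade_c", data, groups=data["student"],
                  re_formula="~grade_c").fit()
print(fit.summary())                               # fixed effects: mean intercept and mean growth
```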
Peer reviewed
Finch, Holmes – Applied Measurement in Education, 2011
Methods of uniform differential item functioning (DIF) detection have been extensively studied in the complete data case. However, less work has been done examining the performance of these methods when missing item responses are present. Research that has been done in this regard appears to indicate that treating missing item responses as…
Descriptors: Test Bias, Data Analysis, Error of Measurement
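The Finch entry studies uniform DIF detection when item responses are missing. The sketch below shows the underlying issue using the Mantel-Haenszel common odds ratio as the DIF index; the simulated data, the missingness mechanism, and the two missing-data treatments compared (score as incorrect vs. omit) are assumptions chosen for illustration, not the article's design.

```python
# How the treatment of missing responses can shift a Mantel-Haenszel DIF statistic.
import numpy as np

rng = np.random.default_rng(7)
n = 4000
group = rng.integers(0, 2, n)                      # 0 = reference, 1 = focal
theta = rng.normal(size=n)
item = (rng.uniform(size=n) < 1 / (1 + np.exp(-theta))).astype(float)  # DIF-free item
total = np.clip(np.round(theta * 2 + 5), 0, 10)    # crude matching score

# Focal group omits the item more often (missing not at random)
missing = (group == 1) & (rng.uniform(size=n) < 0.25)

def mh_odds_ratio(item, group, total):
    num = den = 0.0
    for k in np.unique(total):
        m = total == k
        a = np.sum(m & (group == 0) & (item == 1))  # reference correct
        b = np.sum(m & (group == 0) & (item == 0))  # reference incorrect
        c = np.sum(m & (group == 1) & (item == 1))  # focal correct
        d = np.sum(m & (group == 1) & (item == 0))  # focal incorrect
        nk = a + b + c + d
        if nk > 0:
            num += a * d / nk
            den += b * c / nk
    return num / den

item_as_incorrect = np.where(missing, 0.0, item)   # treat missing as wrong
keep = ~missing                                    # listwise omit missing

print("MH odds ratio, missing scored incorrect:", round(mh_odds_ratio(item_as_incorrect, group, total), 2))
print("MH odds ratio, missing omitted:        ", round(mh_odds_ratio(item[keep], group[keep], total[keep]), 2))
```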
Peer reviewed
Finch, Holmes; Monahan, Patrick – Applied Measurement in Education, 2008
This article introduces a bootstrap generalization to the Modified Parallel Analysis (MPA) method of test dimensionality assessment using factor analysis. This methodology, based on the use of Marginal Maximum Likelihood nonlinear factor analysis, provides for the calculation of a test statistic based on a parametric bootstrap using the MPA…
Descriptors: Monte Carlo Methods, Factor Analysis, Generalization, Methods
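The Finch and Monahan entry introduces a bootstrap generalization of Modified Parallel Analysis based on marginal maximum likelihood nonlinear factor analysis. The sketch below conveys only the parametric-bootstrap logic, using an ordinary linear, continuous-data parallel analysis of eigenvalues; it is a simplified stand-in, not the MPA procedure itself.

```python
# Parametric-bootstrap parallel analysis: compare observed eigenvalues with the
# distribution of eigenvalues from data simulated under a one-factor model.
import numpy as np

rng = np.random.default_rng(8)
n, k = 500, 8
loadings = np.full(k, 0.7)

def simulate_one_factor(n, loadings):
    f = rng.normal(size=(n, 1))
    e = rng.normal(size=(n, len(loadings))) * np.sqrt(1 - loadings**2)
    return f @ loadings[None, :] + e

def eigenvalues(X):
    return np.sort(np.linalg.eigvalsh(np.corrcoef(X, rowvar=False)))[::-1]

# "Observed" data actually contain a second cluster of shared variance,
# so the second eigenvalue should exceed the one-factor reference.
observed = simulate_one_factor(n, loadings)
observed[:, 4:] += 0.6 * rng.normal(size=(n, 1))
obs_eig = eigenvalues(observed)

# Bootstrap reference distribution of eigenvalues under the one-factor model
boot = np.array([eigenvalues(simulate_one_factor(n, loadings)) for _ in range(200)])
threshold = np.percentile(boot, 95, axis=0)

print("second observed eigenvalue:", round(obs_eig[1], 2),
      "| 95th percentile under one factor:", round(threshold[1], 2))
```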