Publication Date
| In 2026 | 0 |
| Since 2025 | 53 |
| Since 2022 (last 5 years) | 411 |
| Since 2017 (last 10 years) | 914 |
| Since 2007 (last 20 years) | 1965 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Kritika Thapa – ProQuest LLC, 2023
Measurement invariance is crucial for making valid comparisons across different groups (Kline, 2016; Vandenberg, 2002). To address the challenges associated with invariance testing such as large sample size requirements, the complexity of the model, etc., applied researchers have incorporated parcels. Parcels have been shown to alleviate skewness,…
Descriptors: Elementary Secondary Education, Achievement Tests, Foreign Countries, International Assessment
Robitzsch, Alexander; Lüdtke, Oliver – Assessment in Education: Principles, Policy & Practice, 2019
One major aim of international large-scale assessments (ILSAs) is to monitor changes in student performance over time. To accomplish this task, a set of common items is repeatedly administered in each assessment and linking methods are used to align the results from the different assessments on a common scale. The present article introduces a…
Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students
Howe, Roger – ZDM: The International Journal on Mathematics Education, 2019
This paper makes a proposal, from the perspective of a research mathematician interested in mathematics education, for broadening and deepening whole number arithmetic instruction, to make it more relevant for the twenty-first century, in particular, to enable students to deal with large numbers, arguably an essential skill for modern citizenship.…
Descriptors: Number Concepts, Numbers, Error of Measurement, Computation
Vegetabile, Brian G.; Stout-Oswald, Stephanie A.; Davis, Elysia Poggi; Baram, Tallie Z.; Stern, Hal S. – Journal of Educational and Behavioral Statistics, 2019
Predictability of behavior is an important characteristic in many fields including biology, medicine, marketing, and education. When a sequence of actions performed by an individual can be modeled as a stationary time-homogeneous Markov chain the predictability of the individual's behavior can be quantified by the entropy rate of the process. This…
Descriptors: Markov Processes, Prediction, Behavior, Computation
Chang, Wanchen; Pituch, Keenan A. – Journal of Experimental Education, 2019
When data for multiple outcomes are collected in a multilevel design, researchers can select a univariate or multivariate analysis to examine group-mean differences. When correlated outcomes are incomplete, a multivariate multilevel model (MVMM) may provide greater power than univariate multilevel models (MLMs). For a two-group multilevel design…
Descriptors: Hierarchical Linear Modeling, Multivariate Analysis, Research Problems, Error of Measurement
Xin Liu; Kajsa Yang Hansen; Jan De Neve; Martin Valcke – Instructional Science: An International Journal of the Learning Sciences, 2024
The present study examines the measurement property of instructional quality in mathematics education, building on data from teachers and students, by combing TALIS 2013 and PISA 2012 linkage data from seven countries. Confirmatory factor analysis was applied to examine the dimensionality of the construct instructional quality in mathematics…
Descriptors: Teacher Attitudes, Student Attitudes, Mathematics Instruction, Educational Quality
Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…
Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics
Kate E. Walton – ACT, Inc., 2024
There is a tradeoff between scale length and psychometric concerns. The two are, in fact, directly linked. Generally, when scales are shortened, reliability is reduced, and when scales are lengthened, reliability is improved, provided the items added to the scale are comparable psychometrically (AERA et al., 2014). Scale reliability, in turn,…
Descriptors: Psychometrics, Error of Measurement, Rating Scales, Reliability
Priemer, Burkhard; Hellwig, Julia – International Journal of Science and Mathematics Education, 2018
Estimating measurement uncertainties is important for experimental scientific work. However, this is very often neglected in school curricula and teaching practice, even though experimental work is seen as a fundamental part of teaching science. In order to call attention to the relevance of measurement uncertainties, we developed a comprehensive…
Descriptors: Measurement, Error of Measurement, Secondary School Students, Models
Tourangeau, Roger – Quality Assurance in Education: An International Perspective, 2018
Purpose: This paper aims to examine the cognitive processes involved in answering survey questions. It also briefly discusses how the cognitive viewpoint has been challenged by other approaches (such as conversational analysis). Design/methodology/approach: The paper reviews the major components of the response process and summarizes work…
Descriptors: Surveys, Cognitive Processes, Error of Measurement, Accuracy
Mangione, Kathleen K.; Macropol, Kathy; Jia, Yanxia; Tevald, Michael; Harris, Shane; Wolff, Edward; Craik, Rebecca – Measurement in Physical Education and Exercise Science, 2018
Heart rate (HR) by time curves could be useful as a measure of treatment fidelity (TF). The purposes were to describe the frequency of common recording irregularities (e.g. errors) observed during exercise, validate a process to correct those errors, and determine whether there is a clinically meaningful benefit to data correction. In total, 1895…
Descriptors: Exercise, Older Adults, Metabolism, Injuries
Greifer, Noah – ProQuest LLC, 2018
There has been some research in the use of propensity scores in the context of measurement error in the confounding variables; one recommended method is to generate estimates of the mis-measured covariate using a latent variable model, and to use those estimates (i.e., factor scores) in place of the covariate. I describe a simulation study…
Descriptors: Evaluation Methods, Probability, Scores, Statistical Analysis
Park, Sung Eun; Ahn, Soyeon; Zopluoglu, Cengiz – Educational and Psychological Measurement, 2021
This study presents a new approach to synthesizing differential item functioning (DIF) effect size: First, using correlation matrices from each study, we perform a multigroup confirmatory factor analysis (MGCFA) that examines measurement invariance of a test item between two subgroups (i.e., focal and reference groups). Then we synthesize, across…
Descriptors: Item Analysis, Effect Size, Difficulty Level, Monte Carlo Methods
Erman Aslanoglu, Aslihan; Sata, Mehmet – Participatory Educational Research, 2021
When students present writing tasks that require higher order thinking skills to work, one of the most important problems is scoring these writing tasks objectively. The fact that raters give scores below or above their performance based on several environmental factors affects the consistency of the measurements. Inconsistencies in scoring…
Descriptors: Interrater Reliability, Evaluators, Error of Measurement, Writing Evaluation
Lenz, A. Stephen; Li, Chi – Measurement and Evaluation in Counseling and Development, 2022
The factor structure, measurement invariance, and internal consistency of the Patient Health Questionnaire for Depression and Anxiety (PHQ-4) was examined with a rural, predominately Hispanic sample (N = 711). Findings supported use of a one-factor model across gender, age groups, and Spanish-speaking groups. Counseling practice and research…
Descriptors: Psychometrics, Error of Measurement, Patients, Questionnaires

Direct link
Peer reviewed
