Publication Date
| In 2026 | 0 |
| Since 2025 | 52 |
| Since 2022 (last 5 years) | 410 |
| Since 2017 (last 10 years) | 913 |
| Since 2007 (last 20 years) | 1964 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Simpson, Adrian – Educational Research and Evaluation, 2019
Effect size is the basis of much evidence-based education policymaking. In particular, it is assumed to measure the educational effectiveness of interventions. Policy is being driven by the influential work of John Hattie, the Education Endowment Foundation, and others, which is grounded in this assumption. This article demonstrates the assumption…
Descriptors: Effect Size, Educational Policy, Educational Research, Policy Formation
Scott, Paul Wesley – Practical Assessment, Research & Evaluation, 2019
Two approaches to causal inference in the presence of non-random assignment are presented: The Propensity Score approach which pseudo-randomizes by balancing groups on observed propensity to be in treatment, and the Endogenous Treatment Effects approach which utilizes systems of equations to explicitly model selection into treatment. The three…
Descriptors: Causal Models, Statistical Inference, Probability, Scores
Marcoulides, Katerina M. – Measurement: Interdisciplinary Research and Perspectives, 2019
Longitudinal data analysis has received widespread interest throughout educational, behavioral, and social science research, with latent growth curve modeling currently being one of the most popular methods of analysis. Despite the popularity of latent growth curve modeling, limited attention has been directed toward understanding the issues of…
Descriptors: Reliability, Longitudinal Studies, Growth Models, Structural Equation Models
Schweizer, Karl; Reiß, Siegbert; Troche, Stefan – Educational and Psychological Measurement, 2019
The article reports three simulation studies conducted to find out whether the effect of a time limit for testing impairs model fit in investigations of structural validity, whether the representation of the assumed source of the effect prevents impairment of model fit and whether it is possible to identify and discriminate this method effect from…
Descriptors: Timed Tests, Testing, Barriers, Testing Problems
Nicewander, W. Alan – Educational and Psychological Measurement, 2019
This inquiry is focused on three indicators of the precision of measurement--conditional on fixed values of ?, the latent variable of item response theory (IRT). The indicators that are compared are (1) The traditional, conditional standard errors, s(eX|?) = CSEM; (2) the IRT-based conditional standard errors, s[subscript irt](eX|?)=C[subscript…
Descriptors: Measurement, Accuracy, Scores, Error of Measurement
Teker, Gülsen Tasdelen; Güler, Nese – International Journal of Assessment Tools in Education, 2019
One of the important theories in education and psychology is Generalizability (G) Theory and various properties distinguish it from the other measurement theories. To better understand methodological trends of G theory, a thematic content analysis was conducted. This study analyzes the studies using generalizability theory in the field of…
Descriptors: Generalizability Theory, Content Analysis, Foreign Countries, Education
Lee, Hyung Rock; Lee, Sunbok; Sung, Jaeyun – International Journal of Assessment Tools in Education, 2019
Applying single-level statistical models to multilevel data typically produces underestimated standard errors, which may result in misleading conclusions. This study examined the impact of ignoring multilevel data structure on the estimation of item parameters and their standard errors of the Rasch, two-, and three-parameter logistic models in…
Descriptors: Item Response Theory, Computation, Error of Measurement, Test Bias
Lu, Rui; Keller, Bryan Sean – AERA Online Paper Repository, 2019
When estimating an average treatment effect with observational data, it's possible to get an unbiased estimate of the causal effect if all confounding variables are observed and reliably measured. In education, confounding variables are often latent constructs. Covariate selection methods used in causal inference applications assume that all…
Descriptors: Factor Analysis, Predictor Variables, Monte Carlo Methods, Comparative Analysis
Wang, Chun; Xu, Gongjun; Zhang, Xue – Grantee Submission, 2019
When latent variables are used as outcomes in regression analysis, a common approach that is used to solve the ignored measurement error issue is to take a multilevel perspective on item response modeling (IRT). Although recent computational advancement allow efficient and accurate estimation of multilevel IRT models, we argue that a two-stage…
Descriptors: Error of Measurement, Item Response Theory, Regression (Statistics), Evaluation Methods
Wang, Chun; Zhang, Xue – Grantee Submission, 2019
The relations among alternative parameterizations of the binary factor analysis (FA) model and two-parameter logistic (2PL) item response theory (IRT) model have been thoroughly discussed in literature (e.g., Lord & Novick, 1968; Takane & de Leeuw, 1987; McDonald, 1999; Wirth & Edwards, 2007; Kamata & Bauer, 2008). However, the…
Descriptors: Test Items, Error of Measurement, Item Response Theory, Factor Analysis
Qian, Jiahe – ETS Research Report Series, 2020
The finite population correction (FPC) factor is often used to adjust variance estimators for survey data sampled from a finite population without replacement. As a replicated resampling approach, the jackknife approach is usually implemented without the FPC factor incorporated in its variance estimates. A paradigm is proposed to compare the…
Descriptors: Computation, Sampling, Data, Statistical Analysis
Esarey, Justin; Valdes, Natalie – Assessment & Evaluation in Higher Education, 2020
Scholarly debate about student evaluations of teaching (SETs) often focuses on whether SETs are valid, reliable and unbiased. In this article, we assume the most optimistic conditions for SETs that are supported by the empirical literature. Specifically, we assume that SETs are moderately correlated with teaching quality (student learning and…
Descriptors: Student Evaluation of Teacher Performance, Bias, Reliability, Validity
Lichtenstein, Robert – Communique, 2020
Appropriate interpretation of assessment data requires an appreciation that tools are subject to measurement error. School psychologists recognize, at least on an intellectual level, that measures are imperfect--that test scores and other quantitative measures (e.g., rating scales, systematic behavioral observations) are best estimates of…
Descriptors: Error of Measurement, Test Reliability, Pretests Posttests, Standardized Tests
Joyner, Rachel E.; Wagner, Richard K. – Scientific Studies of Reading, 2020
Despite the importance of identifying individuals with reading disabilities, existing operational definitions of reading disability do not result in reliable identification. A large part of the problem arises from measurement error when a cut-point is imposed on a continuous distribution, especially for low base-rate conditions. One way to reduce…
Descriptors: Meta Analysis, Reading Difficulties, Mathematics, Learning Disabilities
Uhrig, S. C. Noah; Watson, Nicole – Sociological Methods & Research, 2020
Test-retest reliability assessments rarely investigate whether reliability itself is stable or change in reliability affects findings from substantive models. Research across the social sciences often recognizes that measurement error could influence results, yet it rarely applies established error correction methods. Focusing on gender wage…
Descriptors: Error of Measurement, Wages, National Surveys, Foreign Countries

Peer reviewed
Direct link
