Publication Date
| In 2026 | 0 |
| Since 2025 | 53 |
| Since 2022 (last 5 years) | 411 |
| Since 2017 (last 10 years) | 914 |
| Since 2007 (last 20 years) | 1965 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Regional Educational Laboratory Mid-Atlantic, 2023
The "Stabilizing Subgroup Proficiency Results to Improve the Identification of Low-Performing Schools" study used Bayesian stabilization to improve the reliability (long-term stability) of subgroup proficiency measures that the Pennsylvania Department of Education (PDE) uses to identify schools for Targeted Support and Improvement (TSI)…
Descriptors: At Risk Students, Low Achievement, Error of Measurement, Measurement Techniques
López-López, José Antonio; Van den Noortgate, Wim; Tanner-Smith, Emily E.; Wilson, Sandra Jo; Lipsey, Mark W. – Research Synthesis Methods, 2017
Dependent effect sizes are ubiquitous in meta-analysis. Using Monte Carlo simulation, we compared the performance of 2 methods for meta-regression with dependent effect sizes--robust variance estimation (RVE) and 3-level modeling--with the standard meta-analytic method for independent effect sizes. We further compared bias-reduced linearization…
Descriptors: Effect Size, Regression (Statistics), Meta Analysis, Comparative Analysis
Park, Ryoungsun; Kim, Jiseon; Chung, Hyewon; Dodd, Barbara G. – Educational and Psychological Measurement, 2017
The current study proposes novel methods to predict multistage testing (MST) performance without conducting simulations. This method, called MST test information, is based on analytic derivation of standard errors of ability estimates across theta levels. We compared standard errors derived analytically to the simulation results to demonstrate the…
Descriptors: Testing, Performance, Prediction, Error of Measurement
Carroll, Ian A. – ProQuest LLC, 2017
Item exposure control is, relative to adaptive testing, a nascent concept that has emerged only in the last two to three decades on an academic basis as a practical issue in high-stakes computerized adaptive tests. This study aims to implement a new strategy in item exposure control by incorporating the standard error of the ability estimate into…
Descriptors: Test Items, Computer Assisted Testing, Selection, Adaptive Testing
Algesheimer, René; Bagozzi, Richard P.; Dholakia, Utpal M. – Sociological Methods & Research, 2018
We offer a new conceptualization and measurement models for constructs at the group-level of analysis in small group research. The conceptualization starts with classical notions of group behavior proposed by Tönnies, Simmel, and Weber and then draws upon plural subject theory by philosophers Gilbert and Tuomela to frame a new perspective…
Descriptors: Models, Groups, Group Behavior, Theories
Huang, Francis L. – Journal of Experimental Education, 2018
Studies analyzing clustered data sets using both multilevel models (MLMs) and ordinary least squares (OLS) regression have generally concluded that resulting point estimates, but not the standard errors, are comparable with each other. However, the accuracy of the estimates of OLS models is important to consider, as several alternative techniques…
Descriptors: Hierarchical Linear Modeling, Least Squares Statistics, Regression (Statistics), Comparative Analysis
Menéndez-Varela, José-Luis; Gregori-Giralt, Eva – Assessment & Evaluation in Higher Education, 2018
Rubrics are widely used in higher education to assess performance in project-based learning environments. To date, the sources of error that may affect their reliability have not been studied in depth. Using generalisability theory as its starting-point, this article analyses the influence of the assessors and the criteria of the rubrics on the…
Descriptors: Scoring Rubrics, Student Projects, Active Learning, Reliability
Perry, Thomas – Research Papers in Education, 2019
A compositional effect is when pupil attainment is associated with the characteristics of their peers, over and above their own individual characteristics. Pupils at academically selective schools, for example, tend to out-perform similar-ability pupils who are educated with mixed-ability peers. Previous methodological studies however have shown…
Descriptors: Value Added Models, Correlation, Individual Characteristics, Peer Influence
Fujiki, Martin; Brinton, Bonnie; Hart, Craig H.; Olsen, Joseph; Coombs, Maille – Language, Speech, and Hearing Services in Schools, 2019
Purpose: Teacher ratings were used to compare children with developmental language disorders (DLD) and their typically developing peers on 2 subtypes of social withdrawal (shyness and unsociability). Measurement invariance analysis was utilized to determine if teachers rated the 2 groups using the same underlying construct for each of the rating…
Descriptors: Language Impairments, Withdrawal (Psychology), Teacher Attitudes, Comparative Analysis
Mao, Xiulin; Harring, Jeffrey R.; Hancock, Gregory R. – Educational and Psychological Measurement, 2015
Latent interaction models have motivated a great deal of methodological research, mainly in the area of estimating such models. Product-indicator methods have been shown to be competitive with other methods of estimation in terms of parameter bias and standard error accuracy, and their continued popularity in empirical studies is due, in part, to…
Descriptors: Structural Equation Models, Error of Measurement, Algebra, Statistical Analysis
Hansen, Bruce E. – Journal of Economic Education, 2017
The field of econometrics largely started with time series analysis because many early datasets were time-series macroeconomic data. As the field developed, more cross-sectional and longitudinal datasets were collected, which today dominate the majority of academic empirical research. In nonacademic (private sector, central bank, and governmental)…
Descriptors: Economics, Economics Education, Undergraduate Students, College Instruction
Gingerich, Andrea; Ramlo, Susan E.; van der Vleuten, Cees P. M.; Eva, Kevin W.; Regehr, Glenn – Advances in Health Sciences Education, 2017
Whenever multiple observers provide ratings, even of the same performance, inter-rater variation is prevalent. The resulting "idiosyncratic rater variance" is considered to be unusable error of measurement in psychometric models and is a threat to the defensibility of our assessments. Prior studies of inter-rater variation in clinical…
Descriptors: Interrater Reliability, Error of Measurement, Psychometrics, Q Methodology
Finch, Holmes – Psicologica: International Journal of Methodology and Experimental Psychology, 2017
Multilevel models (MLMs) have proven themselves to be very useful in social science research, as data from a variety of sources is sampled such that individuals at level-1 are nested within clusters such as schools, hospitals, counseling centers, and business entities at level-2. MLMs using restricted maximum likelihood estimation (REML) provide…
Descriptors: Hierarchical Linear Modeling, Comparative Analysis, Computation, Robustness (Statistics)
Dogucu, Mine – ProQuest LLC, 2017
When researchers fit statistical models to multiply imputed datasets, they have to fit the model separately for each imputed dataset. Since there are multiple datasets, there are always multiple sets of model results. It is possible for some of these sets of results not to converge while some do converge. This study examined occurrence of such a…
Descriptors: Statistical Analysis, Error of Measurement, Goodness of Fit, Monte Carlo Methods
Maun, Deepak; Shukla, Kathan Dushyant; Chand, Vijaya Sherry – Cogent Education, 2020
Non-cognitive competencies play critical role in education. Interventions targeted at these competencies require access to valid measures. Within Indian context, no such validated measure exists at present, and teachers depend on their subjective evaluation of students' non-cognitive competencies represented via grade or comment on report card.…
Descriptors: Test Validity, Test Reliability, Error of Measurement, Factor Analysis

Peer reviewed
Direct link
