Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 2 |
| Since 2007 (last 20 years) | 13 |
Descriptor
| Measurement | 14 |
| Models | 14 |
| Sampling | 12 |
| Foreign Countries | 5 |
| Error of Measurement | 4 |
| Computation | 3 |
| Context Effect | 3 |
| Correlation | 3 |
| Evaluation Methods | 3 |
| Scores | 3 |
| Statistical Analysis | 3 |
| More ▼ | |
Source
Author
Publication Type
| Journal Articles | 11 |
| Reports - Research | 7 |
| Reports - Evaluative | 5 |
| Tests/Questionnaires | 2 |
| Numerical/Quantitative Data | 1 |
| Reports - Descriptive | 1 |
Education Level
| Secondary Education | 3 |
| Grade 7 | 2 |
| Grade 8 | 2 |
| Higher Education | 2 |
| Junior High Schools | 2 |
| Middle Schools | 2 |
| Elementary Education | 1 |
| Grade 10 | 1 |
| Grade 4 | 1 |
| Grade 6 | 1 |
| Grade 9 | 1 |
| More ▼ | |
Audience
Laws, Policies, & Programs
Assessments and Surveys
| National Assessment of… | 1 |
What Works Clearinghouse Rating
Zhou, Todd; Jiao, Hong – Educational and Psychological Measurement, 2023
Cheating detection in large-scale assessment received considerable attention in the extant literature. However, none of the previous studies in this line of research investigated the stacking ensemble machine learning algorithm for cheating detection. Furthermore, no study addressed the issue of class imbalance using resampling. This study…
Descriptors: Cheating, Measurement, Artificial Intelligence, Algorithms
Damrongpanit, Suntonrapot – Universal Journal of Educational Research, 2019
The purposes of this study were to test the structural validity and to test the parameters invariance of the self-discipline measurement model for good student citizenship among the models, using the data from the 1,047 complete questionnaires and the reducing length questionnaires with multiple matrix sampling technique. The sample size of this…
Descriptors: Factor Structure, Questionnaires, Test Length, Citizenship
Hecht, Martin; Weirich, Sebastian; Siegle, Thilo; Frey, Andreas – Educational and Psychological Measurement, 2015
Multiple matrix designs are commonly used in large-scale assessments to distribute test items to students. These designs comprise several booklets, each containing a subset of the complete item pool. Besides reducing the test burden of individual students, using various booklets allows aligning the difficulty of the presented items to the assumed…
Descriptors: Measurement, Item Sampling, Statistical Analysis, Models
Strietholt, Rolf; Rosén, Monica; Bos, Wilfried – Large-scale Assessments in Education, 2013
Background: Since the early days of international large-scale assessments, an overarching aim has been to use the world as an educational laboratory so countries can learn from one another and develop educational systems further. Cross-sectional comparisons across countries as well as trend studies derive from the assumption that there are…
Descriptors: Measurement, International Assessment, Foreign Countries, Sampling
Gu, Fei; Skorupski, William P.; Hoyle, Larry; Kingston, Neal M. – Applied Psychological Measurement, 2011
Ramsay-curve item response theory (RC-IRT) is a nonparametric procedure that estimates the latent trait using splines, and no distributional assumption about the latent trait is required. For item parameters of the two-parameter logistic (2-PL), three-parameter logistic (3-PL), and polytomous IRT models, RC-IRT can provide more accurate estimates…
Descriptors: Intervals, Item Response Theory, Models, Evaluation Methods
Ludtke, Oliver; Marsh, Herbert W.; Robitzsch, Alexander; Trautwein, Ulrich – Psychological Methods, 2011
In multilevel modeling, group-level variables (L2) for assessing contextual effects are frequently generated by aggregating variables from a lower level (L1). A major problem of contextual analyses in the social sciences is that there is no error-free measurement of constructs. In the present article, 2 types of error occurring in multilevel data…
Descriptors: Simulation, Educational Psychology, Social Sciences, Measurement
Hjalmarson, Margret A.; Moore, Tamara J.; delMas, Robert – Statistics Education Research Journal, 2011
Results of analysis of responses to a first-year undergraduate engineering activity are presented. Teams of students were asked to develop a procedure for quantifying the roughness of a surface at the nanoscale, which is typical of problems in Materials Engineering where qualities of a material need to be quantified. Thirty-five teams were…
Descriptors: College Freshmen, Engineering, Laboratories, Learning Activities
Jia, Yue; Stokes, Lynne; Harris, Ian; Wang, Yan – Journal of Educational and Behavioral Statistics, 2011
In this article, we consider estimation of parameters of random effects models from samples collected via complex multistage designs. Incorporation of sampling weights is one way to reduce estimation bias due to unequal probabilities of selection. Several weighting methods have been proposed in the literature for estimating the parameters of…
Descriptors: Sampling, Computation, Statistical Bias, Statistical Analysis
Marsh, Herbert W.; Ludtke, Oliver; Nagengast, Benjamin; Trautwein, Ulrich; Morin, Alexandre J. S.; Abduljabbar, Adel S.; Koller, Olaf – Educational Psychologist, 2012
Classroom context and climate are inherently classroom-level (L2) constructs, but applied researchers sometimes--inappropriately--represent them by student-level (L1) responses in single-level models rather than more appropriate multilevel models. Here we focus on important conceptual issues (distinctions between climate and contextual variables;…
Descriptors: Foreign Countries, Classroom Environment, Educational Research, Research Design
Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2011
This technical document provides guidance to educators on the creation and interpretation of survey instruments, particularly as they relate to an analysis of program implementation. Illustrative examples are drawn from a survey of educators related to the use of the easyCBM learning system. This document includes specific sections on…
Descriptors: Surveys, Program Implementation, Curriculum Based Assessment, Sampling
Marsh, Herbert W.; Ludtke, Oliver; Robitzsch, Alexander; Trautwein, Ulrich; Asparouhov, Tihomir; Muthen, Bengt; Nagengast, Benjamin – Multivariate Behavioral Research, 2009
This article is a methodological-substantive synergy. Methodologically, we demonstrate latent-variable contextual models that integrate structural equation models (with multiple indicators) and multilevel models. These models simultaneously control for and unconfound measurement error due to sampling of items at the individual (L1) and group (L2)…
Descriptors: Educational Environment, Context Effect, Models, Structural Equation Models
Silvia, Suyapa; Blitstein, Jonathan; Williams, Jason; Ringwalt, Chris; Dusenbury, Linda; Hansen, William – National Center for Education Evaluation and Regional Assistance, 2010
This is the first of two reports that summarize the findings from an impact evaluation of a violence prevention intervention for middle schools. This report discusses findings after 1 year of implementation. A forthcoming report will discuss the findings after 2 years and 3 years of implementation. In 2004, the U.S. Department of Education (ED)…
Descriptors: Middle Schools, Violence, Prevention, Intervention
Peer reviewedJackson, Paul H. – Psychometrika, 1973
This paper deals with the situation where scores on a number of parallel tests are obtained for each of a set of persons, and these persons are assumed to constitute, in so far as their scores for the tests are concerned, a random sample from some population of interest. (Author)
Descriptors: Analysis of Variance, Bayesian Statistics, Measurement, Models
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources

Direct link
