Publication Date
| In 2026 | 0 |
| Since 2025 | 53 |
| Since 2022 (last 5 years) | 411 |
| Since 2017 (last 10 years) | 914 |
| Since 2007 (last 20 years) | 1965 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
DeMars, Christine E. – Educational and Psychological Measurement, 2008
The graded response (GR) and generalized partial credit (GPC) models do not imply that examinees ordered by raw observed score will necessarily be ordered on the expected value of the latent trait (OEL). Factors were manipulated to assess whether increased violations of OEL also produced increased Type I error rates in differential item…
Descriptors: Test Items, Raw Scores, Test Theory, Error of Measurement
Kim, Seonghoon; Feldt, Leonard S. – Journal of Educational Measurement, 2008
This article extends the Bonett (2003a) approach to testing the equality of alpha coefficients from two independent samples to the case of m [greater than or equal] 2 independent samples. The extended Fisher-Bonett test and its competitor, the Hakstian-Whalen (1976) test, are illustrated with numerical examples of both hypothesis testing and power…
Descriptors: Tests, Comparative Analysis, Hypothesis Testing, Error of Measurement
Olsen, Robert B.; Unlu, Fatih; Price, Cristofer; Jaciw, Andrew P. – National Center for Education Evaluation and Regional Assistance, 2011
This report examines the differences in impact estimates and standard errors that arise when these are derived using state achievement tests only (as pre-tests and post-tests), study-administered tests only, or some combination of state- and study-administered tests. State tests may yield different evaluation results relative to a test that is…
Descriptors: Achievement Tests, Standardized Tests, State Standards, Reading Achievement
Isenberg, Eric; Hock, Heinrich – Mathematica Policy Research, Inc., 2011
This report presents the value-added models that will be used to measure school and teacher effectiveness in the District of Columbia Public Schools (DCPS) in the 2010-2011 school year. It updates the earlier technical report, "Measuring Value Added for IMPACT and TEAM in DC Public Schools." The earlier report described the methods used…
Descriptors: Public Schools, Teacher Effectiveness, School Effectiveness, Models
Rogers, W. Todd; Lin, Jie; Rinaldi, Christia M. – Applied Measurement in Education, 2011
The evidence gathered in the present study supports the use of the simultaneous development of test items for different languages. The simultaneous approach used in the present study involved writing an item in one language (e.g., French) and, before moving to the development of a second item, translating the item into the second language (e.g.,…
Descriptors: Test Items, Item Analysis, Achievement Tests, French
Teasley, C.E. Wynn; Hornyak, Martin – American Journal of Business Education, 2010
The 2009 college football season is here, but there has been a continuing controversy swirling over how the Football Bowl Subdivision (FBS) selects its national champion. College football uses a multi-criterion decision matrix (MCDM) evaluation technique to determine which two teams will play for the national championship. We analyzed the BCS…
Descriptors: Business Administration, Business Administration Education, Team Sports, College Athletics
O'Toole, John Mitchell; King, Robert A. R. – Language Assessment Quarterly, 2010
This quantitative study intends to better understand the impact of the location of the first deleted word upon the estimation of text difficulty yielded by successive cloze tests based on random deletion from a single passage. The variation in sampling of language features across five cloze tests based on the same passage is random and thus not…
Descriptors: Cloze Procedure, Readability, Nouns, Figurative Language
Ardoin, Scott P.; Christ, Theodore J. – School Psychology Review, 2009
There are relatively few studies that evaluate the quality of progress monitoring estimates derived from curriculum-based measurement of reading. Those studies that are published provide initial evidence for relatively large magnitudes of standard error relative to the expected magnitude of weekly growth. A major contributor to the observed…
Descriptors: Curriculum Based Assessment, Oral Reading, Reading Tests, Formative Evaluation
Klassen, Robert M.; Bong, Mimi; Usher, Ellen L.; Chong, Wan Har; Huan, Vivien S.; Wong, Isabella Y. F.; Georgiou, Tasos – Contemporary Educational Psychology, 2009
The purpose of this article was twofold. The first purpose was to test the validity of the Teachers' Sense of Self-Efficacy Scale (TSES) in five settings--Canada, Cyprus, Korea, Singapore, and the United States. The second purpose was, by extension, to establish the importance of the teacher self-efficacy construct across diverse teaching…
Descriptors: Foreign Countries, Teaching Conditions, Validity, Teacher Characteristics
Murphy, Richard; Weinhardt, Felix – Centre for Economic Performance, 2013
We find an individual's rank within their reference group has effects on later objective outcomes. To evaluate the impact of local rank, we use a large administrative dataset tracking over two million students in England from primary through to secondary school. Academic rank within primary school has sizable, robust and significant effects on…
Descriptors: Foreign Countries, Class Rank, Progress Monitoring, Effect Size
Stubbe, Tobias C. – Educational Research and Evaluation, 2011
The challenge inherent in cross-national research of providing instruments in different languages measuring the same construct is well known. But even instruments in a single language may be biased towards certain countries or regions due to local linguistic specificities. Consequently, it may be appropriate to use different versions of an…
Descriptors: Test Items, International Studies, Foreign Countries, German
van der Ark, L. Andries; Croon, Marcel A.; Sijtsma, Klaas – Psychometrika, 2008
Scalability coefficients play an important role in Mokken scale analysis. For a set of items, scalability coefficients have been defined for each pair of items, for each individual item, and for the entire scale. Hypothesis testing with respect to these scalability coefficients has not been fully developed. This study introduces marginal modelling…
Descriptors: Hypothesis Testing, Item Response Theory, Error of Measurement, Scaling
Battauz, Michela; Bellio, Ruggero; Gori, Enrico – Psychometrika, 2008
The achievement level is a variable measured with error, that can be estimated by means of the Rasch model. Teacher grades also measure the achievement level but they are expressed on a different scale. This paper proposes a method for combining these two scores to obtain a synthetic measure of the achievement level based on the theory developed…
Descriptors: Academic Achievement, Measurement, Error of Measurement, Computation
Sass, D. A.; Schmitt, T. A.; Walker, C. M. – Applied Measurement in Education, 2008
Item response theory (IRT) procedures have been used extensively to study normal latent trait distributions and have been shown to perform well; however, less is known concerning the performance of IRT with non-normal latent trait distributions. This study investigated the degree of latent trait estimation error under normal and non-normal…
Descriptors: Difficulty Level, Item Response Theory, Test Items, Computation
DeVoe, Jill Fleury; Bauer, Lynn – National Center for Education Statistics, 2010
Student victimization in schools is a major concern of educators, policymakers, administrators, parents, and students. Understanding the scope of the criminal victimization of students, as well as the factors associated with it, is an essential step in developing solutions to address the issues of school crime and violence. This report uses data…
Descriptors: Weapons, Crime, Bullying, Criminals

Peer reviewed
Direct link
