Publication Date
| In 2026 | 0 |
| Since 2025 | 53 |
| Since 2022 (last 5 years) | 411 |
| Since 2017 (last 10 years) | 914 |
| Since 2007 (last 20 years) | 1965 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Peer reviewedCarver, Ronald P. – Journal of Experimental Education, 1993
Four things are recommended to minimize the influence or importance of statistical significance testing. Researchers must not neglect to add "statistical" to significant and could interpret results before giving p-values. Effect sizes should be reported with measures of sampling error, and replication can be built into the design. (SLD)
Descriptors: Educational Researchers, Effect Size, Error of Measurement, Research Methodology
Peer reviewedHanson, Bradley A.; And Others – Applied Psychological Measurement, 1993
The delta method was used to derive standard errors (SES) of the Levine observed score and Levine true score linear test equating methods using data from two test forms. SES derived without the normality assumption and bootstrap SES were very close. The situation with skewed score distributions is also discussed. (SLD)
Descriptors: Equated Scores, Equations (Mathematics), Error of Measurement, Sampling
Peer reviewedCernovsky, Zack Z. – Journal of Black Psychology, 1993
Reviews J. P. Rushton's data in "Race Differences in Behavior: A Review and Evolutionary Analysis" (1988), and suggests that aggregating large cohorts of methodologically weak studies results in misleading conclusions. A reanalysis of Rushton's data shows that cranial size is not a feasible indicator of intelligence and is similar in…
Descriptors: Blacks, Data Analysis, Error of Measurement, Intelligence
Peer reviewedZinn, Sandra; McCumber, Stacey; Dahlstrom, W. Grant – Assessment, 1999
Cross-validated the IMM scale of the Minnesota Multiphasic Personality Inventory-Adolescents (MMPI-A), a measure of ego level, with 151 college students. Means and standard deviations were obtained on IMM scale from the MMPI-A and another MMPI version for males and females. (SLD)
Descriptors: Adolescents, College Students, Error of Measurement, Higher Education
Peer reviewedHarris, Diana K.; And Others – Educational Gerontology, 1996
Multiple-choice and true-false versions of Palmore's first Facts on Aging Quiz were completed by 501 college students. Multiple choice reduced the chances of guessing and had less measurement error for average and above-average respondents. (Author/SK)
Descriptors: Aging (Individuals), College Students, Error of Measurement, Guessing (Tests)
Peer reviewedOgasawara, Haruhiko – Journal of Educational and Behavioral Statistics, 2001
Provides asymptotic standard errors of the estimates of equated scores from several types of item response theory (IRT) true score equatings. Equating designs considered cover those with internal or external common items and separate or simultaneous estimation. Uses marginal maximum likelihood estimation for the estimation of item parameters. (SLD)
Descriptors: Equated Scores, Error of Measurement, Estimation (Mathematics), Item Response Theory
Peer reviewedCamilli, Greg – Education Policy Analysis Archives, 1996
Why the standard error must serve as a standard against which educational gains are measured is discussed from a policy analysis perspective, considering technical and policy levels. An online discussion of the issues in the use of the standard error is transcribed and attached. (SLD)
Descriptors: Achievement Gains, Educational Assessment, Error of Measurement, Online Systems
Peer reviewedNering, Michael L. – Applied Psychological Measurement, 1997
Evaluated the distribution of person fit within the computerized-adaptive testing (CAT) environment through simulation. Found that, within the CAT environment, these indexes tend not to follow a standard normal distribution. Person fit indexes had means and standard deviations that were quite different from the expected. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Error of Measurement, Item Response Theory
Peer reviewedLindner, James R.; Murphy, Tim H.; Briers, Gary E. – Journal of Agricultural Education, 2001
Content analysis of 364 articles in the Journal of Agricultural Education 1990-1999 indicated that a majority of studies did not mention nonresponse error as a threat to external validity, did not attempt to control for nonresponse error, or did not cite literature on handling it. Protocols to address nonresponse error were proposed. (Contains 24…
Descriptors: Agricultural Education, Citations (References), Error of Measurement, Research Problems
Peer reviewedEid, Michael; Diener, Ed – Social Indicators Research, 2004
Subjective well-being (SWB) is an important indicator of quality of life. SWB can be conceptualized as a momentary state (e.g., mood) as well as a relatively stable trait (e.g., life satisfaction). The validity of self-reported trait aspects of SWB has been questioned by experimental studies showing that SWB judgments seem to be strongly context…
Descriptors: Organizations (Groups), Psychological Patterns, Personality, Measurement
Newton, Paul E. – British Educational Research Journal, 2005
Assessment agencies are increasingly facing pressure on two fronts; first, to increase transparency and openness and second, to improve public confidence. Yet, in relation to one of the central concepts of educational measurement--inherent error--many believe that increased public understanding is incompatible with public confidence: a general…
Descriptors: Measurement Techniques, Inferences, Ethics, Public Opinion
Peer reviewedKennedy, Peter E. – Journal of Economic Education, 2005
Getting a "wrong" sign in empirical work is a common phenomenon. Remarkably, econometrics textbooks provide very little information to practitioners on how this problem can arise. The author exposits a long list of ways in which a wrong sign can occur and how it might be corrected.
Descriptors: Economics, Economic Research, Research Methodology, Economic Impact
Van den Noortgate, Wim; Opdenakker, Marie-Christine; Onghena, Patrick – School Effectiveness and School Improvement, 2005
Ignoring a level can have a substantial impact on the conclusions of a multilevel analysis. For intercept-only models and for balanced data, we derive these effects analytically. For more complex random intercept models or for unbalanced data, a simulation study is performed. Most important effects concern estimates and corresponding standard…
Descriptors: Simulation, Educational Research, Computation, Error of Measurement
Yuan, Ke-Hai; Bentler, Peter M. – Educational and Psychological Measurement, 2004
In mean and covariance structure analysis, the chi-square difference test is often applied to evaluate the number of factors, cross-group constraints, and other nested model comparisons. Let model M[a] be the base model within which model M[b] is nested. In practice, this test is commonly used to justify M[b] even when M[a] is misspecified. The…
Descriptors: Statistical Significance, Item Response Theory, Computation, Statistical Analysis
DeMars, Christine E. – Educational and Psychological Measurement, 2005
Type I error rates for PARSCALE's fit statistic were examined. Data were generated to fit the partial credit or graded response model, with test lengths of 10 or 20 items. The ability distribution was simulated to be either normal or uniform. Type I error rates were inflated for the shorter test length and, for the graded-response model, also for…
Descriptors: Test Length, Item Response Theory, Psychometrics, Error of Measurement

Direct link
