ERIC - Search Results

Showing all 14 results

Alternatives to Weighted Item Fit Statistics for Establishing Measurement Invariance in Many Groups

Peer reviewed

Sean Joo; Montserrat Valdivia; Dubravka Svetina Valdivia; Leslie Rutkowski – Journal of Educational and Behavioral Statistics, 2024

Evaluating scale comparability in international large-scale assessments depends on measurement invariance (MI). The root mean square deviation (RMSD) is a standard method for establishing MI in several programs, such as the Programme for International Student Assessment and the Programme for the International Assessment of Adult Competencies.…

Descriptors: International Assessment, Monte Carlo Methods, Statistical Studies, Error of Measurement

A New Statistic for Evaluating Item Response Theory Models for Ordinal Data. CRESST Report 839

Cai, Li; Monroe, Scott – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014

We propose a new limited-information goodness of fit test statistic C₂ for ordinal IRT models. The construction of the new statistic lies formally between the M₂ statistic of Maydeu-Olivares and Joe (2006), which utilizes first and second order marginal probabilities, and the M*₂ statistic of Cai and Hansen…

Descriptors: Item Response Theory, Models, Goodness of Fit, Probability

Serious Doubts about School Effectiveness

Peer reviewed

Gorard, Stephen – British Educational Research Journal, 2010

This paper considers the model of school effectiveness (SE) currently dominant in research, policy and practice in England (although the concerns it raises are international). It shows, principally through consideration of initial and propagated error, that SE results cannot be relied upon. By considering the residual difference between the…

Descriptors: School Effectiveness, Foreign Countries, Scores, Educational Policy

Another Look at Inter-Rater Agreement. Research Report.

Zwick, Rebecca – 1986

Most currently used measures of inter-rater agreement for the nominal case incorporate a correction for "chance agreement." The definition of chance agreement is not the same for all coefficients, however. Three chance-corrected coefficients are Cohen's Kappa; Scott's Pi; and the S index of Bennett, Goldstein, and Alpert, which has…

Descriptors: Error of Measurement, Interrater Reliability, Mathematical Models, Measurement Techniques
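
The three chance-corrected coefficients named in this abstract differ only in how they define chance agreement. The sketch below is an illustration using the standard textbook definitions for two raters, not code or notation taken from the report; the function name `agreement_coefficients` and its arguments are hypothetical.

```python
import math
from collections import Counter


def agreement_coefficients(rater_a, rater_b, categories):
    """Return Cohen's kappa, Scott's pi, and Bennett's S for two raters.

    Assumes nominal ratings, one rating per rater per item, and that
    `categories` lists every category the raters could have used.
    """
    n = len(rater_a)
    if n == 0 or n != len(rater_b):
        raise ValueError("both raters must rate the same non-empty item set")

    # Observed proportion of items on which the raters agree.
    p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)

    # Cohen's kappa: chance agreement from each rater's own marginals.
    pe_kappa = sum((counts_a[c] / n) * (counts_b[c] / n) for c in categories)
    # Scott's pi: chance agreement from the pooled marginals of both raters.
    pe_pi = sum(((counts_a[c] + counts_b[c]) / (2 * n)) ** 2 for c in categories)
    # Bennett's S: chance agreement assumes all categories are equally likely.
    pe_s = 1 / len(categories)

    def corrected(p_chance):
        # The coefficient is undefined when chance agreement is already 1.
        if p_chance == 1:
            return math.nan
        return (p_observed - p_chance) / (1 - p_chance)

    return {
        "kappa": corrected(pe_kappa),
        "pi": corrected(pe_pi),
        "s": corrected(pe_s),
    }


if __name__ == "__main__":
    a = ["yes", "yes", "no", "no"]
    b = ["yes", "no", "no", "no"]
    print(agreement_coefficients(a, b, ["yes", "no"]))
```

The observed agreement is the same in all three cases; only the chance term changes, which is why the coefficients can disagree on the same data.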

Preliminary Assessment of Format-Specific Central Tendency and Leniency Error in Summated Rating Scales.

Peer reviewed

Bardo, J.W.; And Others – Perceptual and Motor Skills, 1982

Data for four-, five-, and seven-position Likert formats from 292 undergraduates showed systematic error varied among formats, i.e., central tendency errors tended to increase with increasing number of categories and to reduce variances expected. (Author)

Descriptors: Error of Measurement, Higher Education, Measurement Techniques, Rating Scales

Error Probabilities in Educational and Psychological Research.

Peer reviewed

Westermann, Rainer; Hager, Willi – Journal of Educational Statistics, 1986

The well-known problem of cumulating error probabilities is reconsidered from a general epistemological perspective, namely, the concepts of severity and of fairness of tests. It is shown that not only Type 1 but also Type 2 errors can cumulate. A new adjustment strategy is proposed and applied. (Author/JAZ)

Descriptors: Educational Research, Error of Measurement, Hypothesis Testing, Measurement Techniques
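
The cumulation of error probabilities described in this abstract can be illustrated with the usual textbook calculation. The sketch below assumes independent tests, which is a simplification; it does not reproduce the adjustment strategy the authors propose, and the function names are hypothetical.

```python
def prob_at_least_one_error(per_test_rate, n_tests):
    """Probability of at least one error across n independent tests.

    `per_test_rate` can be a Type 1 rate (alpha) or a Type 2 rate (beta);
    the arithmetic is the same, so both kinds of error cumulate.
    """
    if not 0.0 <= per_test_rate <= 1.0:
        raise ValueError("per_test_rate must be a probability")
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return 1.0 - (1.0 - per_test_rate) ** n_tests


def bonferroni_alpha(familywise_alpha, n_tests):
    """Per-test alpha under the classic Bonferroni adjustment."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return familywise_alpha / n_tests


if __name__ == "__main__":
    for k in (1, 3, 5, 10):
        unadjusted = prob_at_least_one_error(0.05, k)
        adjusted = prob_at_least_one_error(bonferroni_alpha(0.05, k), k)
        print(f"tests={k:2d}  unadjusted={unadjusted:.4f}  bonferroni={adjusted:.4f}")
```

Lowering the per-test alpha to control Type 1 cumulation reduces power on each test, so it tends to raise the cumulated Type 2 rate; that trade-off is the general concern the abstract points to.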

Measurement of Change and the Law of Initial Values: A Computer Simulation Study.

Peer reviewed

Jamieson, John – Educational and Psychological Measurement, 1995

Computer simulations indicate that the correlation between baseline and change, by itself, does not invalidate the use of gain scores to measure change, but when the negative correlation is accompanied by decrease in variance from pretest to posttest, covariance is a superior measure of change. (SLD)

Descriptors: Analysis of Covariance, Change, Computer Simulation, Correlation

Tests of Variance Equality When Distributions Differ in Form, Scale and Location.

Olejnik, Stephen F.; Algina, James – 1986

Sampling distributions for ten tests for comparing population variances in a two group design were generated for several combinations of equal and unequal sample sizes, population means, and group variances when distributional forms differed. The ten procedures included: (1) O'Brien's (OB); (2) O'Brien's with adjusted degrees of freedom; (3)…

Descriptors: Error of Measurement, Evaluation Methods, Measurement Techniques, Nonparametric Statistics

Some Cautionary Notes on the Specification and Interpretation of LISREL-type Structural Equation Models.

Baldwin, Beatrice – 1986

LISREL-type structural equation modeling is a powerful statistical technique that seems appropriate for social science variables which are complex and difficult to measure. The literature on the specification, estimation, and testing of such models is voluminous. The greatest proportion of this literature, however, focuses on the technical aspects…

Descriptors: Analysis of Covariance, Computer Software, Equations (Mathematics), Error of Measurement

Why Multivariate Methods Are Usually Vital in Research: Some Basic Concepts.

Thompson, Bruce – 1994

The present paper suggests that multivariate methods ought to be used more frequently in behavioral research and explores the potential consequences of failing to use multivariate methods when these methods are appropriate. The paper explores in detail two reasons why multivariate methods are usually vital. The first is that they limit the…

Descriptors: Analysis of Covariance, Behavioral Science Research, Causal Models, Correlation

Sampling Procedures Used for National Surveys of Public School Teachers--Problems and Possible Solutions.

Pena, Deagelia M.; Henderson, Ronald D. – 1986

The sampling of teachers for nationwide surveys offers a challenging endeavor in obtaining a representative and adequate sample to truly represent opinions of the teachers. Ten national surveys of public school teachers conducted between 1980 and 1985 are presented with respect to their sampling design and procedures. Concepts and theoretical…

Descriptors: Adults, Error of Measurement, Longitudinal Studies, Measurement Techniques

Cook, Linda L.; Petersen, Nancy S. – 1986

This paper examines how various equating methods are affected by: (1) sampling error; (2) sample characteristics; and (3) characteristics of anchor test items. It reviews empirical studies that investigated the invariance of equating transformations, and it discusses empirical and simulation studies that focus on how the properties of anchor tests…

Descriptors: Educational Research, Equated Scores, Error of Measurement, Evaluation Methods

An Empirical Comparison of Size and Power of Seven Methods for Analyzing Multivariate Data in the Two-Sample Case.

Hummel, Thomas J.; Johnston, Charles B. – 1986

This study investigated seven methods for analyzing multivariate group differences. Bonferroni t statistics, multivariate analysis of variance (MANOVA) followed by analysis of variance (ANOVA), and five other methods were studied using Monte Carlo methods. Methods were compared with respect to (1) experimentwise error rate; (2) power; (3) number…

Descriptors: Analysis of Variance, Comparative Analysis, Correlation, Differences

The Use and Effect of Caution Indices in Detecting Aberrant Patterns of Standard-Setting Recommendations.

Jaeger, Richard M.; Busch, John Christian – 1986

This study explores the use of the modified caution index (MCI) for identifying judges whose patterns of recommendations suggest that their judgments might be based on incomplete information, flawed reasoning, or inattention to their standard-setting tasks. It also examines the effect on test standards and passing rates when the test standards of…

Descriptors: Criterion Referenced Tests, Error of Measurement, Evaluation Methods, High Schools