Showing 1 to 15 of 92 results
Peer reviewed
Zumbo, Bruno D.; Kroc, Edward – Educational and Psychological Measurement, 2019
Chalmers recently published a critique of the use of ordinal alpha, proposed in Zumbo et al., as a measure of test reliability in certain research settings. In this response, we take up the task of refuting Chalmers' critique. We identify three broad misconceptions that characterize Chalmers' criticisms: (1) confusing assumptions with…
Descriptors: Test Reliability, Statistical Analysis, Misconceptions, Mathematical Models
Peer reviewed
Benton, Tom; Elliott, Gill – Research Papers in Education, 2016
In recent years the use of expert judgement to set and maintain examination standards has been increasingly criticised in favour of approaches based on statistical modelling. This paper reviews existing research on this controversy and attempts to unify the evidence within a framework where expertise is utilised in the form of comparative…
Descriptors: Reliability, Expertise, Mathematical Models, Standard Setting (Scoring)
Peer reviewed
Osler, James Edward, II – Journal of Educational Technology, 2015
This monograph provides an epistemological rationale for the Accumulative Manifold Validation Analysis [also referred to by the acronym "AMOVA"] statistical methodology designed to test psychometric instruments. This form of inquiry is a form of mathematical optimization in the discipline of linear stochastic modelling. AMOVA is an in-depth…
Descriptors: Statistical Analysis, Test Validity, Test Reliability, Inquiry
Peer reviewed
Hung, Su-Pin; Chen, Po-Hsi; Chen, Hsueh-Chih – Creativity Research Journal, 2012
Product assessment is widely applied in creative studies, typically as an important dependent measure. Within this context, this study had 2 purposes. First, the focus of this research was on methods for investigating possible rater effects, an issue that has not received a great deal of attention in past creativity studies. Second, the…
Descriptors: Item Response Theory, Creativity, Interrater Reliability, Undergraduate Students
Peer reviewed
Osler, James Edward; Mansaray, Mahmud A. – Journal of Educational Technology, 2013
The online deployment of Technology Engineered online Student Ratings of Instruction (SRIs) by colleges and universities in the United States has dynamically changed how course evaluation is conducted. This research investigation is the fourth part of a post hoc study that analytically and psychometrically examines the design, reliability, and…
Descriptors: Course Evaluation, Educational Technology, Black Colleges, Higher Education
Larsson, Bernt – 1974
This report gives some simple examples of stability for one factor and 2 x 2 factorial analysis of variance, reliability and correlations. The findings are very different: from superstability (no transformation whatsoever can change the result) to almost total instability. This is followed by a discussion of applications to multivariate analysis,…
Descriptors: Analysis of Variance, Correlation, Discriminant Analysis, Factor Analysis
Kane, Michael T.; Brennan, Robert L. – 1977
A large number of seemingly diverse coefficients have been proposed as indices of dependability, or reliability, for domain-referenced and/or mastery tests. In this paper, it is shown that most of these indices are special cases of two generalized indices of agreement: one that is corrected for chance, and one that is not. The special cases of…
Descriptors: Bayesian Statistics, Correlation, Criterion Referenced Tests, Cutting Scores
Huynh, Huynh – 1977
The kappamax reliability index of domain-referenced tests is defined as the upper bound of kappa when all possible cutoff scores are considered. Computational procedures for kappamax are described, as well as its approximation for long tests, based on Kuder-Richardson formula 21. The sampling error of kappamax, and the effects of test length and…
Descriptors: Criterion Referenced Tests, Mathematical Models, Statistical Analysis, Test Reliability
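The kappamax definition in this entry can be sketched by brute force: classify examinees as master/non-master at every candidate cutoff and keep the largest resulting kappa. The function name and the two-administration setup below are assumptions for illustration, not Huynh's computational procedures or the long-test approximation:

```python
import numpy as np

def kappamax(x, y):
    """Upper bound of Cohen's kappa over all possible cutoff scores.

    x, y are scores for the same examinees on two administrations
    (or half-tests) of a domain-referenced test. At each candidate
    cutoff c, examinees are classified master/non-master and kappa
    is computed for the resulting 2x2 agreement table.
    """
    x, y = np.asarray(x), np.asarray(y)
    best = -1.0
    for c in np.unique(np.concatenate([x, y])):
        a, b = x >= c, y >= c
        p_o = np.mean(a == b)                               # observed agreement
        p_e = a.mean() * b.mean() + (1 - a.mean()) * (1 - b.mean())
        if p_e < 1.0:                                       # kappa undefined when p_e == 1
            best = max(best, (p_o - p_e) / (1 - p_e))
    return best

# Perfectly consistent classifications yield kappamax == 1
km = kappamax([1, 2, 3, 4, 5], [1, 2, 3, 4, 5])
```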
Peer reviewed
Gardner, P. L. – Journal of Educational Measurement, 1970
Descriptors: Error of Measurement, Mathematical Models, Statistical Analysis, Test Reliability
Peer reviewed
Serlin, Ronald C.; Marascuilo, Leonard A. – Journal of Educational Statistics, 1983
Two alternatives to the problems of conducting planned and post hoc comparisons in tests of concordance and discordance for G groups of judges are examined. The two models are illustrated using existing data. (Author/JKS)
Descriptors: Attitude Measures, Comparative Analysis, Interrater Reliability, Mathematical Models
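The planned and post hoc comparison procedures in this entry build on the base concordance statistic for a group of judges. A minimal sketch of Kendall's coefficient of concordance W (the standard no-ties formula; this is background for the entry, not the authors' comparison models):

```python
import numpy as np

def kendalls_w(ranks):
    """Kendall's coefficient of concordance W for m judges ranking n objects.

    `ranks` is an (m, n) array where each row is one judge's ranking
    of the n objects (1..n, no ties). W = 12*S / (m^2 * (n^3 - n)),
    where S is the sum of squared deviations of the column rank sums.
    """
    r = np.asarray(ranks, dtype=float)
    m, n = r.shape
    col_sums = r.sum(axis=0)
    s = ((col_sums - col_sums.mean()) ** 2).sum()
    return 12 * s / (m ** 2 * (n ** 3 - n))

# Three judges in perfect agreement over four objects -> W == 1
w = kendalls_w([[1, 2, 3, 4], [1, 2, 3, 4], [1, 2, 3, 4]])
```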
Peer reviewed
Umesh, U. N.; And Others – Educational and Psychological Measurement, 1989
An approach is provided for calculating maximum values of the Kappa statistic of J. Cohen (1960) as a function of observed agreement proportions between evaluators. Separate calculations are required for different matrix sizes and observed agreement levels. (SLD)
Descriptors: Equations (Mathematics), Evaluators, Heuristics, Interrater Reliability
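The idea in this entry, that kappa has a maximum attainable value determined by the marginal agreement proportions, follows from Cohen's definition: with the marginals fixed, the diagonal of the table can be no larger than the sum of the row/column minima. A sketch under that standard definition (the function name and example table are assumptions, not the authors' tabled calculations):

```python
import numpy as np

def kappa_and_max(table):
    """Cohen's kappa and its maximum attainable value given the marginals.

    `table` is a square contingency table of counts for two evaluators.
    The maximum observed agreement with fixed marginals is the sum of
    the minima of corresponding row and column proportions.
    """
    t = np.asarray(table, dtype=float)
    t /= t.sum()
    rows, cols = t.sum(axis=1), t.sum(axis=0)
    p_o = np.trace(t)                          # observed agreement
    p_e = float(rows @ cols)                   # chance agreement
    kappa = (p_o - p_e) / (1 - p_e)
    p_o_max = np.minimum(rows, cols).sum()     # best diagonal given marginals
    kappa_max = (p_o_max - p_e) / (1 - p_e)
    return kappa, kappa_max

# Example: 2x2 agreement table for two evaluators
k, k_max = kappa_and_max([[40, 10], [5, 45]])
```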
Peer reviewed
Peng, Chao-Ying J.; Subkoviak, Michael J. – Journal of Educational Measurement, 1980
Huynh (1976) suggested a method of approximating the reliability coefficient of a mastery test. The present study examines the accuracy of Huynh's approximation and also describes a computationally simpler approximation which appears to be generally more accurate than the former. (Author/RL)
Descriptors: Error of Measurement, Mastery Tests, Mathematical Models, Statistical Analysis
Subkoviak, Michael J. – 1976
A number of different definitions and indices of reliability for mastery tests have recently been proposed in an attempt to cope with possible lack of score variability that attenuates traditional coefficients. One promising index that has been suggested is the proportion of students in a group that are consistently assigned to the same mastery…
Descriptors: Criterion Referenced Tests, Mastery Tests, Mathematical Models, Scores
Ary, Donald; Karabinus, Robert – 1975
The power of a statistical test is, in part, a function of the reliability of the dependent variable being analyzed. The substitution of sigma squared divided by the reliability coefficient for sigma squared is proposed. This enables the researcher to incorporate dependent variable reliability information when determining the sample size required for a…
Descriptors: Hypothesis Testing, Mathematical Models, Measurement Techniques, Reliability
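The substitution described in this entry has a direct effect on sample-size planning: inflating the variance to sigma squared over the reliability inflates the required n by the reciprocal of the reliability. A sketch using a textbook two-sample normal-approximation formula (the function, its defaults, and the formula choice are assumptions for illustration, not the authors' exact procedure):

```python
import math

def n_per_group(delta, sigma, alpha_z=1.96, power_z=0.84, reliability=1.0):
    """Per-group n for a two-sample z-test on means, with observed-score
    variance inflated to sigma**2 / reliability as the entry proposes.

    alpha_z and power_z are standard normal quantiles (defaults:
    two-sided alpha = .05, power = .80).
    """
    var = sigma ** 2 / reliability         # unreliability inflates variance
    return math.ceil(2 * (alpha_z + power_z) ** 2 * var / delta ** 2)

# A perfectly reliable measure vs. one with reliability .80:
n_perfect = n_per_group(delta=5, sigma=10)
n_attenuated = n_per_group(delta=5, sigma=10, reliability=0.8)
```

With reliability .80 the required sample grows by a factor of 1/.80 = 1.25, which is the practical point of carrying reliability into the power calculation.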
Olejnik, Stephen F.; Porter, Andrew C. – 1978
The statistical properties of two methods of estimating gain scores for groups in quasi-experiments are compared: (1) gains in scores standardized separately for each group; and (2) analysis of covariance with estimated true pretest scores. The fan spread hypothesis is assumed for groups but not necessarily assumed for members of the groups.…
Descriptors: Academic Achievement, Achievement Gains, Analysis of Covariance, Analysis of Variance