ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	6
Since 2007 (last 20 years)	12

Descriptor

Classification	17
Statistical Analysis	17
Test Reliability	17
Test Validity	7
Foreign Countries	5
Accuracy	4
Reading Tests	4
Computation	3
Correlation	3
Evaluation Methods	3
Grade 3	3
Grade 6	3
Models	3
Test Construction	3
Test Items	3
Test Results	3
College Students	2
Curriculum Based Assessment	2
Cutting Scores	2
Error of Measurement	2
Ethics	2
Grade 2	2
Grade 4	2
Grade 5	2
Grade 7	2
More ▼

Source

Behavioral Research and…	2
Journal of Psychoeducational…	2
ACT, Inc.	1
Applied Psychological…	1
Chemistry Education Research…	1
Collected Essays on Learning…	1
Crime & Delinquency	1
Educational Evaluation and…	1
Educational and Psychological…	1
Eurasian Journal of…	1
Journal of Educational…	1
Practical Assessment,…	1
Sociological Methods &…	1
More ▼

Publication Type

Journal Articles	12
Reports - Research	12
Numerical/Quantitative Data	3
Reports - Descriptive	2
Guides - General	1
Guides - Non-Classroom	1
Information Analyses	1
Reports - Evaluative	1

Education Level

Secondary Education	5
Grade 6	4
Higher Education	4
Early Childhood Education	3
Elementary Education	3
Grade 3	3
Grade 7	3
Grade 8	3
Intermediate Grades	3
Middle Schools	3
Postsecondary Education	3
Primary Education	3
Grade 10	2
Grade 2	2
Grade 4	2
Grade 5	2
High Schools	2
Junior High Schools	2
Elementary Secondary Education	1
Grade 1	1
Grade 11	1
Grade 12	1
Grade 9	1
More ▼

Audience

Location

Canada	2
Germany	2
Pennsylvania (Pittsburgh)	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
Beck Anxiety Inventory	1
Center for Epidemiologic…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

Multiple-Component Measurement Instruments in Heterogeneous Populations: Is There a Single Coefficient Alpha?

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A.; Harrison, Michael; Menold, Natalja – Educational and Psychological Measurement, 2019

This note confronts the common use of a single coefficient alpha as an index informing about reliability of a multicomponent measurement instrument in a heterogeneous population. Two or more alpha coefficients could instead be meaningfully associated with a given instrument in finite mixture settings, and this may be increasingly more likely the…

Descriptors: Statistical Analysis, Test Reliability, Measures (Individuals), Computation

Determining Item Screening Criteria Using Cost-Benefit Analysis

Peer reviewed
PDF on ERIC

Download full text

Bashkov, Bozhidar M.; Clauser, Jerome C. – Practical Assessment, Research & Evaluation, 2019

Successful testing programs rely on high-quality test items to produce reliable scores and defensible exams. However, determining what statistical screening criteria are most appropriate to support these goals can be daunting. This study describes and demonstrates cost-benefit analysis as an empirical approach to determining appropriate screening…

Descriptors: Test Items, Test Reliability, Evaluation Criteria, Accuracy

Measurement of Latent Variables with Different Rating Scales: Testing Reliability and Measurement Equivalence by Varying the Verbalization and Number of Categories

Peer reviewed

Direct link

Menold, Natalja; Tausch, Anja – Sociological Methods & Research, 2016

Effects of rating scale forms on cross-sectional reliability and measurement equivalence were investigated. A randomized experimental design was implemented, varying category labels and number of categories. The participants were 800 students at two German universities. In contrast to previous research, reliability assessment method was used,…

Descriptors: Rating Scales, Test Reliability, Measurement, Classification

Screening for Psychological Inflexibility: Initial Validation of the Avoidance and Fusion Questionnaire for Youth as a School Mental Health Screener

Peer reviewed

Direct link

Renshaw, Tyler L. – Journal of Psychoeducational Assessment, 2017

The present study reports on the initial validation of the eight-item version of the Avoidance and Fusion Questionnaire for Youth (AFQ-Y8) as a school mental health screener for identifying clinical-level depression and anxiety caseness within a sample of urban high school students (N = 219). Results indicated that responses to the AFQ-Y8 yielded…

Descriptors: Psychological Characteristics, Screening Tests, Questionnaires, Test Validity

As a Potential Source of Error, Measuring the Tendency of University Students to Copy the Answers: A Scale Development Study

Peer reviewed
PDF on ERIC

Download full text

Demir, Ergul – Eurasian Journal of Educational Research, 2018

Purpose: The answer-copying tendency has the potential to detect suspicious answer patterns for prior distributions of statistical detection techniques. The aim of this study is to develop a valid and reliable measurement tool as a scale in order to observe the tendency of university students' copying of answers. Also, it is aimed to provide…

Descriptors: College Students, Cheating, Test Construction, Student Behavior

Supplementary Report on easyCBM PRF Measures: A Follow-Up to Previous Technical Reports. Technical Report #1806

Download full text

Alonzo, Julie; Anderson, Daniel – Behavioral Research and Teaching, 2018

In response to a request for additional analyses, in particular reporting confidence intervals around the results, we re-analyzed the data from prior studies. This supplementary report presents the results of the additional analyses addressing classification accuracy, reliability, and criterion-related validity evidence. For ease of reference, we…

Descriptors: Curriculum Based Assessment, Computation, Statistical Analysis, Accuracy

Supplementary Report on easyCBM MCRC Measures: A Follow-Up to Previous Technical Reports. Technical Report #1807

Download full text

Alonzo, Julie; Anderson, Daniel – Behavioral Research and Teaching, 2018

Descriptors: Curriculum Based Assessment, Computation, Statistical Analysis, Classification

The Marketing of Canadian University Rankings: A Misadventure Now 24 Years Old

Peer reviewed
PDF on ERIC

Download full text

Cramer, Kenneth M.; Page, Stewart; Burrows, Vanessa; Lamoureux, Chastine; Mackay, Sarah; Pedri, Victoria; Pschibul, Rebecca – Collected Essays on Learning and Teaching, 2016

Based on analyses of Maclean's ranking data pertaining to Canadian universities published over the last 24 years, we present a summary of statistical findings of annual ranking exercises, as well as discussion about their current status and the effects upon student welfare. Some illustrative tables are also presented. Using correlational and…

Descriptors: Foreign Countries, Universities, Classification, Institutional Advancement

Approaches for Combining Multiple Measures of Teacher Performance: Reliability, Validity, and Implications for Evaluation Policy

Peer reviewed

Direct link

Martínez, José Felipe; Schweig, Jonathan; Goldschmidt, Pete – Educational Evaluation and Policy Analysis, 2016

A key question facing teacher evaluation systems is how to combine multiple measures of complex constructs into composite indicators of performance. We use data from the Measures of Effective Teaching (MET) study to investigate the measurement properties of composite indicators obtained under various conjunctive, disjunctive (or complementary),…

Descriptors: Teacher Evaluation, Outcome Measures, Evaluation Methods, Educational Policy

ACT Reporting Category Interpretation Guide: Version 1.0. ACT Working Paper 2016 (05)

Download full text

Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016

ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…

Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement

A Longitudinal Study of Reading Comprehension Achievement from Grades 3 to 10: Investigating Models of Stability, Cumulative Growth, and Compensation

Peer reviewed

Direct link

Kwiatkowska-White, Bozena; Kirby, John R.; Lee, Elizabeth A. – Journal of Psychoeducational Assessment, 2016

This longitudinal study of 78 Canadian English-speaking students examined the applicability of the stability, cumulative, and compensatory models in reading comprehension development. Archival government-mandated assessments of reading comprehension at Grades 3, 6, and 10, and the Canadian Test of Basic Skills measure of reading comprehension…

Descriptors: Longitudinal Studies, Reading Comprehension, Reading Achievement, Models

Assessing the Complexity of Students' Knowledge in Chemistry

Peer reviewed

Direct link

Bernholt, Sascha; Parchmann, Ilka – Chemistry Education Research and Practice, 2011

Current reforms in the education policy of various countries are intended to produce a paradigm shift in the educational system towards an outcome orientation. After implementing educational standards as normative objectives, the development of test procedures that adequately reflect these targets and standards is a central problem. This paper…

Descriptors: Science Achievement, Chemistry, Knowledge Level, Science Instruction

A Generalized Anova Model for Estimating the Reliability of Categorical Judgments.

Download full text

Enger, John M.; Whitney, Douglas R. – 1975

There are few existing or widely known measures of agreement applicable when data is nominal or categorical. Most such coefficients are applicable only when judges classify objects or subjects into a single category. A wider range of applications, including those where judges (1) place probabilities on subjects belonging to mutually exclusive and…

Descriptors: Analysis of Variance, Classification, Measurement Techniques, Models

A Consumers' Guide to Criterion-Referenced Test Reliability. Reliability.

Peer reviewed

Berk, Ronald A. – Journal of Educational Measurement, 1980

A dozen different approaches that yield 13 reliability indices for criterion-referenced tests were identified and grouped into three categories: threshold loss function, squared-error loss function, and domain score estimation. Indices were evaluated within each category. (Author/RL)

Descriptors: Classification, Criterion Referenced Tests, Cutting Scores, Evaluation Methods

ITEM SELECTION TECHNIQUES AND EVALUATION OF INSTRUCTIONAL OBJECTIVES.

COX, RICHARD C. – 1965

THE VALIDITY OF AN EDUCATIONAL ACHIEVEMENT TEST DEPENDS UPON THE CORRESPONDENCE BETWEEN SPECIFIED EDUCATIONAL OBJECTIVES AND THE EXTENT TO WHICH THESE OBJECTIVES ARE MEASURED BY THE EVALUATION INSTRUMENT. THIS STUDY IS DESIGNED TO EVALUATE THE EFFECT OF STATISTICAL ITEM SELECTION ON THE STRUCTURE OF THE FINAL EVALUATION INSTRUMENT AS COMPARED WITH…

Descriptors: Achievement Tests, Classification, Educational Objectives, Item Analysis

Previous Page | Next Page »

Pages: 1 | 2

Alonzo, Julie	2
Anderson, Daniel	2
Menold, Natalja	2
Bashkov, Bozhidar M.	1
Berk, Ronald A.	1
Bernholt, Sascha	1
Burrows, Vanessa	1
COX, RICHARD C.	1
Clauser, Jerome C.	1
Cramer, Kenneth M.	1
Demir, Ergul	1
Enger, John M.	1
Goldschmidt, Pete	1
Gottfredson, Stephen D.	1
Harris, Deborah J.	1
Harrison, Michael	1
Kirby, John R.	1
Kwiatkowska-White, Bozena	1
Lamoureux, Chastine	1
Lee, Elizabeth A.	1
Li, Dongmei	1
Mackay, Sarah	1
Marcoulides, George A.	1
Martínez, José Felipe	1
Mellenbergh, Gideon J.	1
More ▼