Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 6 |
| Since 2007 (last 20 years) | 12 |
Descriptor
| Classification | 17 |
| Statistical Analysis | 17 |
| Test Reliability | 17 |
| Test Validity | 7 |
| Foreign Countries | 5 |
| Accuracy | 4 |
| Reading Tests | 4 |
| Computation | 3 |
| Correlation | 3 |
| Evaluation Methods | 3 |
| Grade 3 | 3 |
| More ▼ | |
Source
Author
| Alonzo, Julie | 2 |
| Anderson, Daniel | 2 |
| Menold, Natalja | 2 |
| Bashkov, Bozhidar M. | 1 |
| Berk, Ronald A. | 1 |
| Bernholt, Sascha | 1 |
| Burrows, Vanessa | 1 |
| COX, RICHARD C. | 1 |
| Clauser, Jerome C. | 1 |
| Cramer, Kenneth M. | 1 |
| Demir, Ergul | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 12 |
| Reports - Research | 12 |
| Numerical/Quantitative Data | 3 |
| Reports - Descriptive | 2 |
| Guides - General | 1 |
| Guides - Non-Classroom | 1 |
| Information Analyses | 1 |
| Reports - Evaluative | 1 |
Education Level
| Secondary Education | 5 |
| Grade 6 | 4 |
| Higher Education | 4 |
| Early Childhood Education | 3 |
| Elementary Education | 3 |
| Grade 3 | 3 |
| Grade 7 | 3 |
| Grade 8 | 3 |
| Intermediate Grades | 3 |
| Middle Schools | 3 |
| Postsecondary Education | 3 |
| More ▼ | |
Audience
Location
| Canada | 2 |
| Germany | 2 |
| Pennsylvania (Pittsburgh) | 1 |
| Turkey | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| ACT Assessment | 1 |
| Beck Anxiety Inventory | 1 |
| Center for Epidemiologic… | 1 |
What Works Clearinghouse Rating
Raykov, Tenko; Marcoulides, George A.; Harrison, Michael; Menold, Natalja – Educational and Psychological Measurement, 2019
This note confronts the common use of a single coefficient alpha as an index informing about reliability of a multicomponent measurement instrument in a heterogeneous population. Two or more alpha coefficients could instead be meaningfully associated with a given instrument in finite mixture settings, and this may be increasingly more likely the…
Descriptors: Statistical Analysis, Test Reliability, Measures (Individuals), Computation
Bashkov, Bozhidar M.; Clauser, Jerome C. – Practical Assessment, Research & Evaluation, 2019
Successful testing programs rely on high-quality test items to produce reliable scores and defensible exams. However, determining what statistical screening criteria are most appropriate to support these goals can be daunting. This study describes and demonstrates cost-benefit analysis as an empirical approach to determining appropriate screening…
Descriptors: Test Items, Test Reliability, Evaluation Criteria, Accuracy
Menold, Natalja; Tausch, Anja – Sociological Methods & Research, 2016
Effects of rating scale forms on cross-sectional reliability and measurement equivalence were investigated. A randomized experimental design was implemented, varying category labels and number of categories. The participants were 800 students at two German universities. In contrast to previous research, reliability assessment method was used,…
Descriptors: Rating Scales, Test Reliability, Measurement, Classification
Renshaw, Tyler L. – Journal of Psychoeducational Assessment, 2017
The present study reports on the initial validation of the eight-item version of the Avoidance and Fusion Questionnaire for Youth (AFQ-Y8) as a school mental health screener for identifying clinical-level depression and anxiety caseness within a sample of urban high school students (N = 219). Results indicated that responses to the AFQ-Y8 yielded…
Descriptors: Psychological Characteristics, Screening Tests, Questionnaires, Test Validity
Demir, Ergul – Eurasian Journal of Educational Research, 2018
Purpose: The answer-copying tendency has the potential to detect suspicious answer patterns for prior distributions of statistical detection techniques. The aim of this study is to develop a valid and reliable measurement tool as a scale in order to observe the tendency of university students' copying of answers. Also, it is aimed to provide…
Descriptors: College Students, Cheating, Test Construction, Student Behavior
Alonzo, Julie; Anderson, Daniel – Behavioral Research and Teaching, 2018
In response to a request for additional analyses, in particular reporting confidence intervals around the results, we re-analyzed the data from prior studies. This supplementary report presents the results of the additional analyses addressing classification accuracy, reliability, and criterion-related validity evidence. For ease of reference, we…
Descriptors: Curriculum Based Assessment, Computation, Statistical Analysis, Accuracy
Alonzo, Julie; Anderson, Daniel – Behavioral Research and Teaching, 2018
In response to a request for additional analyses, in particular reporting confidence intervals around the results, we re-analyzed the data from prior studies. This supplementary report presents the results of the additional analyses addressing classification accuracy, reliability, and criterion-related validity evidence. For ease of reference, we…
Descriptors: Curriculum Based Assessment, Computation, Statistical Analysis, Classification
Cramer, Kenneth M.; Page, Stewart; Burrows, Vanessa; Lamoureux, Chastine; Mackay, Sarah; Pedri, Victoria; Pschibul, Rebecca – Collected Essays on Learning and Teaching, 2016
Based on analyses of Maclean's ranking data pertaining to Canadian universities published over the last 24 years, we present a summary of statistical findings of annual ranking exercises, as well as discussion about their current status and the effects upon student welfare. Some illustrative tables are also presented. Using correlational and…
Descriptors: Foreign Countries, Universities, Classification, Institutional Advancement
Martínez, José Felipe; Schweig, Jonathan; Goldschmidt, Pete – Educational Evaluation and Policy Analysis, 2016
A key question facing teacher evaluation systems is how to combine multiple measures of complex constructs into composite indicators of performance. We use data from the Measures of Effective Teaching (MET) study to investigate the measurement properties of composite indicators obtained under various conjunctive, disjunctive (or complementary),…
Descriptors: Teacher Evaluation, Outcome Measures, Evaluation Methods, Educational Policy
Powers, Sonya; Li, Dongmei; Suh, Hongwook; Harris, Deborah J. – ACT, Inc., 2016
ACT reporting categories and ACT Readiness Ranges are new features added to the ACT score reports starting in fall 2016. For each reporting category, the number correct score, the maximum points possible, the percent correct, and the ACT Readiness Range, along with an indicator of whether the reporting category score falls within the Readiness…
Descriptors: Scores, Classification, College Entrance Examinations, Error of Measurement
Kwiatkowska-White, Bozena; Kirby, John R.; Lee, Elizabeth A. – Journal of Psychoeducational Assessment, 2016
This longitudinal study of 78 Canadian English-speaking students examined the applicability of the stability, cumulative, and compensatory models in reading comprehension development. Archival government-mandated assessments of reading comprehension at Grades 3, 6, and 10, and the Canadian Test of Basic Skills measure of reading comprehension…
Descriptors: Longitudinal Studies, Reading Comprehension, Reading Achievement, Models
Bernholt, Sascha; Parchmann, Ilka – Chemistry Education Research and Practice, 2011
Current reforms in the education policy of various countries are intended to produce a paradigm shift in the educational system towards an outcome orientation. After implementing educational standards as normative objectives, the development of test procedures that adequately reflect these targets and standards is a central problem. This paper…
Descriptors: Science Achievement, Chemistry, Knowledge Level, Science Instruction
Enger, John M.; Whitney, Douglas R. – 1975
There are few existing or widely known measures of agreement applicable when data is nominal or categorical. Most such coefficients are applicable only when judges classify objects or subjects into a single category. A wider range of applications, including those where judges (1) place probabilities on subjects belonging to mutually exclusive and…
Descriptors: Analysis of Variance, Classification, Measurement Techniques, Models
Peer reviewedBerk, Ronald A. – Journal of Educational Measurement, 1980
A dozen different approaches that yield 13 reliability indices for criterion-referenced tests were identified and grouped into three categories: threshold loss function, squared-error loss function, and domain score estimation. Indices were evaluated within each category. (Author/RL)
Descriptors: Classification, Criterion Referenced Tests, Cutting Scores, Evaluation Methods
COX, RICHARD C. – 1965
THE VALIDITY OF AN EDUCATIONAL ACHIEVEMENT TEST DEPENDS UPON THE CORRESPONDENCE BETWEEN SPECIFIED EDUCATIONAL OBJECTIVES AND THE EXTENT TO WHICH THESE OBJECTIVES ARE MEASURED BY THE EVALUATION INSTRUMENT. THIS STUDY IS DESIGNED TO EVALUATE THE EFFECT OF STATISTICAL ITEM SELECTION ON THE STRUCTURE OF THE FINAL EVALUATION INSTRUMENT AS COMPARED WITH…
Descriptors: Achievement Tests, Classification, Educational Objectives, Item Analysis
Previous Page | Next Page »
Pages: 1 | 2
Direct link
