ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	11
Since 2017 (last 10 years)	45
Since 2007 (last 20 years)	123

Descriptor

Test Reliability	152
Test Validity	95
Reliability	84
Test Construction	68
Scoring	48
Tables (Data)	45
Scores	40
Academic Achievement	38
Reading Tests	38
Elementary School Students	35
Elementary Secondary Education	35
Mathematics Tests	34
Statistical Analysis	34
Test Items	34
Interrater Reliability	31
Psychometrics	30
Testing Programs	30
Validity	30
Higher Education	28
Item Response Theory	28
Achievement Tests	27
Grade 5	27
Grade 8	27
Language Arts	27
Student Evaluation	27

Publication Type

Numerical/Quantitative Data	252
Reports - Research	127
Reports - Evaluative	68
Reports - Descriptive	41
Tests/Questionnaires	27
Speeches/Meeting Papers	22
Journal Articles	17
Guides - Non-Classroom	10
Collected Works - General	4
Guides - General	3
Books	1
Guides - Classroom - Learner	1
Reference Materials -…	1

Education Level

Elementary Education	62
Secondary Education	41
Elementary Secondary Education	40
Middle Schools	38
Early Childhood Education	34
Junior High Schools	34
Primary Education	28
Grade 5	27
Grade 3	23
Grade 7	22
Grade 8	22
Grade 4	21
Grade 6	21
Higher Education	20
Intermediate Grades	20
Kindergarten	17
High Schools	13
Postsecondary Education	13
Grade 1	11
Grade 2	10
Grade 9	6
Grade 10	4
Grade 11	4
Preschool Education	4
Grade 12	3

Audience

Practitioners	6
Researchers	5
Administrators	3
Teachers	2
Parents	1
Students	1

Location

Florida	10
New York	8
Illinois	7
Nebraska	7
United States	7
California	6
Maryland	5
Massachusetts	5
Pennsylvania	5
North Carolina	4
Texas	4
Arizona	3
Australia	3
New Mexico	3
Oregon	3
Tennessee	3
Washington	3
District of Columbia	2
Georgia	2
Kentucky	2
Louisiana	2
Ohio	2
South Carolina	2
Virginia	2
Alaska	1

Laws, Policies, & Programs

American Recovery and…	6
Race to the Top	6
Individuals with Disabilities…	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Active filter: Numerical/Quantitative Data

Showing 106 to 120 of 252 results

Reliability of the Test of Spoken English Revisited. Research Reports, Report 40.

Boldt, R. F. – 1992

The Test of Spoken English (TSE) is an internationally administered instrument for assessing nonnative speakers' proficiency in speaking English. The research foundation of the TSE examination described in its manual refers to two sources of variation other than the achievement being measured: interrater reliability and internal consistency.…

Descriptors: Adults, Analysis of Variance, Interrater Reliability, Language Proficiency

Technical Manual: 2002 Series GED Tests

Ezzelle, Carol; Setzer, J. Carl – GED Testing Service, 2009

This manual was written to provide technical information regarding the 2002 Series GED (General Educational Development) Tests. Throughout this manual, documentation is provided regarding the development of the GED Tests, data collection activities, as well as reliability and validity evidence. The purpose of this manual is to provide evidence…

Descriptors: High School Equivalency Programs, Testing Programs, Test Validity, Test Reliability

Technical Adequacy of the easyCBM Grade 2 Reading Measures. Technical Report #1004

Jamgochian, Elisa; Park, Bitnara Jasmine; Nese, Joseph F. T.; Lai, Cheng-Fei; Saez, Leilani; Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2010

In this technical report, we provide reliability and validity evidence for the easyCBM® Reading measures for grade 2 (word and passage reading fluency and multiple choice reading comprehension). Evidence for reliability includes internal consistency and item invariance. Evidence for validity includes concurrent, predictive, and construct…

Descriptors: Grade 2, Reading Comprehension, Testing Programs, Reading Fluency

The Reliability of Teacher Decision-Making in Recommending Accommodations for Large-Scale Tests. Technical Report # 08-01

Tindal, Gerald; Lee, Daesik; Geller, Leanne Ketterlin – Behavioral Research and Teaching, 2008

In this paper we review different methods for teachers to recommend accommodations in large scale tests. Then we present data on the stability of their judgments on variables relevant to this decision-making process. The outcomes from the judgments support the need for a more explicit model. Four general categories are presented: student…

Descriptors: Teachers, Reliability, Decision Making, Testing Accommodations

Assessing the Reliability of Tests Used to Make Pass/Fail Decisions.

Peer reviewed

Livingston, Samuel A.; Wingersky, Marilyn A. – Journal of Educational Measurement, 1979

Procedures are described for studying the reliability of decisions based on specific passing scores with tests made up of discrete items and designed to measure continuous rather than categorical traits. These procedures are based on the estimation of the joint distribution of true scores and observed scores. (CTM)

Descriptors: Cutting Scores, Decision Making, Efficiency, Error of Measurement

Statistical Properties of Accountability Measures Based on ACT's Educational Planning and Assessment System. ACT Research Report Series, 2009-1

Allen, Jeff; Bassiri, Dina; Noble, Julie – ACT, Inc., 2009

Educational accountability has grown substantially over the last decade, due in large part to the No Child Left Behind Act of 2001. Accordingly, educational researchers and policymakers are interested in the statistical properties of accountability models used for NCLB, such as status, improvement, and growth models; as well as others that are not…

Descriptors: Academic Achievement, High School Students, Accountability, Statistical Analysis

The Lester Attitude toward Death Scale.

Peer reviewed

Lester, David – Omega: Journal of Death and Dying, 1991

Published Lester Attitude toward Death Scale for first time, together with data on its reliability and validity. Notes that scale is different from other fear of death scales in its use of scaled value approach that permits measure of inconsistency in attitudes. (Author)

Descriptors: Attitude Measures, Death, Test Reliability, Test Validity

Rethinking Teacher Evaluation in Chicago: Lessons Learned from Classroom Observations, Principal-Teacher Conferences, and District Implementation. Research Report

Sartain, Lauren; Stoelinga, Sara Ray; Brown, Eric R. – Consortium on Chicago School Research, 2011

This report summarizes findings from a two-year study of Chicago's Excellence in Teaching Pilot, which was designed to drive instructional improvement by providing teachers with evidence-based feedback on their strengths and weaknesses. The pilot consisted of training and support for principals and teachers, principal observations of teaching…

Descriptors: Evidence, Feedback (Response), Public Schools, Teacher Effectiveness

Context Bias in the Test of English as a Foreign Language.

Angoff, William H. – 1989

This study was undertaken to test the hypothesis that items of the Test of English as a Foreign Language (TOEFL) containing reference to American people, places, customs, etc., tend to favor examinees who have spent some time living in the United States. Two samples of examinees were drawn from the March 1987 TOEFL administration, one tested in…

Descriptors: Context Effect, English (Second Language), Evaluators, Foreign Nationals

New Mexico Standards-Based Assessment Technical Report: Spring 2007 Administration

New Mexico Public Education Department, 2007

The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of and technical characteristics of the 2007 NMSBA. The 2007 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Summary of student performance; (4) Statistical analyses of item and…

Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring

A Model for Estimating the Reliability of Psychomotor Test Batteries.

Peer reviewed

Wood, Terry M.; Safrit, Margaret J. – Research Quarterly for Exercise and Sport, 1984

A proposed model for estimating psychomotor test battery reliability, based upon canonical correlation analysis, is described. (Author/JMK)

Descriptors: Evaluation Criteria, Multivariate Analysis, Physical Education, Psychomotor Skills

Establishing the Reliability of Student Proficiency Classifications: The Accuracy of Observed Classifications.

Hoffman, R. Gene; Wise, Lauress L. – 2000

Classical test theory is based on the concept of a true score for each examinee, defined as the expected or average score across an infinite number of repeated parallel tests. In most cases, there is only a score from a single administration of the test in question. The difference between this single observed score and the underlying true score is…

Descriptors: Achievement, Classification, Observation, Probability

Equal Appearing Interval and Visual Analogue Scaling of Perceptual Roughness and Breathiness

Peer reviewed

Yiu, Edwin M.-L.; Ng, Chi-Yan – Clinical Linguistics and Phonetics, 2004

One of the factors that affects the reliability of perceptual voice evaluation is the rating scale. Equal-appearing interval (EAI) and visual analogue (VA) scales are the two most common scales used and have attracted much attention in recent studies of perceptual voice evaluation. Available findings are contradictory, with one study finding the…

Descriptors: Test Reliability, Measurement Techniques, Rating Scales, Phonetics

A Generalizability Approach To Evaluating the Reliability of Testlet-Based Test Scores.

Lee, Guemin; Frisbie, David A. – 1997

Previous studies have indicated that the reliability of test scores composed of testlets might be overestimated by conventional item-based reliability estimation methods (R. Thorndike, 1953; A. Anastasi, 1988; S. Sireci, D. Thissen, and H. Wainer, 1991; H. Wainer and D. Thissen, 1996). This study used generalizability theory to investigate the…

Descriptors: Estimation (Mathematics), Generalizability Theory, Reliability, Scores

Tables of Reliability Coefficients for Mastery Tests.

Subkoviak, Michael J. – 1985

Current methods of obtaining reliability coefficients for mastery tests are laborious from a practitioner's perspective. Some methods require two test administrations; while others require access to computer facilities and/or advanced measurement and statistical procedures. This report provides tables from which practitioners can read such…

Descriptors: Estimation (Mathematics), Mastery Tests, Statistical Studies, Tables (Data)

Source

Behavioral Research and…	27
Nebraska Department of…	7
US Department of Education	6
National Center for Education…	5
New York State Education…	5
Online Submission	5
Partnership for Assessment of…	5
IDEA Center, Inc.	4
National Center for Education…	4
Regional Educational…	4
ACT, Inc.	3
Florida Center for Reading…	3
Regional Educational…	3
College Board	2
Educational Testing Service	2
GED Testing Service	2
Grantee Submission	2
Maryland State Department of…	2
National Center on…	2
National Centre for…	2
New Meridian Corporation	2
New Mexico Public Education…	2
Regional Educational…	2
Regional Educational…	2
Regional Educational…	2

Author

Alonzo, Julie	26
Tindal, Gerald	24
Lai, Cheng-Fei	16
Anderson, Daniel	14
Park, Bitnara Jasmine	13
Irvin, P. Shawn	8
Nese, Joseph F. T.	7
Petscher, Yaacov	5
Gill, Brian	4
Saez, Leilani	4
Benton, Stephen L.	3
Chiang, Hanley	3
Foorman, Barbara R.	3
Jamgochian, Elisa	3
Lipscomb, Stephen	3
Schatschneider, Chris	3
Alley, Gordon R.	2
Bennett, Randy Elliot	2
Brennan, Robert L.	2
Brick, J. Michael	2
Dahlke, Katie	2
Guo, Meixi	2
Hanson, Thomas	2
Jamgochian, Elisa M.	2

Assessments and Surveys

Stanford Achievement Tests	8
Measures of Academic Progress	5
Program for International…	5
Test of English as a Foreign…	4
Trends in International…	4
Comprehensive Tests of Basic…	3
General Educational…	3
ACT Assessment	2
California Achievement Tests	2
College Student Experiences…	2
Early Childhood Longitudinal…	2
Graduate Record Examinations	2
Law School Admission Test	2
Massachusetts Comprehensive…	2
Medical College Admission Test	2
Metropolitan Achievement Tests	2
National Household Education…	2
Peabody Picture Vocabulary…	2
SAT (College Admission Test)	2
SRA Achievement Series	2
Wechsler Adult Intelligence…	2
Adjustment Scales for…	1
Bayley Scales of Infant…	1
Child Behavior Checklist	1
Classroom Assessment Scoring…	1