ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	3
Since 2007 (last 20 years)	10

Descriptor

Educational Assessment	38
Scores	38
Test Reliability	38
Test Validity	18
State Programs	11
Elementary Secondary Education	10
Test Construction	10
Achievement Tests	9
Performance Based Assessment	9
Test Interpretation	9
Elementary Education	7
Evaluation Methods	7
Scoring	7
Test Results	7
Testing Programs	7
Comparative Analysis	6
Academic Achievement	5
Accountability	5
Elementary School Students	5
Models	5
Portfolios (Background…	5
Student Evaluation	5
Test Use	5
Basic Skills	4
Educational Objectives	4
More ▼

Source

Applied Measurement in…	4
Educational Measurement:…	2
Communique	1
Curriculum and Teaching…	1
ETS Research Report Series	1
Educational Research Quarterly	1
Educational and Psychological…	1
Frontiers of Education in…	1
International Association for…	1
Journal of Applied School…	1
Journal of Educational…	1
NASSP Bulletin	1
SAGE Open	1
More ▼

Publication Type

Reports - Research	16
Journal Articles	15
Reports - Evaluative	14
Speeches/Meeting Papers	6
Reports - Descriptive	3
Collected Works - Proceedings	2
Book/Product Reviews	1
Dissertations/Theses -…	1
Information Analyses	1
Numerical/Quantitative Data	1
Tests/Questionnaires	1
More ▼

Education Level

Elementary Secondary Education	2
Grade 4	2
Grade 8	2
Higher Education	2
Postsecondary Education	2
Grade 3	1
Grade 5	1
Grade 6	1
Grade 7	1
High Schools	1
Secondary Education	1
More ▼

Audience

Practitioners

Location

Vermont	4
Florida	2
Illinois	2
Netherlands	2
Ohio	2
South Korea	2
Spain	2
Alaska	1
Asia	1
Australia	1
Brazil	1
China	1
Colorado	1
Connecticut	1
Denmark	1
Egypt	1
Estonia	1
Germany	1
Greece	1
Hawaii	1
Indonesia	1
Ireland	1
Israel	1
Italy	1
Japan	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

ACT Assessment	2
National Assessment of…	2
Childrens Depression Inventory	1
Pennsylvania Educational…	1
Vineland Adaptive Behavior…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 38 results Save | Export

Modeling Directional Testlet Effects on Multiple Open-Ended Questions

Peer reviewed

Direct link

Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025

Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…

Descriptors: Models, Test Items, Educational Assessment, Scores

Student Perceptions of Teaching Quality in Five Countries: A Partial Credit Model Approach to Assess Measurement Invariance

Peer reviewed

Direct link

van der Lans, Rikkert M.; Maulana, Ridwan; Helms-Lorenz, Michelle; Fernández-García, Carmen-María; Chun, Seyeoung; de Jager, Thelma; Irnidayanti, Yulia; Inda-Caro, Mercedes; Lee, Okhwa; Coetzee, Thys; Fadhilah, Nurul; Jeon, Meae; Moorer, Peter – SAGE Open, 2021

This study examines measurement invariance of student perceptions of teaching quality collected in five countries: Indonesia (n students = 6,331), the Netherlands (n students = 6,738), South Africa (n students = 3,422), South Korea (n students = 6,997) and Spain (n students = 4,676). The administered questionnaire was the My Teacher Questionnaire…

Descriptors: Foreign Countries, Student Attitudes, Student Evaluation of Teacher Performance, Teacher Effectiveness

Distractor Analysis for Multiple-Choice Tests: An Empirical Study with International Language Assessment Data. Research Report. ETS RR-19-39

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J.; Liu, Yang; Lee, Yi-Hsuan – ETS Research Report Series, 2019

Distractor analyses are routinely conducted in educational assessments with multiple-choice items. In this research report, we focus on three item response models for distractors: (a) the traditional nominal response (NR) model, (b) a combination of a two-parameter logistic model for item scores and a NR model for selections of incorrect…

Descriptors: Multiple Choice Tests, Scores, Test Reliability, High Stakes Tests

Threats to Validity in Accountability Structures for Public Education

Peer reviewed

Direct link

Warlop, Daniel M. – Curriculum and Teaching Dialogue, 2016

This chapter is a research summary of the author's doctoral dissertation completed in May, 2015, which investigates the way Standardized Assessment (SA) is used in state educational accountability structures. This quasi-experimental quantitative study found that SA scores trend towards consistency over time, and that there is additional variance,…

Descriptors: Accountability, Educational Assessment, Student Evaluation, Public Education

Psychoeducational Reports That Matter: A Consumer-Responsive Approach, Part 2

Direct link

Lichtenstein, Robert – Communique, 2013

Assessment of human abilities and behaviors is enormously enhanced by the use of standardized assessment measures that yield norm-referenced scores. As school psychologists, they rely on quantitative findings to anchor their judgments about a child's developmental and educational functioning and to enhance our capacity to draw diagnostic…

Descriptors: Test Results, School Psychologists, Psychoeducational Methods, Scores

A Systematic Review and Psychometric Evaluation of Adaptive Behavior Scales and Recommendations for Practice

Peer reviewed

Direct link

Floyd, Randy G.; Shands, Elizabeth I.; Alfonso, Vincent C.; Phillips, Jessica F.; Autry, Beth K.; Mosteller, Jessica A.; Skinner, Mary; Irby, Sarah – Journal of Applied School Psychology, 2015

Adaptive behavior scales are vital in assessing children and adolescents who experience a range of disabling conditions in school settings. This article presents the results of an evaluation of the design characteristics, norming, scale characteristics, reliability and validity evidence, and bias identification studies supporting 14…

Descriptors: Behavior Rating Scales, Psychometrics, Daily Living Skills, Evaluation Criteria

Test Score or Student Progress? A Value-Added Evaluation of School Effectiveness in Urban China

Peer reviewed

Direct link

Peng, Pai; Hochweber, Jan; Klieme, Eckhard – Frontiers of Education in China, 2013

Outcome-oriented evaluation of school effectiveness is often based on student test scores in certain critical examinations. This study provides another method of evaluation--value-added--which is based on student achievement progress. This paper introduces the method of estimating the value-added score of schools in multi-level models. Based on…

Descriptors: School Effectiveness, Foreign Countries, Achievement Gains, Outcomes of Education

Accessible Reading Assessments for Students with Disabilities: Summary and Conclusions

Peer reviewed

Direct link

Wise, Lauress L. – Applied Measurement in Education, 2010

The articles in this special issue make two important contributions to our understanding of the impact of accommodations on test score validity. First, they illustrate a variety of methods for collection and rigorous analyses of empirical data that can supplant expert judgment of the impact of accommodations. These methods range from internal…

Descriptors: Reading Achievement, Educational Assessment, Test Reliability, Learning Disabilities

Trait Validity and Reliability of TAAS Reading Scores: 1994-1999

Peer reviewed

Direct link

Lorence, Jon – Educational Research Quarterly, 2010

The Texas Assessment of Academic Skills (TAAS) test was the major source of data for the Texas educational accountability system from 1994 through 2002. Contrary to critics who claim that TAAS data are invalid and unreliable measures of student performance, structural equation analyses of TAAS reading data based on the 1994 Texas third grade…

Descriptors: Educational Assessment, High Stakes Tests, Reading Tests, Scores

ACT Test Scores--What's Behind the Decline?

Peer reviewed

Ferguson, Richard L. – NASSP Bulletin, 1976

Declining test scores have been a major cause for concern in the past year. Describes the decline and explores some possible causes. (Editor/RK)

Descriptors: Achievement Tests, Data Analysis, Data Collection, Educational Assessment

Ability Explorer: A Review and Critique.

Download full text

Hoffman, Anne – 1997

The Ability Explorer (AE) is a newly developed self-report inventory of abilities that is appropriate for group or individual administration. There are machine-scorable and hand-scorable versions of the test, and there are two levels. Level 1 is for students from junior high to high school, and Level 2 is for high school students and adults.…

Descriptors: Ability, Adolescents, Adults, Aptitude Tests

Portfolio Assessment: A Theoretical Estimate of Score Reliability.

Peer reviewed

Reckase, Mark D. – Educational Measurement: Issues and Practice, 1995

An example application of portfolio assessment was developed and the model and estimates of reliability derived from the literature were then used to estimate the characteristics of an operational large-scale portfolio assessment program. Costs were estimated to put results in a realistic context. (SLD)

Descriptors: Cost Estimates, Educational Assessment, Educational Theories, Models

Combining Multiple-Choice and Constructed-Response Test Scores: Toward a Marxist Theory of Test Construction.

Peer reviewed

Wainer, Howard; Thissen, David – Applied Measurement in Education, 1993

Because assessment instruments of the future may well be composed of a combination of types of questions, a way to combine those scores effectively is discussed. Two new graphic tools are presented that show that it may not be practical to equalize the reliability of different components. (SLD)

Descriptors: Constructed Response, Educational Assessment, Graphs, Item Response Theory

The Children's Depression Inventory: A Comparison of Generalizability and Classical Test Theory Analyses.

Peer reviewed

Crowley, Susan L.; And Others – Educational and Psychological Measurement, 1994

Dependability of the Children's Depression Inventory (CDI) was studied using both generalizability and classical test score analyses with a sample of 164 elementary school students. Results suggest that sources of error variance interact to decrease dependability of CDI scores. Depression in children might be better assessed through multiple…

Descriptors: Children, Clinical Diagnosis, Comparative Analysis, Depression (Psychology)

A Tribute to Robert L. Ebel: Scholar, Teacher, Mentor, and Statesman

Peer reviewed

Direct link

Cizek, Gregory J.; Crocker, Linda; Frisbie, David A.; Mehrens, William A.; Stiggins, Richard J. – Educational Measurement: Issues and Practice, 2006

The authors describe the significant contributions of Robert Ebel to educational measurement theory and its applications. A biographical sketch details Ebel's roots and professional resume. His influence on classroom assessment views and procedures are explored. Classic publications associated with validity, reliability, and score interpretation…

Descriptors: Test Theory, Educational Assessment, Psychometrics, Test Reliability

Previous Page | Next Page »

Pages: 1 | 2 | 3

Koretz, Daniel	3
Wise, Lauress L.	2
Alfonso, Vincent C.	1
Alspaugh, John W.	1
Autry, Beth K.	1
Awomolo, Ademola	1
Chun, Seyeoung	1
Cizek, Gregory J.	1
Coetzee, Thys	1
Crocker, Linda	1
Crowley, Susan L.	1
Dunbar, Stephen B.	1
Fadhilah, Nurul	1
Ferguson, Richard L.	1
Fernández-García, Carmen-María	1
Floyd, Randy G.	1
Forster, Greg	1
Frisbie, David A.	1
Greene, Jay P.	1
Haberman, Shelby J.	1
Helms-Lorenz, Michelle	1
Hochweber, Jan	1
Hoffman, Anne	1
Inda-Caro, Mercedes	1
More ▼