Kuan-Yu Jin; Wai-Lok Siu – Journal of Educational Measurement, 2025
Educational tests often have a cluster of items linked by a common stimulus ("testlet"). In such a design, the dependencies caused between items are called "testlet effects." In particular, the directional testlet effect (DTE) refers to a recursive influence whereby responses to earlier items can positively or negatively affect…
Descriptors: Models, Test Items, Educational Assessment, Scores
van der Lans, Rikkert M.; Maulana, Ridwan; Helms-Lorenz, Michelle; Fernández-García, Carmen-María; Chun, Seyeoung; de Jager, Thelma; Irnidayanti, Yulia; Inda-Caro, Mercedes; Lee, Okhwa; Coetzee, Thys; Fadhilah, Nurul; Jeon, Meae; Moorer, Peter – SAGE Open, 2021
This study examines measurement invariance of student perceptions of teaching quality collected in five countries: Indonesia (n students = 6,331), the Netherlands (n students = 6,738), South Africa (n students = 3,422), South Korea (n students = 6,997) and Spain (n students = 4,676). The administered questionnaire was the My Teacher Questionnaire…
Descriptors: Foreign Countries, Student Attitudes, Student Evaluation of Teacher Performance, Teacher Effectiveness
Haberman, Shelby J.; Liu, Yang; Lee, Yi-Hsuan – ETS Research Report Series, 2019
Distractor analyses are routinely conducted in educational assessments with multiple-choice items. In this research report, we focus on three item response models for distractors: (a) the traditional nominal response (NR) model, (b) a combination of a two-parameter logistic model for item scores and a NR model for selections of incorrect…
Descriptors: Multiple Choice Tests, Scores, Test Reliability, High Stakes Tests
Warlop, Daniel M. – Curriculum and Teaching Dialogue, 2016
This chapter is a research summary of the author's doctoral dissertation completed in May, 2015, which investigates the way Standardized Assessment (SA) is used in state educational accountability structures. This quasi-experimental quantitative study found that SA scores trend towards consistency over time, and that there is additional variance,…
Descriptors: Accountability, Educational Assessment, Student Evaluation, Public Education
Lichtenstein, Robert – Communique, 2013
Assessment of human abilities and behaviors is enormously enhanced by the use of standardized assessment measures that yield norm-referenced scores. School psychologists rely on quantitative findings to anchor their judgments about a child's developmental and educational functioning and to enhance their capacity to draw diagnostic…
Descriptors: Test Results, School Psychologists, Psychoeducational Methods, Scores
Floyd, Randy G.; Shands, Elizabeth I.; Alfonso, Vincent C.; Phillips, Jessica F.; Autry, Beth K.; Mosteller, Jessica A.; Skinner, Mary; Irby, Sarah – Journal of Applied School Psychology, 2015
Adaptive behavior scales are vital in assessing children and adolescents who experience a range of disabling conditions in school settings. This article presents the results of an evaluation of the design characteristics, norming, scale characteristics, reliability and validity evidence, and bias identification studies supporting 14…
Descriptors: Behavior Rating Scales, Psychometrics, Daily Living Skills, Evaluation Criteria
Peng, Pai; Hochweber, Jan; Klieme, Eckhard – Frontiers of Education in China, 2013
Outcome-oriented evaluation of school effectiveness is often based on student test scores in certain critical examinations. This study provides another method of evaluation--value-added--which is based on student achievement progress. This paper introduces the method of estimating the value-added score of schools in multi-level models. Based on…
Descriptors: School Effectiveness, Foreign Countries, Achievement Gains, Outcomes of Education
Wise, Lauress L. – Applied Measurement in Education, 2010
The articles in this special issue make two important contributions to our understanding of the impact of accommodations on test score validity. First, they illustrate a variety of methods for collection and rigorous analyses of empirical data that can supplant expert judgment of the impact of accommodations. These methods range from internal…
Descriptors: Reading Achievement, Educational Assessment, Test Reliability, Learning Disabilities
Lorence, Jon – Educational Research Quarterly, 2010
The Texas Assessment of Academic Skills (TAAS) test was the major source of data for the Texas educational accountability system from 1994 through 2002. Contrary to critics who claim that TAAS data are invalid and unreliable measures of student performance, structural equation analyses of TAAS reading data based on the 1994 Texas third grade…
Descriptors: Educational Assessment, High Stakes Tests, Reading Tests, Scores
Peer reviewed: Ferguson, Richard L. – NASSP Bulletin, 1976
Declining test scores have been a major cause for concern in the past year. Describes the decline and explores some possible causes. (Editor/RK)
Descriptors: Achievement Tests, Data Analysis, Data Collection, Educational Assessment
Hoffman, Anne – 1997
The Ability Explorer (AE) is a newly developed self-report inventory of abilities that is appropriate for group or individual administration. There are machine-scorable and hand-scorable versions of the test, and there are two levels. Level 1 is for students from junior high to high school, and Level 2 is for high school students and adults.…
Descriptors: Ability, Adolescents, Adults, Aptitude Tests
Peer reviewed: Reckase, Mark D. – Educational Measurement: Issues and Practice, 1995
An example application of portfolio assessment was developed, and the model and estimates of reliability derived from the literature were then used to estimate the characteristics of an operational large-scale portfolio assessment program. Costs were estimated to put results in a realistic context. (SLD)
Descriptors: Cost Estimates, Educational Assessment, Educational Theories, Models
Peer reviewed: Wainer, Howard; Thissen, David – Applied Measurement in Education, 1993
Because assessment instruments of the future may well be composed of a combination of types of questions, a way to combine those scores effectively is discussed. Two new graphic tools are presented that show that it may not be practical to equalize the reliability of different components. (SLD)
Descriptors: Constructed Response, Educational Assessment, Graphs, Item Response Theory
Peer reviewed: Crowley, Susan L.; And Others – Educational and Psychological Measurement, 1994
Dependability of the Children's Depression Inventory (CDI) was studied using both generalizability and classical test score analyses with a sample of 164 elementary school students. Results suggest that sources of error variance interact to decrease dependability of CDI scores. Depression in children might be better assessed through multiple…
Descriptors: Children, Clinical Diagnosis, Comparative Analysis, Depression (Psychology)
Cizek, Gregory J.; Crocker, Linda; Frisbie, David A.; Mehrens, William A.; Stiggins, Richard J. – Educational Measurement: Issues and Practice, 2006
The authors describe the significant contributions of Robert Ebel to educational measurement theory and its applications. A biographical sketch details Ebel's roots and professional resume. His influence on classroom assessment views and procedures is explored. Classic publications associated with validity, reliability, and score interpretation…
Descriptors: Test Theory, Educational Assessment, Psychometrics, Test Reliability

