Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 3 |
| Since 2017 (last 10 years) | 4 |
| Since 2007 (last 20 years) | 17 |
Author
| von Davier, Alina A. | 2 |
| Adedoyin, Kimberly Clark | 1 |
| Babson, Andrew | 1 |
| Bach, James V. | 1 |
| Blank, Rolf K. | 1 |
| Bolt, Sara E. | 1 |
| Brian F. French | 1 |
| Buckley, Barbara C. | 1 |
| Burke, Karen | 1 |
| Canney, George F. | 1 |
| Chan, Tsze | 1 |
Publication Type
| Journal Articles | 34 |
| Reports - Research | 14 |
| Reports - Evaluative | 12 |
| Reports - Descriptive | 5 |
| Information Analyses | 1 |
| Opinion Papers | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Higher Education | 5 |
| Elementary Education | 4 |
| Elementary Secondary Education | 4 |
| Grade 8 | 3 |
| High Schools | 3 |
| Middle Schools | 2 |
| Secondary Education | 2 |
| Adult Education | 1 |
| Grade 11 | 1 |
| Grade 4 | 1 |
| Grade 6 | 1 |
Audience
| Policymakers | 1 |
| Researchers | 1 |
Location
| New York | 2 |
| Arizona | 1 |
| California | 1 |
| Florida | 1 |
| Hong Kong | 1 |
| Idaho | 1 |
| Indonesia | 1 |
| Maryland | 1 |
| Minnesota | 1 |
| Netherlands | 1 |
| South Carolina | 1 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 4 |
| Americans with Disabilities… | 1 |
| Individuals with Disabilities… | 1 |
| Rehabilitation Act 1973… | 1 |
Assessments and Surveys
| Program for International… | 2 |
Sireci, Stephen G.; Suarez-Alvarez, Javier – Educational Measurement: Issues and Practice, 2022
The COVID-19 pandemic negatively affected the quality of data from educational testing programs. These data were previously used for many important purposes ranging from placing students in instructional programs to school accountability. In this article, we draw from the research design literature to point out the limitations inherent in…
Descriptors: Decision Making, Data Use, COVID-19, Pandemics
Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data
Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024
Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…
Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests
Xue, Kang; Huggins-Manley, Anne Corinne; Leite, Walter – Educational and Psychological Measurement, 2022
In data collected from virtual learning environments (VLEs), item response theory (IRT) models can be used to guide the ongoing measurement of student ability. However, such applications of IRT rely on unbiased item parameter estimates associated with test items in the VLE. Without formal piloting of the items, one can expect a large amount of…
Descriptors: Virtual Classrooms, Artificial Intelligence, Item Response Theory, Item Analysis
Sellar, Sam – Critical Studies in Education, 2015
This article explores the relationship between commensuration and affect in various contexts of education policy. Commensuration is the process through which disparate qualities are transformed into a common metric and is central to the production of performance data. The rise of governance through numbers in education has resulted in a…
Descriptors: Educational Policy, Data, Governance, Psychological Patterns
Yee, Mary – Teachers College Record, 2015
This study constitutes the secondary analysis of data collected as part of classroom instruction in a prior practitioner inquiry study. Consequently, IRB approval, parental consent, and participant assent for the present study were obtained after the conclusion of the original study.
Descriptors: English Language Learners, Classroom Techniques, Inquiry, Educational Legislation
Chow, Kui Foon; Kennedy, Kerry John – Educational Research and Evaluation, 2014
International large-scale assessments are now part of the educational landscape in many countries and often feed into major policy decisions. Yet, such assessments also provide data sets for secondary analysis that can address key issues of concern to educators and policymakers alike. Traditionally, such secondary analyses have been based on a…
Descriptors: Measurement, Data Analysis, Educational Assessment, Multivariate Analysis
Stohlman, Trey – Journal of the Scholarship of Teaching and Learning, 2015
A good assessment plan combines many direct and indirect measures to validate the collected data. One often controversial assessment measure comes in the form of retention exams. Although assessment retention exams may come with faults, others advocate for their inclusion in program assessment. Objective-based tests may offer insight to…
Descriptors: Alternative Assessment, Retention (Psychology), Program Evaluation, Program Effectiveness
Cronin, John; Jensen, Nate – Phi Delta Kappan, 2014
When New York state released the first results of the exams under the Common Core State Standards, many wrongly believed that the results showed dramatic declines in student achievement. A closer look at the results showed that student achievement may have increased. Another lesson from the exams is that states need to closely coordinate new data…
Descriptors: Academic Achievement, State Standards, Core Curriculum, Achievement Gains
Conti, Maria; LaMance, Rachel; Miller-Cochran, Susan – Composition Forum, 2017
To address the needs and interests of primary stakeholders in a writing program, this article presents a model of "grassroots" assessment that involves instructors from all ranks as well as students in the development, facilitation, and interpretation of assessment results. The authors describe two assessment plans that measured student…
Descriptors: Writing Improvement, Needs Assessment, Stakeholders, Student Needs
Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – Applied Measurement in Education, 2011
The synthetic function is a weighted average of the identity (the linking function for forms that are known to be completely parallel) and a traditional equating method. The purpose of the present study was to investigate the benefits of the synthetic function on small-sample equating using various real data sets gathered from different…
Descriptors: Testing Programs, Equated Scores, Investigations, Data Analysis
von Davier, Alina A. – ETS Research Report Series, 2012
Maintaining comparability of test scores is a major challenge faced by testing programs that have almost continuous administrations. Among the potential problems are scale drift and rapid accumulation of errors. Many standard quality control techniques for testing programs, which can effectively detect and address scale drift for small numbers of…
Descriptors: Quality Control, Data Analysis, Trend Analysis, Scaling
Quellmalz, Edys S.; Timms, Michael J.; Silberglitt, Matt D.; Buckley, Barbara C. – Journal of Research in Science Teaching, 2012
This article reports on the collaboration of six states to study how simulation-based science assessments can become transformative components of multi-level, balanced state science assessment systems. The project studied the psychometric quality, feasibility, and utility of simulation-based science assessments designed to serve formative purposes…
Descriptors: State Programs, Educational Assessment, Simulated Environment, Grade 6
Wagner, Daniel A.; Babson, Andrew; Murphy, Katie M. – Current Issues in Comparative Education, 2011
Timely and credible data on student learning has become a global issue in the ongoing effort to improve educational outcomes. With the potential to serve as a powerful diagnostic tool to gauge the overall health and well-being of an educational system, educational assessments have received increasing attention among specialists and the media.…
Descriptors: Low Income, Educational Objectives, Outcomes of Education, Educational Change
Meyers, Jason L.; Miller, G. Edward; Way, Walter D. – Applied Measurement in Education, 2009
In operational testing programs using item response theory (IRT), item parameter invariance is threatened when an item appears in a different location on the live test than it did when it was field tested. This study utilizes data from a large state's assessments to model change in Rasch item difficulty (RID) as a function of item position change,…
Descriptors: Test Items, Test Content, Testing Programs, Simulation
Cohen, Jon; Chan, Tsze; Jiang, Tao; Seburn, Mary – Applied Psychological Measurement, 2008
U.S. state educational testing programs administer tests to track student progress and hold schools accountable for educational outcomes. Methods from item response theory, especially Rasch models, are usually used to equate different forms of a test. The most popular method for estimating Rasch models yields inconsistent estimates and relies on…
Descriptors: Testing Programs, Educational Testing, Item Response Theory, Computation

