ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	3
Since 2017 (last 10 years)	4
Since 2007 (last 20 years)	17

Descriptor

Testing Programs	34
Data Analysis	17
Academic Achievement	7
Data Collection	7
Educational Assessment	7
Item Response Theory	7
Standardized Tests	7
Student Evaluation	6
Accountability	5
Educational Testing	5
Evaluation Methods	5
Achievement Tests	4
Data	4
Educational Change	4
Educational Legislation	4
Elementary Secondary Education	4
Federal Legislation	4
High Stakes Tests	4
Program Effectiveness	4
Program Evaluation	4
Psychometrics	4
Scores	4
State Standards	4
Test Items	4
Testing	4
More ▼

Publication Type

Journal Articles	34
Reports - Research	14
Reports - Evaluative	12
Reports - Descriptive	5
Information Analyses	1
Opinion Papers	1
Tests/Questionnaires	1

Education Level

Higher Education	5
Elementary Education	4
Elementary Secondary Education	4
Grade 8	3
High Schools	3
Middle Schools	2
Secondary Education	2
Adult Education	1
Grade 11	1
Grade 4	1
Grade 6	1
Grade 9	1
Intermediate Grades	1
Junior High Schools	1
Postsecondary Education	1
More ▼

Audience

Policymakers	1
Researchers	1

Location

New York	2
Arizona	1
California	1
Florida	1
Hong Kong	1
Idaho	1
Indonesia	1
Maryland	1
Minnesota	1
Netherlands	1
South Carolina	1
South Korea	1
Taiwan	1
Texas	1
Thailand	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	4
Americans with Disabilities…	1
Individuals with Disabilities…	1
Rehabilitation Act 1973…	1

Assessments and Surveys

Program for International…

What Works Clearinghouse Rating

Showing 1 to 15 of 34 results Save | Export

Deriving Decisions from Disrupted Data

Peer reviewed

Direct link

Sireci, Stephen G.; Suarez-Alvarez, Javier – Educational Measurement: Issues and Practice, 2022

The COVID-19 pandemic negatively affected the quality of data from educational testing programs. These data were previously used for many important purposes ranging from placing students in instructional programs to school accountability. In this article, we draw from the research design literature to point out the limitations inherent in…

Descriptors: Decision Making, Data Use, COVID-19, Pandemics

Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data

Peer reviewed

Direct link

Tony Albano; Brian F. French; Thao Thu Vo – Applied Measurement in Education, 2024

Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…

Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests

Semisupervised Learning Method to Adjust Biased Item Difficulty Estimates Caused by Nonignorable Missingness in a Virtual Learning Environment

Peer reviewed
PDF on ERIC

Download full text

Direct link

Xue, Kang; Huggins-Manley, Anne Corinne; Leite, Walter – Educational and Psychological Measurement, 2022

In data collected from virtual learning environments (VLEs), item response theory (IRT) models can be used to guide the ongoing measurement of student ability. However, such applications of IRT rely on unbiased item parameter estimates associated with test items in the VLE. Without formal piloting of the items, one can expect a large amount of…

Descriptors: Virtual Classrooms, Artificial Intelligence, Item Response Theory, Item Analysis

A Feel for Numbers: Affect, Data and Education Policy

Peer reviewed

Direct link

Sellar, Sam – Critical Studies in Education, 2015

This article explores the relationship between commensuration and affect in various contexts of education policy. Commensuration is the process through which disparate qualities are transformed into a common metric and is central to the production of performance data. The rise of governance through numbers in education has resulted in a…

Descriptors: Educational Policy, Data, Governance, Psychological Patterns

What English Language Learners Have to Say about NCLB Testing

Peer reviewed

Direct link

Yee, Mary – Teachers College Record, 2015

This study constitutes the secondary analysis of data collected as part of classroom instruction in a prior practitioner inquiry study. Consequently, IRB approval, parental consent, and participant assent for the present study were obtained after the conclusion of the original study.

Descriptors: English Language Learners, Classroom Techniques, Inquiry, Educational Legislation

Secondary Analysis of Large-Scale Assessment Data: An Alternative to Variable-Centred Analysis

Peer reviewed

Direct link

Chow, Kui Foon; Kennedy, Kerry John – Educational Research and Evaluation, 2014

International large-scale assessments are now part of the educational landscape in many countries and often feed into major policy decisions. Yet, such assessments also provide data sets for secondary analysis that can address key issues of concern to educators and policymakers alike. Traditionally, such secondary analyses have been based on a…

Descriptors: Measurement, Data Analysis, Educational Assessment, Multivariate Analysis

The Road to Redemption: Reclaiming the Value in Assessment Retention Exams

Peer reviewed
PDF on ERIC

Download full text

Stohlman, Trey – Journal of the Scholarship of Teaching and Learning, 2015

A good assessment plan combines many direct and indirect measures to validate the collected data. One often controversial assessment measure comes in the form of retention exams. Although assessment retention exams may come with faults, others advocate for their inclusion in program assessment. Objective-based tests may offer insight to…

Descriptors: Alternative Assessment, Retention (Psychology), Program Evaluation, Program Effectiveness

The Phantom Collapse of Student Achievement in New York

Direct link

Cronin, John; Jensen, Nate – Phi Delta Kappan, 2014

When New York state released the first results of the exams under the Common Core State Standards, many wrongly believed that the results showed dramatic declines in student achievement. A closer look at the results showed that student achievement may have increased. Another lesson from the exams is that states need to closely coordinate new data…

Descriptors: Academic Achievement, State Standards, Core Curriculum, Achievement Gains

Cultivating Change from the Ground Up: Developing a Grassroots Programmatic Assessment

Peer reviewed
PDF on ERIC

Download full text

Conti, Maria; LaMance, Rachel; Miller-Cochran, Susan – Composition Forum, 2017

To address the needs and interests of primary stakeholders in a writing program, this article presents a model of "grassroots" assessment that involves instructors from all ranks as well as students in the development, facilitation, and interpretation of assessment results. The authors describe two assessment plans that measured student…

Descriptors: Writing Improvement, Needs Assessment, Stakeholders, Student Needs

Practical Application of a Synthetic Linking Function on Small-Sample Equating

Peer reviewed

Direct link

Kim, Sooyeon; von Davier, Alina A.; Haberman, Shelby – Applied Measurement in Education, 2011

The synthetic function is a weighted average of the identity (the linking function for forms that are known to be completely parallel) and a traditional equating method. The purpose of the present study was to investigate the benefits of the synthetic function on small-sample equating using various real data sets gathered from different…

Descriptors: Testing Programs, Equated Scores, Investigations, Data Analysis

The Use of Quality Control and Data Mining Techniques for Monitoring Scaled Scores: An Overview. Research Report. ETS RR-12-20

Peer reviewed
PDF on ERIC

Download full text

von Davier, Alina A. – ETS Research Report Series, 2012

Maintaining comparability of test scores is a major challenge faced by testing programs that have almost continuous administrations. Among the potential problems are scale drift and rapid accumulation of errors. Many standard quality control techniques for testing programs, which can effectively detect and address scale drift for small numbers of…

Descriptors: Quality Control, Data Analysis, Trend Analysis, Scaling

Science Assessments for All: Integrating Science Simulations into Balanced State Science Assessment Systems

Peer reviewed

Direct link

Quellmalz, Edys S.; Timms, Michael J.; Silberglitt, Matt D.; Buckley, Barbara C. – Journal of Research in Science Teaching, 2012

This article reports on the collaboration of six states to study how simulation-based science assessments can become transformative components of multi-level, balanced state science assessment systems. The project studied the psychometric quality, feasibility, and utility of simulation-based science assessments designed to serve formative purposes…

Descriptors: State Programs, Educational Assessment, Simulated Environment, Grade 6

How Much Is Learning Measurement Worth? Assessment Costs in Low-Income Countries

Peer reviewed
PDF on ERIC

Download full text

Wagner, Daniel A.; Babson, Andrew; Murphy, Katie M. – Current Issues in Comparative Education, 2011

Timely and credible data on student learning has become a global issue in the ongoing effort to improve educational outcomes. With the potential to serve as a powerful diagnostic tool to gauge the overall health and well-being of an educational system, educational assessments have received increasing attention among specialists and the media.…

Descriptors: Low Income, Educational Objectives, Outcomes of Education, Educational Change

Item Position and Item Difficulty Change in an IRT-Based Common Item Equating Design

Peer reviewed

Direct link

Meyers, Jason L.; Miller, G. Edward; Way, Walter D. – Applied Measurement in Education, 2009

In operational testing programs using item response theory (IRT), item parameter invariance is threatened when an item appears in a different location on the live test than it did when it was field tested. This study utilizes data from a large state's assessments to model change in Rasch item difficulty (RID) as a function of item position change,…

Descriptors: Test Items, Test Content, Testing Programs, Simulation

Consistent Estimation of Rasch Item Parameters and Their Standard Errors under Complex Sample Designs

Peer reviewed

Direct link

Cohen, Jon; Chan, Tsze; Jiang, Tao; Seburn, Mary – Applied Psychological Measurement, 2008

U.S. state educational testing programs administer tests to track student progress and hold schools accountable for educational outcomes. Methods from item response theory, especially Rasch models, are usually used to equate different forms of a test. The most popular method for estimating Rasch models yields inconsistent estimates and relies on…

Descriptors: Testing Programs, Educational Testing, Item Response Theory, Computation

Previous Page | Next Page »

Pages: 1 | 2 | 3

Applied Measurement in…	4
Applied Psychological…	2
Educational Evaluation and…	2
College Student Journal	1
Composition Forum	1
Critical Studies in Education	1
Current Issues in Comparative…	1
Diagnostique	1
ERS Spectrum	1
ETS Research Report Series	1
Educational Measurement:…	1
Educational Research and…	1
Educational and Psychological…	1
Journal of Applied Testing…	1
Journal of Developmental…	1
Journal of General Education	1
Journal of Research in…	1
Journal of Teacher Education	1
Journal of Teaching in…	1
Journal of the Scholarship of…	1
Leadership	1
NCME Measurement in Education	1
Online Submission	1
Phi Delta Kappan	1
Planning and Changing	1
More ▼

von Davier, Alina A.	2
Adedoyin, Kimberly Clark	1
Babson, Andrew	1
Bach, James V.	1
Blank, Rolf K.	1
Bolt, Sara E.	1
Brian F. French	1
Buckley, Barbara C.	1
Burke, Karen	1
Canney, George F.	1
Chan, Tsze	1
Chow, Kui Foon	1
Cohen, Jon	1
Connolly, Faith	1
Conti, Maria	1
Crawford, Carolyn	1
Cronin, John	1
Espenshade, Pamela H.	1
Ewell, Peter T.	1
Foord, Kathleen A.	1
Griffee, Dale T.	1
Haberman, Shelby	1
Haertel, Edward	1
Harris, Sandra	1
More ▼