Publication Date

| Range | Count |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 2 |
| Since 2007 (last 20 years) | 17 |

Descriptor

| Descriptor | Count |
| --- | --- |
| Item Analysis | 26 |
| Testing Programs | 26 |
| Test Items | 14 |
| Item Response Theory | 10 |
| Elementary Secondary Education | 6 |
| Test Construction | 6 |
| Evaluation Methods | 5 |
| Foreign Countries | 5 |
| Models | 4 |
| Standardized Tests | 4 |
| Test Validity | 4 |

Author

| Author | Count |
| --- | --- |
| Albano, Anthony D. | 2 |
| Ackermann, Richard | 1 |
| Breyer, F. Jay | 1 |
| Carlson, Janet F. | 1 |
| Case, Lisa Pericola | 1 |
| Childs, Ruth A. | 1 |
| Cooper, David H. | 1 |
| Davis, Robbie G. | 1 |
| Deng, Weiling | 1 |
| Diezmann, Carmel M. | 1 |
| French, Brian F. | 1 |

Publication Type

| Type | Count |
| --- | --- |
| Journal Articles | 26 |
| Reports - Research | 11 |
| Reports - Evaluative | 8 |
| Reports - Descriptive | 5 |
| Tests/Questionnaires | 1 |

Education Level

| Level | Count |
| --- | --- |
| Elementary Secondary Education | 7 |
| Secondary Education | 3 |
| Higher Education | 2 |
| Postsecondary Education | 2 |
| Adult Education | 1 |
| Early Childhood Education | 1 |
| Elementary Education | 1 |
| Grade 1 | 1 |
| Grade 11 | 1 |
| Grade 3 | 1 |
| Grade 4 | 1 |

Assessments and Surveys

| Assessment | Count |
| --- | --- |
| Graduate Record Examinations | 1 |
| National Assessment of… | 1 |
| Praxis Series | 1 |
| Program for International… | 1 |
| SAT (College Admission Test) | 1 |
| Stanford Achievement Tests | 1 |
Traditional vs Intersectional DIF Analysis: Considerations and a Comparison Using State Testing Data
Albano, Tony; French, Brian F.; Vo, Thao Thu – Applied Measurement in Education, 2024
Recent research has demonstrated an intersectional approach to the study of differential item functioning (DIF). This approach expands DIF to account for the interactions between what have traditionally been treated as separate grouping variables. In this paper, we compare traditional and intersectional DIF analyses using data from a state testing…
Descriptors: Test Items, Item Analysis, Data Use, Standardized Tests
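The traditional analysis that intersectional DIF extends is often the Mantel-Haenszel procedure: examinees are stratified on total test score, and the odds of a correct response are compared between a reference and a focal group within each stratum. A minimal sketch of that classic procedure (the function name and data layout are illustrative, not taken from the article):

```python
import numpy as np

def mantel_haenszel_dif(correct, group, total_score):
    """Mantel-Haenszel common odds ratio for one item.
    correct: 0/1 item responses; group: 0 = reference, 1 = focal;
    total_score: the stratifying (matching) variable."""
    num, den = 0.0, 0.0
    for s in np.unique(total_score):
        m = total_score == s
        a = np.sum((group[m] == 0) & (correct[m] == 1))  # reference, correct
        b = np.sum((group[m] == 0) & (correct[m] == 0))  # reference, incorrect
        c = np.sum((group[m] == 1) & (correct[m] == 1))  # focal, correct
        d = np.sum((group[m] == 1) & (correct[m] == 0))  # focal, incorrect
        n = a + b + c + d
        if n == 0:
            continue
        num += a * d / n
        den += b * c / n
    return num / den  # a value near 1.0 indicates no DIF
```

An intersectional analysis differs mainly in how the focal group is formed: from the crossing of grouping variables (e.g., gender by language status) rather than one variable at a time.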
Xue, Kang; Huggins-Manley, Anne Corinne; Leite, Walter – Educational and Psychological Measurement, 2022
In data collected from virtual learning environments (VLEs), item response theory (IRT) models can be used to guide the ongoing measurement of student ability. However, such applications of IRT rely on unbiased item parameter estimates associated with test items in the VLE. Without formal piloting of the items, one can expect a large amount of…
Descriptors: Virtual Classrooms, Artificial Intelligence, Item Response Theory, Item Analysis
Wyse, Adam E.; Albano, Anthony D. – Applied Measurement in Education, 2015
This article used several data sets from a large-scale state testing program to examine the feasibility of combining general and modified assessment items in computerized adaptive testing (CAT) for different groups of students. Results suggested that several of the assumptions made when employing this type of mixed-item CAT may not be met for…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Testing Programs
Albano, Anthony D. – Journal of Educational Measurement, 2013
In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…
Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques
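The biasing mechanism the abstract describes can be illustrated with the two-parameter logistic (2PL) IRT model: if administering an item late in a form makes it effectively harder (fatigue, time pressure), the position effect acts like a shift in the item's difficulty parameter. A sketch under that assumption (the shift `delta` is a hypothetical quantity, not an estimate from the study):

```python
import numpy as np

def p_correct(theta, a, b):
    """2PL IRT: probability of a correct response given ability theta,
    discrimination a, and difficulty b."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

# A position effect modeled as an additive difficulty shift delta:
theta, a, b, delta = 0.0, 1.2, 0.0, 0.3
early = p_correct(theta, a, b)           # item in its calibrated position
late = p_correct(theta, a, b + delta)    # same item administered later
```

Calibrating the item from responses collected in the late position, while assuming the early-position parameters, is exactly the kind of violation that biases item and person estimates.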
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
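Kish's design effect formula gives a sense of the magnitudes involved: under cluster sampling (e.g., whole classrooms), the variance of an estimate is inflated relative to a simple random sample of the same size. A small illustration (the numbers are hypothetical):

```python
def design_effect(cluster_size, icc):
    """Kish's design effect for cluster sampling: variance inflation
    relative to a simple random sample of the same total size."""
    return 1.0 + (cluster_size - 1) * icc

def effective_n(n, deff):
    """Effective sample size after accounting for the design effect."""
    return n / deff

# 25 students per classroom, intraclass correlation 0.2:
deff = design_effect(25, 0.2)       # 5.8; standard errors inflate by sqrt(deff)
n_eff = effective_n(1000, deff)     # 1,000 sampled students act like ~172
```

Ignoring the design effect, as the article argues, means treating the full 1,000 as independent observations and so understating the sampling error in item and test statistics.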
Multimodal Reading Comprehension: Curriculum Expectations and Large-Scale Literacy Testing Practices
Unsworth, Len – Pedagogies: An International Journal, 2014
Interpreting the image-language interface in multimodal texts is now well recognized as a crucial aspect of reading comprehension in a number of official school syllabi such as the recently published Australian Curriculum: English (ACE). This article outlines the relevant expected student learning outcomes in this curriculum and draws attention to…
Descriptors: Foreign Countries, National Curriculum, Reading Comprehension, Reading Tests
Guo, Hongwen; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2011
Nonparametric or kernel regression estimation of item response curves (IRCs) is often used in item analysis in testing programs. These estimates are biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. Accuracy of this estimation is a concern theoretically and operationally.…
Descriptors: Testing Programs, Measurement, Item Analysis, Error of Measurement
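A common form of the kernel regression estimator for item response curves is the Nadaraya-Watson smoother with the observed total score as the regressor, which is the practice whose measurement-error bias the article examines. A minimal sketch (function name and defaults are illustrative):

```python
import numpy as np

def kernel_irc(scores, responses, grid, bandwidth=1.0):
    """Nadaraya-Watson kernel estimate of an item response curve:
    P(correct | score) evaluated at each point of `grid`, using a
    Gaussian kernel over the observed scores."""
    grid = np.asarray(grid, dtype=float)[:, None]       # shape (G, 1)
    scores = np.asarray(scores, dtype=float)[None, :]   # shape (1, N)
    w = np.exp(-0.5 * ((grid - scores) / bandwidth) ** 2)
    return (w @ np.asarray(responses, dtype=float)) / w.sum(axis=1)
```

Because the observed score contains measurement error, regressing on it attenuates the estimated curve; the article's concern is the size of that bias in operational item analysis.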
Li, Ying; Jiao, Hong; Lissitz, Robert W. – Journal of Applied Testing Technology, 2012
This study investigated the application of multidimensional item response theory (IRT) models to validate test structure and dimensionality. Multiple content areas or domains within a single subject often exist in large-scale achievement tests. Such areas or domains may cause multidimensionality or local item dependence, which both violate the…
Descriptors: Achievement Tests, Science Tests, Item Response Theory, Measures (Individuals)
Carlson, Janet F.; Geisinger, Kurt F. – International Journal of Testing, 2012
The test review process used by the Buros Center for Testing is described as a series of 11 steps: (1) identifying tests to be reviewed, (2) obtaining tests and preparing test descriptions, (3) determining whether tests meet review criteria, (4) identifying appropriate reviewers, (5) selecting reviewers, (6) sending instructions and materials to…
Descriptors: Testing, Test Reviews, Evaluation Methods, Evaluation Criteria
Marushina, Albina – Journal of Mathematics Education at Teachers College, 2012
This paper aims to tell how the Russian national examination in mathematics (the Uniform State Examination or USE) has been conducted most recently. The author must say at once that the history of the system of secondary school graduation examinations or even the history of the USE will be covered only to the small degree that is necessary for…
Descriptors: Foreign Countries, Mathematics Tests, National Competency Tests, Secondary School Mathematics
Moses, Tim; Liu, Jinghua; Tan, Adele; Deng, Weiling; Dorans, Neil J. – ETS Research Report Series, 2013
In this study, differential item functioning (DIF) methods utilizing 14 different matching variables were applied to assess DIF in the constructed-response (CR) items from 6 forms of 3 mixed-format tests. Results suggested that the methods might produce distinct patterns of DIF results for different tests and testing programs, in that the DIF…
Descriptors: Test Construction, Multiple Choice Tests, Test Items, Item Analysis
Somerset, Anthony – Compare: A Journal of Comparative and International Education, 2011
Educational practitioners rely predominantly on measures of outcome, rather than of inputs or process, in making judgements as to quality. Outcome measures are available from two main sources: (1) the relatively new international assessment systems; and (2) the traditional national examinations systems. The two types of system differ in their…
Descriptors: Testing Programs, Educational Quality, National Competency Tests, Educational Improvement
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Jacobsen, Jared; Ackermann, Richard; Eguez, Jane; Ganguli, Debalina; Rickard, Patricia; Taylor, Linda – Journal of Applied Testing Technology, 2011
A computer adaptive test (CAT) is a delivery methodology that serves the larger goals of the assessment system in which it is embedded. A thorough analysis of the assessment system for which a CAT is being designed is critical to ensure that the delivery platform is appropriate and addresses all relevant complexities. As such, a CAT engine must be…
Descriptors: Delivery Systems, Testing Programs, Computer Assisted Testing, Foreign Countries
Speece, Deborah L.; Schatschneider, Christopher; Silverman, Rebecca; Case, Lisa Pericola; Cooper, David H.; Jacobs, Dawn M. – Elementary School Journal, 2011
Models of Response to Intervention (RTI) include parameters of assessment and instruction. This study focuses on assessment with the purpose of developing a screening battery that validly and efficiently identifies first-grade children at risk for reading problems. In an RTI model, these children would be candidates for early intervention. We…
Descriptors: Reading Difficulties, Early Intervention, Grade 1, Response to Intervention
