ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	10

Descriptor

Test Content	14
Test Length	14
Computer Assisted Testing	6
Test Construction	6
Test Items	6
Scores	5
Test Format	4
Academic Achievement	3
Alternative Assessment	3
Comparative Analysis	3
Evaluation Methods	3
Simulation	3
Student Evaluation	3
Accountability	2
Adaptive Testing	2
Correlation	2
Elementary Secondary Education	2
Equated Scores	2
Error of Measurement	2
Evaluation Criteria	2
Evaluation Research	2
Item Analysis	2
Mathematics Tests	2
Models	2
Prediction	2
More ▼

Source

ETS Research Report Series	2
Applied Measurement in…	1
Applied Psychological…	1
Center on Children and…	1
Contemporary Education	1
ERS Spectrum	1
Educational Research and…	1
International Journal of…	1
Journal of Educational…	1
Journal of Technology,…	1
Pearson	1
Rhode Island Department of…	1
Teaching and Learning…	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	6
Reports - Evaluative	5
Guides - Non-Classroom	1
Reports - Descriptive	1
Reports - General	1
Speeches/Meeting Papers	1

Education Level

Elementary Education	2
Elementary Secondary Education	2
Higher Education	1
Middle Schools	1
Postsecondary Education	1
Two Year Colleges	1

Audience

Practitioners

Location

Florida	1
Rhode Island	1

Laws, Policies, & Programs

Assessments and Surveys

Florida Comprehensive…

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Impact of Selected Teaching Techniques on Exam Performance in Business Law

Peer reviewed
PDF on ERIC

Download full text

Timothy S. Faith – Teaching and Learning Excellence through Scholarship, 2024

This study compared traditional methods of college-level instruction, including lecture and class discussion followed by assessment via course content exams, with a variety of other instructional techniques. The intent was to evaluate whether more contemporary instructional techniques are significantly correlated with improved average exam scores…

Descriptors: Community College Students, Business Administration Education, Teaching Methods, Alternative Assessment

Dynamic Multistage Testing: A Highly Efficient and Regulated Adaptive Testing Method

Peer reviewed

Direct link

Luo, Xiao; Wang, Xinrui – International Journal of Testing, 2019

This study introduced dynamic multistage testing (dy-MST) as an improvement to existing adaptive testing methods. dy-MST combines the advantages of computerized adaptive testing (CAT) and computerized adaptive multistage testing (ca-MST) to create a highly efficient and regulated adaptive testing method. In the test construction phase, multistage…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Construction, Psychometrics

Linking Composite Scores: Effects of Anchor Test Length and Content Representativeness. Research Report. ETS RR-16-36

Peer reviewed
PDF on ERIC

Download full text

Lin, Peng; Dorans, Neil; Weeks, Jonathan – ETS Research Report Series, 2016

The nonequivalent groups with anchor test (NEAT) design is frequently used in test score equating or linking. One important assumption of the NEAT design is that the anchor test is a miniversion of the 2 tests to be equated/linked. When the content of the 2 tests is different, it is not possible for the anchor test to be adequately representative…

Descriptors: Equated Scores, Test Length, Test Content, Difficulty Level

Student Test Scores: How the Sausage Is Made and Why You Should Care. Evidence Speaks Reports, Vol 1, #25

Direct link

Jacob, Brian A. – Center on Children and Families at Brookings, 2016

Contrary to popular belief, modern cognitive assessments--including the new Common Core tests--produce test scores based on sophisticated statistical models rather than the simple percent of items a student answers correctly. While there are good reasons for this, it means that reported test scores depend on many decisions made by test designers,…

Descriptors: Scores, Common Core State Standards, Test Length, Test Content

A Comparison of Three Content Balancing Methods for Fixed and Variable Length Computerized Adaptive Tests

Direct link

Shin, Chingwei David; Chien, Yuehmei; Way, Walter Denny – Pearson, 2012

Content balancing is one of the most important components in the computerized adaptive testing (CAT) especially in the K to 12 large scale tests that complex constraint structure is required to cover a broad spectrum of content. The purpose of this study is to compare the weighted penalty model (WPM) and the weighted deviation method (WDM) under…

Descriptors: Computer Assisted Testing, Elementary Secondary Education, Test Content, Models

Controlling Test Overlap Rate in Automated Assembly of Multiple Equivalent Test Forms

Peer reviewed
PDF on ERIC

Download full text

Lin, Chuan-Ju – Journal of Technology, Learning, and Assessment, 2010

Assembling equivalent test forms with minimal test overlap across forms is important in ensuring test security. Chen and Lei (2009) suggested a exposure control technique to control test overlap-ordered item pooling on the fly based on the essence that test overlap rate--ordered item pooling for the first t examinees is a function of test overlap…

Descriptors: Test Length, Test Format, Evaluation Criteria, Psychometrics

Ongoing Issues in Test Fairness

Peer reviewed

Direct link

Camilli, Gregory – Educational Research and Evaluation, 2013

In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…

Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format

Comparison of Parametric and Nonparametric Bootstrap Methods for Estimating Random Error in Equipercentile Equating

Peer reviewed

Direct link

Cui, Zhongmin; Kolen, Michael J. – Applied Psychological Measurement, 2008

This article considers two methods of estimating standard errors of equipercentile equating: the parametric bootstrap method and the nonparametric bootstrap method. Using a simulation study, these two methods are compared under three sample sizes (300, 1,000, and 3,000), for two test content areas (the Iowa Tests of Basic Skills Maps and Diagrams…

Descriptors: Test Length, Test Content, Simulation, Computation

Comparison of Multistage Tests with Computerized Adaptive and Paper-and-Pencil Tests. Research Report. ETS RR-07-04

Peer reviewed
PDF on ERIC

Download full text

Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007

Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…

Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models

Rhode Island State Assessment Program District and School Testing Coordinators Handbook: K-1 Assessment Program

Download full text

Rhode Island Department of Elementary and Secondary Education, 2007

This handbook will assist principals and school testing coordinators in implementing the spring 2007 administration of the Developmental Reading Assessment (DRA). Information regarding administration timeline, reporting, process, online tools and contact personnel is discussed. Contents include: (1) Scheduling; (2) Identify Primary Test…

Descriptors: Testing Accommodations, Alternative Assessment, Educational Testing, Guidance Programs

Effects of Practical Constraints on Item Selection Rules at the Early Stages of Computerized Adaptive Testing

Peer reviewed

Direct link

Chen, Shu-Ying; Ankenman, Robert D. – Journal of Educational Measurement, 2004

The purpose of this study was to compare the effects of four item selection rules--(1) Fisher information (F), (2) Fisher information with a posterior distribution (FP), (3) Kullback-Leibler information with a posterior distribution (KP), and (4) completely randomized item selection (RN)--with respect to the precision of trait estimation and the…

Descriptors: Test Length, Adaptive Testing, Computer Assisted Testing, Test Selection

Customized Tests and Customized Norms.

Peer reviewed

Linn, Robert L.; Hambleton, Ronald K. – Applied Measurement in Education, 1991

Four main approaches to customized testing are described, and their resulting scores' valid uses and interpretations are discussed. Customized testing can yield valid normative and curriculum-specific information, although cautious application is needed to avoid misleading inferences about student achievement. (SLD)

Descriptors: Academic Achievement, Accountability, Criterion Referenced Tests, Curriculum

The Technical and Practical Challenges in Developing Innovative Assessment Approaches for Use in Statewide Assessment Programs.

Peer reviewed

Roeber, Edward D. – Contemporary Education, 1997

Examines reasons why student assessment is undergoing reform, discussing how such reform may affect the nation's schools. The paper describes why school reform is occurring, notes how reform of assessment fits school reform, explains types of assessments and assessment designs, and highlights practical and technical challenges inherent in using…

Descriptors: Academic Achievement, Academic Standards, Educational Change, Elementary School Students

Go Back and Check Your Work: Recommendations for Improving Florida's Accountability System

Peer reviewed

Direct link

Jones, Brett D.; Egley, Robert J. – ERS Spectrum, 2005

The purpose of this paper is to discuss Florida teachers' recommendations for improving the Florida Comprehensive Assessment Test (FCAT) and to compare their recommendations with those of Florida administrators. Although teachers' suggestions varied as to the types and extent of remedies needed to improve the FCAT, some common themes emerged. The…

Descriptors: Test Results, Core Curriculum, Student Evaluation, Accountability

Ankenman, Robert D.	1
Camilli, Gregory	1
Chen, Shu-Ying	1
Chien, Yuehmei	1
Cui, Zhongmin	1
Dorans, Neil	1
Egley, Robert J.	1
Hambleton, Ronald K.	1
Jacob, Brian A.	1
Jones, Brett D.	1
Kolen, Michael J.	1
Lin, Chuan-Ju	1
Lin, Peng	1
Linn, Robert L.	1
Luo, Xiao	1
Patsula, Liane	1
Rizavi, Saba	1
Roeber, Edward D.	1
Rotou, Ourania	1
Shin, Chingwei David	1
Steffen, Manfred	1
Timothy S. Faith	1
Wang, Xinrui	1
Way, Walter Denny	1
Weeks, Jonathan	1
More ▼