Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 2 |
| Since 2007 (last 20 years) | 10 |
Descriptor
| Test Content | 14 |
| Test Length | 14 |
| Computer Assisted Testing | 6 |
| Test Construction | 6 |
| Test Items | 6 |
| Scores | 5 |
| Test Format | 4 |
| Academic Achievement | 3 |
| Alternative Assessment | 3 |
| Comparative Analysis | 3 |
| Evaluation Methods | 3 |
| More ▼ | |
Source
Author
| Ankenman, Robert D. | 1 |
| Camilli, Gregory | 1 |
| Chen, Shu-Ying | 1 |
| Chien, Yuehmei | 1 |
| Cui, Zhongmin | 1 |
| Dorans, Neil | 1 |
| Egley, Robert J. | 1 |
| Hambleton, Ronald K. | 1 |
| Jacob, Brian A. | 1 |
| Jones, Brett D. | 1 |
| Kolen, Michael J. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 11 |
| Reports - Research | 6 |
| Reports - Evaluative | 5 |
| Guides - Non-Classroom | 1 |
| Reports - Descriptive | 1 |
| Reports - General | 1 |
| Speeches/Meeting Papers | 1 |
Education Level
| Elementary Education | 2 |
| Elementary Secondary Education | 2 |
| Higher Education | 1 |
| Middle Schools | 1 |
| Postsecondary Education | 1 |
| Two Year Colleges | 1 |
Audience
| Practitioners | 1 |
Location
| Florida | 1 |
| Rhode Island | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| Florida Comprehensive… | 1 |
What Works Clearinghouse Rating
Timothy S. Faith – Teaching and Learning Excellence through Scholarship, 2024
This study compared traditional methods of college-level instruction, including lecture and class discussion followed by assessment via course content exams, with a variety of other instructional techniques. The intent was to evaluate whether more contemporary instructional techniques are significantly correlated with improved average exam scores…
Descriptors: Community College Students, Business Administration Education, Teaching Methods, Alternative Assessment
Luo, Xiao; Wang, Xinrui – International Journal of Testing, 2019
This study introduced dynamic multistage testing (dy-MST) as an improvement to existing adaptive testing methods. dy-MST combines the advantages of computerized adaptive testing (CAT) and computerized adaptive multistage testing (ca-MST) to create a highly efficient and regulated adaptive testing method. In the test construction phase, multistage…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Construction, Psychometrics
Lin, Peng; Dorans, Neil; Weeks, Jonathan – ETS Research Report Series, 2016
The nonequivalent groups with anchor test (NEAT) design is frequently used in test score equating or linking. One important assumption of the NEAT design is that the anchor test is a miniversion of the 2 tests to be equated/linked. When the content of the 2 tests is different, it is not possible for the anchor test to be adequately representative…
Descriptors: Equated Scores, Test Length, Test Content, Difficulty Level
Jacob, Brian A. – Center on Children and Families at Brookings, 2016
Contrary to popular belief, modern cognitive assessments--including the new Common Core tests--produce test scores based on sophisticated statistical models rather than the simple percent of items a student answers correctly. While there are good reasons for this, it means that reported test scores depend on many decisions made by test designers,…
Descriptors: Scores, Common Core State Standards, Test Length, Test Content
Shin, Chingwei David; Chien, Yuehmei; Way, Walter Denny – Pearson, 2012
Content balancing is one of the most important components in the computerized adaptive testing (CAT) especially in the K to 12 large scale tests that complex constraint structure is required to cover a broad spectrum of content. The purpose of this study is to compare the weighted penalty model (WPM) and the weighted deviation method (WDM) under…
Descriptors: Computer Assisted Testing, Elementary Secondary Education, Test Content, Models
Lin, Chuan-Ju – Journal of Technology, Learning, and Assessment, 2010
Assembling equivalent test forms with minimal test overlap across forms is important in ensuring test security. Chen and Lei (2009) suggested a exposure control technique to control test overlap-ordered item pooling on the fly based on the essence that test overlap rate--ordered item pooling for the first t examinees is a function of test overlap…
Descriptors: Test Length, Test Format, Evaluation Criteria, Psychometrics
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Cui, Zhongmin; Kolen, Michael J. – Applied Psychological Measurement, 2008
This article considers two methods of estimating standard errors of equipercentile equating: the parametric bootstrap method and the nonparametric bootstrap method. Using a simulation study, these two methods are compared under three sample sizes (300, 1,000, and 3,000), for two test content areas (the Iowa Tests of Basic Skills Maps and Diagrams…
Descriptors: Test Length, Test Content, Simulation, Computation
Rotou, Ourania; Patsula, Liane; Steffen, Manfred; Rizavi, Saba – ETS Research Report Series, 2007
Traditionally, the fixed-length linear paper-and-pencil (P&P) mode of administration has been the standard method of test delivery. With the advancement of technology, however, the popularity of administering tests using adaptive methods like computerized adaptive testing (CAT) and multistage testing (MST) has grown in the field of measurement…
Descriptors: Comparative Analysis, Test Format, Computer Assisted Testing, Models
Rhode Island Department of Elementary and Secondary Education, 2007
This handbook will assist principals and school testing coordinators in implementing the spring 2007 administration of the Developmental Reading Assessment (DRA). Information regarding administration timeline, reporting, process, online tools and contact personnel is discussed. Contents include: (1) Scheduling; (2) Identify Primary Test…
Descriptors: Testing Accommodations, Alternative Assessment, Educational Testing, Guidance Programs
Chen, Shu-Ying; Ankenman, Robert D. – Journal of Educational Measurement, 2004
The purpose of this study was to compare the effects of four item selection rules--(1) Fisher information (F), (2) Fisher information with a posterior distribution (FP), (3) Kullback-Leibler information with a posterior distribution (KP), and (4) completely randomized item selection (RN)--with respect to the precision of trait estimation and the…
Descriptors: Test Length, Adaptive Testing, Computer Assisted Testing, Test Selection
Peer reviewedLinn, Robert L.; Hambleton, Ronald K. – Applied Measurement in Education, 1991
Four main approaches to customized testing are described, and their resulting scores' valid uses and interpretations are discussed. Customized testing can yield valid normative and curriculum-specific information, although cautious application is needed to avoid misleading inferences about student achievement. (SLD)
Descriptors: Academic Achievement, Accountability, Criterion Referenced Tests, Curriculum
Peer reviewedRoeber, Edward D. – Contemporary Education, 1997
Examines reasons why student assessment is undergoing reform, discussing how such reform may affect the nation's schools. The paper describes why school reform is occurring, notes how reform of assessment fits school reform, explains types of assessments and assessment designs, and highlights practical and technical challenges inherent in using…
Descriptors: Academic Achievement, Academic Standards, Educational Change, Elementary School Students
Jones, Brett D.; Egley, Robert J. – ERS Spectrum, 2005
The purpose of this paper is to discuss Florida teachers' recommendations for improving the Florida Comprehensive Assessment Test (FCAT) and to compare their recommendations with those of Florida administrators. Although teachers' suggestions varied as to the types and extent of remedies needed to improve the FCAT, some common themes emerged. The…
Descriptors: Test Results, Core Curriculum, Student Evaluation, Accountability

Direct link
