Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 5 |
Descriptor
Source
| National Center for Education… | 3 |
| Applied Psychological… | 1 |
| ETS Research Report Series | 1 |
| Journal of Educational and… | 1 |
| Journal of University… | 1 |
| Language Testing | 1 |
| ProQuest LLC | 1 |
| Spectrum | 1 |
Author
| DiLuzio, Geneva J. | 4 |
| Dishaw, Marilyn | 2 |
| Filby, Nikola N. | 2 |
| Reckase, Mark D. | 2 |
| Samejima, Fumiko | 2 |
| Ahmed, S. | 1 |
| Andrew D. Ho | 1 |
| Baxter, G. P. | 1 |
| Bower, Ruth | 1 |
| Chen, Shu-Ying | 1 |
| Clarke, S. C. T. | 1 |
| More ▼ | |
Publication Type
Education Level
| Postsecondary Education | 3 |
| Grade 4 | 1 |
| Grade 5 | 1 |
| Grade 6 | 1 |
| Grade 7 | 1 |
| Grade 8 | 1 |
| Higher Education | 1 |
Audience
| Researchers | 3 |
Location
| Canada | 1 |
| Puerto Rico | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| Beginning Postsecondary… | 2 |
| ACT Assessment | 1 |
| California Achievement Tests | 1 |
| California Test of Mental… | 1 |
| Iowa Tests of Basic Skills | 1 |
| Stanford Achievement Tests | 1 |
What Works Clearinghouse Rating
Andrew D. Ho – Journal of Educational and Behavioral Statistics, 2024
I review opportunities and threats that widely accessible Artificial Intelligence (AI)-powered services present for educational statistics and measurement. Algorithmic and computational advances continue to improve approaches to item generation, scale maintenance, test security, test scoring, and score reporting. Predictable misuses of AI for…
Descriptors: Artificial Intelligence, Measurement, Educational Assessment, Technology Uses in Education
Xiang, Rui – ProQuest LLC, 2013
A key issue of cognitive diagnostic models (CDMs) is the correct identification of Q-matrix which indicates the relationship between attributes and test items. Previous CDMs typically assumed a known Q-matrix provided by domain experts such as those who developed the questions. However, misspecifications of Q-matrix had been discovered in the past…
Descriptors: Diagnostic Tests, Cognitive Processes, Matrices, Test Items
Cominole, Melissa; Wheeless, Sara; Dudley, Kristin; Franklin, Jeff; Wine, Jennifer – National Center for Education Statistics, 2007
The "2004/06 Beginning Postsecondary Students Longitudinal Study (BPS:04/06)" is sponsored by the U.S. Department of Education to respond to the need for a national, comprehensive database concerning issues students may face in enrollment, persistence, progress, and attainment in postsecondary education and in consequent early rates of…
Descriptors: Postsecondary Education, Stopouts, Research Methodology, Data Collection
Filby, Nikola N.; Dishaw, Marilyn – 1976
Major analyses of the achievement tests used in the Beginning Teacher Evaluation Study were conducted to determine test reactivity to instruction. Reading and mathematics tests were administered to second and fifth grade children. Classroom teachers' records were examined to determine the amount of opportunity students had to learn the content…
Descriptors: Academic Ability, Academic Achievement, Achievement Gains, Achievement Tests
Chen, Shu-Ying; Lei, Pui-Wa – Applied Psychological Measurement, 2005
This article proposes an item exposure control method, which is the extension of the Sympson and Hetter procedure and can provide item exposure control at both the item and test levels. Item exposure rate and test overlap rate are two indices commonly used to track item exposure in computerized adaptive tests. By considering both indices, item…
Descriptors: Computer Assisted Testing, Test Items, Computer Simulation, Evaluation Criteria
Wine, Jennifer S.; Heuer, Ruth E.; Wheeless, Sara C.; Francis, Talbric L.; Franklin, Jeff W.; Dudley, Kristin M. – National Center for Education Statistics, 2002
This report describes the methods and procedures used for the Beginning Postsecondary Students Longitudinal Study: 1996-2001 (BPS:1996/2001). These students, who started their postsecondary education during the 1995-96 academic year, were first interviewed in 1996 as part of the National Postsecondary Student Aid Study (NPSAS:96). A follow-up…
Descriptors: Longitudinal Studies, Postsecondary Education, Research Methodology, Interviews
Sinharay, Sandip; Holland, Paul – ETS Research Report Series, 2006
It is a widely held belief that an anchor test used in equating should be a miniature version (or "minitest") of the tests to be equated; that is, the anchor test should be proportionally representative of the two tests in content and statistical characteristics. This paper examines the scientific foundation of this belief, especially…
Descriptors: Test Items, Equated Scores, Correlation, Tests
Filby, Nikola N.; Dishaw, Marilyn – 1975
Achievement tests that are maximally sensitive to effective instruction in reading and mathematics for grades 2 and 5 were developed and refined. Important considerations regarding the tests' validity were: its coverage of instructional content (opportunity to learn), and its reactivity to instruction. Student ability must be minimally related to…
Descriptors: Academic Ability, Academic Achievement, Achievement Gains, Achievement Tests
Bower, Ruth – Spectrum, 1983
As a preliminary step in customizing the Stanford Achievement Test, a study was made to compare the test items at each level with the curriculum objectives of a Florida school district, classifying them as either matching, above or below grade level, or unmatching. Results are shown in table form. (TE)
Descriptors: Elementary Secondary Education, Item Analysis, Standardized Tests, Statistical Data
Samejima, Fumiko – 1982
In a preceding research report, ONR/RR-82-1 (Information Loss Caused by Noise in Models for Dichotomous Items), observations were made on the effect of noise accommodated in different types of models on the dichotomous response level. In the present paper, focus is put upon the three-parameter logistic model, which is widely used among…
Descriptors: Estimation (Mathematics), Goodness of Fit, Guessing (Tests), Mathematical Models
Reckase, Mark D.; McKinley, Robert L. – 1983
A study was undertaken to develop guidelines for the interpretation of the parameters of three multidimensional item response theory models and to determine the relationship between the parameters and traditional concepts of item difficulty and discrimination. The three models considered were multidimensional extensions of the one-, two-, and…
Descriptors: Computer Programs, Difficulty Level, Goodness of Fit, Latent Trait Theory
Baxter, G. P.; Ahmed, S.; Sikali, E.; Waits, T.; Sloan, M.; Salvucci, S. – National Center for Education Statistics, 2007
In 2003, a trial National Assessment of Educational Progress (NAEP) mathematics assessment was administered in Spanish to public school students at grades 4 and 8 in Puerto Rico. Based on preliminary analyses of the 2003 data, changes were made in administration and translation procedures for the 2005 NAEP administration in Puerto Rico. This…
Descriptors: Foreign Countries, Grade 4, Grade 5, Grade 6
Samejima, Fumiko – 1982
Because of the recent popularity of the three-parameter logistic model among the researchers who apply latent trait theory, it will be worthwhile to investigate the effect of noise accommodated in different models. In the present paper, four types of models on the dichotomous response level, Types A, B, C and D, are considered. Type A does not…
Descriptors: Adaptive Testing, Goodness of Fit, Latent Trait Theory, Mathematical Models
Muir, Sharon Pray – 1976
The Test for Inquiry Social Studies (TISS) is a 40-item multiple-choice instrument designed to measure the application of either inquiry social studies or higher-level cognitive thinking skills in fifth and sixth graders. The test can generally be completed in 40 to 50 minutes. This administrator's guide contains information on the reliability,…
Descriptors: Answer Keys, Cognitive Processes, Difficulty Level, Grade 5
Crisp, Geoffrey T.; Palmer, Edward J. – Journal of University Teaching and Learning Practice, 2007
The appropriate analysis of students' responses to an assessment is an essential step in improving the quality of the assessment itself as well as staff teaching and student learning. Many academics are unfamiliar with the formal processes used to analyze assessment results; the standard statistical methods associated with analyzing the validity…
Descriptors: Multiple Choice Tests, Student Evaluation, Test Results, Test Construction
Previous Page | Next Page ยป
Pages: 1 | 2
Peer reviewed
Direct link
