ERIC - Search Results

Showing 1 to 15 of 26 results

Artificial Intelligence and Educational Measurement: Opportunities and Threats

Peer reviewed

Andrew D. Ho – Journal of Educational and Behavioral Statistics, 2024

I review opportunities and threats that widely accessible Artificial Intelligence (AI)-powered services present for educational statistics and measurement. Algorithmic and computational advances continue to improve approaches to item generation, scale maintenance, test security, test scoring, and score reporting. Predictable misuses of AI for…

Descriptors: Artificial Intelligence, Measurement, Educational Assessment, Technology Uses in Education

Nonlinear Penalized Estimation of True Q-Matrix in Cognitive Diagnostic Models

Xiang, Rui – ProQuest LLC, 2013

A key issue of cognitive diagnostic models (CDMs) is the correct identification of Q-matrix which indicates the relationship between attributes and test items. Previous CDMs typically assumed a known Q-matrix provided by domain experts such as those who developed the questions. However, misspecifications of Q-matrix had been discovered in the past…

Descriptors: Diagnostic Tests, Cognitive Processes, Matrices, Test Items

2004/06 Beginning Postsecondary Students Longitudinal Study (BPS:04/06). Methodology Report. NCES 2008-184

Peer reviewed

Cominole, Melissa; Wheeless, Sara; Dudley, Kristin; Franklin, Jeff; Wine, Jennifer – National Center for Education Statistics, 2007

The "2004/06 Beginning Postsecondary Students Longitudinal Study (BPS:04/06)" is sponsored by the U.S. Department of Education to respond to the need for a national, comprehensive database concerning issues students may face in enrollment, persistence, progress, and attainment in postsecondary education and in consequent early rates of…

Descriptors: Postsecondary Education, Stopouts, Research Methodology, Data Collection

Refinement of Reading and Mathematics Test Through an Analysis of Reactivity. Beginning Teacher Evaluation Study. Technical Report Series. Technical Report III-6.

Filby, Nikola N.; Dishaw, Marilyn – 1976

Major analyses of the achievement tests used in the Beginning Teacher Evaluation Study were conducted to determine test reactivity to instruction. Reading and mathematics tests were administered to second and fifth grade children. Classroom teachers' records were examined to determine the amount of opportunity students had to learn the content…

Descriptors: Academic Ability, Academic Achievement, Achievement Gains, Achievement Tests

Controlling Item Exposure and Test Overlap in Computerized Adaptive Testing

Peer reviewed

Chen, Shu-Ying; Lei, Pui-Wa – Applied Psychological Measurement, 2005

This article proposes an item exposure control method, which is the extension of the Sympson and Hetter procedure and can provide item exposure control at both the item and test levels. Item exposure rate and test overlap rate are two indices commonly used to track item exposure in computerized adaptive tests. By considering both indices, item…

Descriptors: Computer Assisted Testing, Test Items, Computer Simulation, Evaluation Criteria

Beginning Postsecondary Students Longitudinal Study: 1996-2001 (BPS:1996/2001) Methodology Report. Technical Report. NCES 2002-171

Peer reviewed

Wine, Jennifer S.; Heuer, Ruth E.; Wheeless, Sara C.; Francis, Talbric L.; Franklin, Jeff W.; Dudley, Kristin M. – National Center for Education Statistics, 2002

This report describes the methods and procedures used for the Beginning Postsecondary Students Longitudinal Study: 1996-2001 (BPS:1996/2001). These students, who started their postsecondary education during the 1995-96 academic year, were first interviewed in 1996 as part of the National Postsecondary Student Aid Study (NPSAS:96). A follow-up…

Descriptors: Longitudinal Studies, Postsecondary Education, Research Methodology, Interviews

The Correlation between the Scores of a Test and an Anchor Test. Research Report. ETS RR-06-04

Peer reviewed

Sinharay, Sandip; Holland, Paul – ETS Research Report Series, 2006

It is a widely held belief that an anchor test used in equating should be a miniature version (or "minitest") of the tests to be equated; that is, the anchor test should be proportionally representative of the two tests in content and statistical characteristics. This paper examines the scientific foundation of this belief, especially…

Descriptors: Test Items, Equated Scores, Correlation, Tests

Development and Refinement of Reading and Mathematics Tests for Grades 2 and 5. Beginning Teacher Evaluation Study. Technical Report Series. Technical Report III-1. Continuation of Phase III A.

Filby, Nikola N.; Dishaw, Marilyn – 1975

Achievement tests that are maximally sensitive to effective instruction in reading and mathematics for grades 2 and 5 were developed and refined. Important considerations regarding the tests' validity were their coverage of instructional content (opportunity to learn) and their reactivity to instruction. Student ability must be minimally related to…

Descriptors: Academic Ability, Academic Achievement, Achievement Gains, Achievement Tests

Matching Standardized Achievement Tests to Local Objectives.

Bower, Ruth – Spectrum, 1983

As a preliminary step in customizing the Stanford Achievement Test, a study was made to compare the test items at each level with the curriculum objectives of a Florida school district, classifying them as either matching, above or below grade level, or unmatching. Results are shown in table form. (TE)

Descriptors: Elementary Secondary Education, Item Analysis, Standardized Tests, Statistical Data

Effect of Noise in the Three-Parameter Logistic Model.

Samejima, Fumiko – 1982

In a preceding research report, ONR/RR-82-1 (Information Loss Caused by Noise in Models for Dichotomous Items), observations were made on the effect of noise accommodated in different types of models on the dichotomous response level. In the present paper, focus is put upon the three-parameter logistic model, which is widely used among…

Descriptors: Estimation (Mathematics), Goodness of Fit, Guessing (Tests), Mathematical Models

The Definition of Difficulty and Discrimination for Multidimensional Item Response Theory Models.

Reckase, Mark D.; McKinley, Robert L. – 1983

A study was undertaken to develop guidelines for the interpretation of the parameters of three multidimensional item response theory models and to determine the relationship between the parameters and traditional concepts of item difficulty and discrimination. The three models considered were multidimensional extensions of the one-, two-, and…

Descriptors: Computer Programs, Difficulty Level, Goodness of Fit, Latent Trait Theory

Technical Report of the NAEP Mathematics Assessment in Puerto Rico: Focus on Statistical Issues. NCES 2007-462

Peer reviewed

Baxter, G. P.; Ahmed, S.; Sikali, E.; Waits, T.; Sloan, M.; Salvucci, S. – National Center for Education Statistics, 2007

In 2003, a trial National Assessment of Educational Progress (NAEP) mathematics assessment was administered in Spanish to public school students at grades 4 and 8 in Puerto Rico. Based on preliminary analyses of the 2003 data, changes were made in administration and translation procedures for the 2005 NAEP administration in Puerto Rico. This…

Descriptors: Foreign Countries, Grade 4, Grade 5, Grade 6

Information Loss Caused by Noise in Models for Dichotomous Items.

Samejima, Fumiko – 1982

Because of the recent popularity of the three-parameter logistic model among the researchers who apply latent trait theory, it will be worthwhile to investigate the effect of noise accommodated in different models. In the present paper, four types of models on the dichotomous response level, Types A, B, C and D, are considered. Type A does not…

Descriptors: Adaptive Testing, Goodness of Fit, Latent Trait Theory, Mathematical Models

A Test for Inquiry Social Studies: Administrator's Guide; Grades 5 and 6.

Muir, Sharon Pray – 1976

The Test for Inquiry Social Studies (TISS) is a 40-item multiple-choice instrument designed to measure the application of either inquiry social studies or higher-level cognitive thinking skills in fifth and sixth graders. The test can generally be completed in 40 to 50 minutes. This administrator's guide contains information on the reliability,…

Descriptors: Answer Keys, Cognitive Processes, Difficulty Level, Grade 5

Engaging Academics with a Simplified Analysis of Their Multiple-Choice Question (MCQ) Assessment Results

Peer reviewed

Crisp, Geoffrey T.; Palmer, Edward J. – Journal of University Teaching and Learning Practice, 2007

The appropriate analysis of students' responses to an assessment is an essential step in improving the quality of the assessment itself as well as staff teaching and student learning. Many academics are unfamiliar with the formal processes used to analyze assessment results; the standard statistical methods associated with analyzing the validity…

Descriptors: Multiple Choice Tests, Student Evaluation, Test Results, Test Construction
