NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 23 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024
In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…
Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Liao, Xiangyi; Bolt, Daniel M. – Journal of Educational and Behavioral Statistics, 2021
Four-parameter models have received increasing psychometric attention in recent years, as a reduced upper asymptote for item characteristic curves can be appealing for measurement applications such as adaptive testing and person-fit assessment. However, applications can be challenging due to the large number of parameters in the model. In this…
Descriptors: Test Items, Models, Mathematics Tests, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024
Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…
Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Ilhan, Mustafa; Öztürk, Nagihan Boztunç; Sahin, Melek Gülsah – Participatory Educational Research, 2020
In this research, the effect of an item's type and cognitive level on its difficulty index was investigated. The data source of the study consisted of the responses of the 12535 students in the Turkey sample (6079 and 6456 students from eighth and fourth grade respectively) of TIMSS 2015. The responses were a total of 215 items at the eighth-grade…
Descriptors: Test Items, Difficulty Level, Cognitive Processes, Responses
Ping Wang – ProQuest LLC, 2021
According to the RAND model framework, reading comprehension test performance is influenced by readers' reading skills or reader characteristics, test properties, and their interactions. However, little empirical research has systematically compared the impacts of reader characteristics, test properties, and reader-test interactions across…
Descriptors: Reading Comprehension, Reading Tests, Reading Research, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Alexander, Patricia A.; Zhao, Hongyang; Sun, Yuting – Mathematical Thinking and Learning: An International Journal, 2020
In this study, we analyzed the imprecise (i.e., less mathematically precise) responses that 148 third- to fifth-grade Chinese students made on selected-response problems that were part of a spontaneous mathematical focusing task, the Quantitative Relations Test for Chinese Children (QRTC[superscript 2]). The purpose for this analysis was to…
Descriptors: Mathematics Instruction, Teaching Methods, Multiplication, Elementary School Students
Sullivan, Samantha DeHaan – ProQuest LLC, 2019
This dissertation reports the results of a research project that examined the effects of varying rates of reinforcement on students' math performance and assignment preference. The relationship between students' instructional level and the rate of reinforcement threshold for influencing assignment preference was also explored. Participants were…
Descriptors: Mathematics Achievement, Reinforcement, Mathematics Instruction, Assignments
Schoen, Robert C.; Liu, Sicong; Yang, Xiaotong; Paek, Insu – Grantee Submission, 2017
The Early Fractions Test is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test is to serve as a student pretest covariate and a test of baseline equivalence in the larger study. In this report, we discuss our…
Descriptors: Mathematics Achievement, Fractions, Mathematics Tests, Grade 3
Schoen, Robert C.; Yang, Xiaotong; Liu, Sicong; Paek, Insu – Grantee Submission, 2017
The Early Fractions Test v2.2 is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test v2.2 is to serve as a measure of student outcomes in a randomized trial designed to estimate the effect of an educational…
Descriptors: Psychometrics, Mathematics Tests, Mathematics Achievement, Fractions
Peer reviewed Peer reviewed
Direct linkDirect link
Ye, Meng; Xin, Tao – Educational and Psychological Measurement, 2014
The authors explored the effects of drifting common items on vertical scaling within the higher order framework of item parameter drift (IPD). The results showed that if IPD occurred between a pair of test levels, the scaling performance started to deviate from the ideal state, as indicated by bias of scaling. When there were two items drifting…
Descriptors: Scaling, Test Items, Equated Scores, Achievement Gains
Irvin, P. Shawn; Saven, Jessica L.; Alonzo, Julie; Park, Bitnara Jasmine; Anderson, Daniel; Tindal, Gerald – Behavioral Research and Teaching, 2012
The results of formative assessments are regularly used to inform important instructional decisions (e.g., targeted intervention) within a response to intervention (RTI) system of teaching and learning. The validity of such instructional decision-making depends, in part, on the alignment between formative measures and the academic content…
Descriptors: Elementary School Mathematics, Curriculum Based Assessment, Mathematics Tests, Academic Standards
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Pishghadamn, Reza; Abbasnejad, Hannaheh – International Electronic Journal of Elementary Education, 2016
Given the deficiencies of readability formulae as reliable tools for measuring text readability in educational settings, this study aims to offer a new measure to improve the current methods of testing the readability levels of texts through the incorporation of the newly-developed concept of emotioncy. To this end, a group of 221 students were…
Descriptors: Readability, Readability Formulas, Psychological Patterns, Emotional Response
Peer reviewed Peer reviewed
Direct linkDirect link
Cueto, Santiago; Guerrero, Gabriela; Leon, Juan; Zapata, Mayli; Freire, Silvana – Oxford Review of Education, 2014
Using Young Lives longitudinal data from Peru, this paper explores the relationship between socioeconomic status (SES) measured at the age of one, opportunities to learn (OTL) and achievement in mathematics ten years later. Four variables of OTL were measured: hours of class per year, curriculum coverage, quality of teachers' feedback, and level…
Descriptors: Socioeconomic Status, Correlation, Infants, Educational Opportunities
Peer reviewed Peer reviewed
Direct linkDirect link
Cho, Hyun-Jeong; Lee, Jaehoon; Kingston, Neal – Applied Measurement in Education, 2012
This study examined the validity of test accommodation in third-eighth graders using differential item functioning (DIF) and mixture IRT models. Two data sets were used for these analyses. With the first data set (N = 51,591) we examined whether item type (i.e., story, explanation, straightforward) or item features were associated with item…
Descriptors: Testing Accommodations, Test Bias, Item Response Theory, Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Jianjun – School Science and Mathematics, 2011
As the largest international study ever taken in history, the Trend in Mathematics and Science Study (TIMSS) has been held as a benchmark to measure U.S. student performance in the global context. In-depth analyses of the TIMSS project are conducted in this study to examine key issues of the comparative investigation: (1) item flaws in mathematics…
Descriptors: Test Items, Figurative Language, Item Response Theory, Benchmarking
Previous Page | Next Page »
Pages: 1  |  2