Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 11 |
Descriptor
Comparative Analysis | 11 |
Item Response Theory | 11 |
Models | 8 |
Foreign Countries | 5 |
Test Items | 5 |
Accuracy | 3 |
Data Analysis | 3 |
Goodness of Fit | 3 |
International Assessment | 3 |
Mathematics Tests | 3 |
Reading Tests | 3 |
More ▼ |
Source
ETS Research Report Series | 5 |
Educational Testing Service | 1 |
Educational and Psychological… | 1 |
International Journal of… | 1 |
Journal of Educational and… | 1 |
Measurement:… | 1 |
Psychometrika | 1 |
Author
von Davier, Matthias | 11 |
Xu, Xueli | 4 |
Carstensen, Claus H. | 3 |
Khorramdel, Lale | 2 |
Chen, Haiwen | 1 |
Chen, Haiwen H. | 1 |
Haberman, Shelby J. | 1 |
He, Qiwei | 1 |
Kong, Nan | 1 |
Lee, Yi-Hsuan | 1 |
Naemi, Bobby | 1 |
More ▼ |
Publication Type
Journal Articles | 10 |
Reports - Research | 10 |
Numerical/Quantitative Data | 1 |
Reports - Descriptive | 1 |
Education Level
Secondary Education | 3 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 10 | 1 |
Grade 12 | 1 |
Grade 4 | 1 |
Grade 9 | 1 |
High Schools | 1 |
Intermediate Grades | 1 |
Audience
Location
Bermuda | 1 |
Canada | 1 |
Germany | 1 |
Italy | 1 |
Norway | 1 |
Switzerland | 1 |
United States | 1 |
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 2 |
Program for International… | 2 |
Progress in International… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023
Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…
Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education
von Davier, Matthias; Khorramdel, Lale; He, Qiwei; Shin, Hyo Jeong; Chen, Haiwen – Journal of Educational and Behavioral Statistics, 2019
International large-scale assessments (ILSAs) transitioned from paper-based assessments to computer-based assessments (CBAs) facilitating the use of new item types and more effective data collection tools. This allows implementation of more complex test designs and to collect process and response time (RT) data. These new data types can be used to…
Descriptors: International Assessment, Computer Assisted Testing, Psychometrics, Item Response Theory
Chen, Haiwen H.; von Davier, Matthias; Yamamoto, Kentaro; Kong, Nan – ETS Research Report Series, 2015
One major issue with large-scale assessments is that the respondents might give no responses to many items, resulting in less accurate estimations of both assessed abilities and item parameters. This report studies how the types of items affect the item-level nonresponse rates and how different methods of treating item-level nonresponses have an…
Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students
Oliveri, Maria Elena; von Davier, Matthias – International Journal of Testing, 2014
In this article, we investigate the creation of comparable score scales across countries in international assessments. We examine potential improvements to current score scale calibration procedures used in international large-scale assessments. Our approach seeks to improve fairness in scoring international large-scale assessments, which often…
Descriptors: Test Bias, Scores, International Programs, Educational Assessment
von Davier, Matthias; Xu, Xueli; Carstensen, Claus H. – Psychometrika, 2011
The aim of the research presented here is the use of extensions of longitudinal item response theory (IRT) models in the analysis and comparison of group-specific growth in large-scale assessments of educational outcomes. A general discrete latent variable model was used to specify and compare two types of multidimensional item-response-theory…
Descriptors: Educational Objectives, Outcomes of Education, Measures (Individuals), Item Response Theory
von Davier, Matthias; Naemi, Bobby; Roberts, Richard D. – Measurement: Interdisciplinary Research and Perspectives, 2012
This article describes an exploration of the distinction between typological and factorial latent variables in the domain of personality theory. Traditionally, many personality variables have been considered to be factorial in nature, even though there are examples of typological constructs dating back to Hippocrates. Recently, some…
Descriptors: Individual Differences, Item Response Theory, Classification, Personality Theories
Haberman, Shelby J.; von Davier, Matthias; Lee, Yi-Hsuan – ETS Research Report Series, 2008
Multidimensional item response models can be based on multivariate normal ability distributions or on multivariate polytomous ability distributions. For the case of simple structure in which each item corresponds to a unique dimension of the ability vector, some applications of the two-parameter logistic model to empirical data are employed to…
Descriptors: Item Response Theory, Comparative Analysis, Ability, Models
von Davier, Matthias; Xu, Xueli; Carstensen, Claus H. – Educational Testing Service, 2009
A general diagnostic model was used to specify and compare two multidimensional item-response-theory (MIRT) models for longitudinal data: (a) a model that handles repeated measurements as multiple, correlated variables over time (Andersen, 1985) and (b) a model that assumes one common variable over time and additional orthogonal variables that…
Descriptors: Models, Item Response Theory, Longitudinal Studies, Measurement
Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2008
Three strategies for linking two consecutive assessments are investigated and compared by analyzing reading data for the National Assessment of Educational Progress (NAEP) using the general diagnostic model. These strategies are compared in terms of marginal and joint expectations of skills, joint probabilities of skill patterns, and item…
Descriptors: National Competency Tests, Probability, Reading Achievement, Test Items
von Davier, Alina A.; Carstensen, Claus H.; von Davier, Matthias – ETS Research Report Series, 2006
Measuring and linking competencies require special instruments, special data collection designs, and special statistical models. The measurement instruments are tests or tests forms, which can be used in the following situations: The same test can be given repeatedly; two or more parallel tests forms (i.e., forms intended to be similar in…
Descriptors: Scores, Measurement Techniques, Competence, Comparative Analysis
Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2006
More than a dozen statistical models have been developed for the purpose of cognitive diagnosis. These models are supposed to extract a much finer level of information from item responses than traditional unidimensional item response models. In this paper, a general diagnostic model (GDM) was used to analyze a set of simulated sparse data and real…
Descriptors: Statistical Analysis, National Competency Tests, Diagnostic Tests, Item Response Theory