Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 3 |
| Since 2007 (last 20 years) | 8 |
Descriptor
| Bayesian Statistics | 10 |
| Mathematics Tests | 10 |
| Statistical Analysis | 10 |
| Models | 6 |
| Item Response Theory | 5 |
| Test Items | 5 |
| Achievement Tests | 4 |
| Correlation | 4 |
| Foreign Countries | 4 |
| Computation | 3 |
| Scores | 3 |
| More ▼ | |
Source
| ETS Research Report Series | 2 |
| Educational and Psychological… | 2 |
| Journal of Educational and… | 2 |
| Applied Psychological… | 1 |
| EURASIA Journal of… | 1 |
| International Education… | 1 |
| Journal of Educational… | 1 |
Author
| Boyd, Donald | 1 |
| Chang, Wanchen | 1 |
| Dodd, Barbara G. | 1 |
| Fifield, Steve | 1 |
| Ford, Danielle | 1 |
| Gamerman, Dani | 1 |
| Glutting, Joseoph | 1 |
| Goncalves, Flavio B. | 1 |
| Gräfe, Linda | 1 |
| Huang, Hung-Yu | 1 |
| Lankford, Hamilton | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 10 |
| Reports - Research | 9 |
| Reports - Evaluative | 1 |
Education Level
| Elementary Education | 3 |
| Grade 4 | 3 |
| Grade 5 | 3 |
| Grade 8 | 3 |
| Intermediate Grades | 3 |
| Higher Education | 2 |
| Elementary Secondary Education | 1 |
| Grade 12 | 1 |
| Grade 3 | 1 |
| Grade 6 | 1 |
| Grade 7 | 1 |
| More ▼ | |
Audience
Laws, Policies, & Programs
Assessments and Surveys
| Trends in International… | 2 |
| National Assessment of… | 1 |
| Students Evaluation of… | 1 |
What Works Clearinghouse Rating
Zhang, Zhidong – International Education Studies, 2018
This study explored a diagnostic assessment method that emphasized the cognitive process of algebra learning. The study utilized a design and a theory-driven model to examine the content knowledge. Using the theory driven model, the thinking skills of algebra learning was also examined. A Bayesian network model was applied to represent the theory…
Descriptors: Algebra, Bayesian Statistics, Scores, Mathematics Achievement
Ting, Mu Yu – EURASIA Journal of Mathematics, Science & Technology Education, 2017
Using the capabilities of expert knowledge structures, the researcher prepared test questions on the university calculus topic of "finding the area by integration." The quiz is divided into two types of multiple choice items (one out of four and one out of many). After the calculus course was taught and tested, the results revealed that…
Descriptors: Calculus, Mathematics Instruction, College Mathematics, Multiple Choice Tests
Pohl, Steffi; Gräfe, Linda; Rose, Norman – Educational and Psychological Measurement, 2014
Data from competence tests usually show a number of missing responses on test items due to both omitted and not-reached items. Different approaches for dealing with missing responses exist, and there are no clear guidelines on which of those to use. While classical approaches rely on an ignorable missing data mechanism, the most recently developed…
Descriptors: Test Items, Achievement Tests, Item Response Theory, Models
Whittaker, Tiffany A.; Chang, Wanchen; Dodd, Barbara G. – Applied Psychological Measurement, 2012
When tests consist of multiple-choice and constructed-response items, researchers are confronted with the question of which item response theory (IRT) model combination will appropriately represent the data collected from these mixed-format tests. This simulation study examined the performance of six model selection criteria, including the…
Descriptors: Item Response Theory, Models, Selection, Criteria
Qian, Xiaoyu; Nandakumar, Ratna; Glutting, Joseoph; Ford, Danielle; Fifield, Steve – ETS Research Report Series, 2017
In this study, we investigated gender and minority achievement gaps on 8th-grade science items employing a multilevel item response methodology. Both gaps were wider on physics and earth science items than on biology and chemistry items. Larger gender gaps were found on items with specific topics favoring male students than other items, for…
Descriptors: Item Analysis, Gender Differences, Achievement Gap, Grade 8
Huang, Hung-Yu; Wang, Wen-Chung – Educational and Psychological Measurement, 2014
In the social sciences, latent traits often have a hierarchical structure, and data can be sampled from multiple levels. Both hierarchical latent traits and multilevel data can occur simultaneously. In this study, we developed a general class of item response theory models to accommodate both hierarchical latent traits and multilevel data. The…
Descriptors: Item Response Theory, Hierarchical Linear Modeling, Computation, Test Reliability
Boyd, Donald; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – Journal of Educational and Behavioral Statistics, 2013
Test-based accountability as well as value-added asessments and much experimental and quasi-experimental research in education rely on achievement tests to measure student skills and knowledge. Yet, we know little regarding fundamental properties of these tests, an important example being the extent of measurement error and its implications for…
Descriptors: Accountability, Educational Research, Educational Testing, Error of Measurement
Soares, Tufi M.; Goncalves, Flavio B.; Gamerman, Dani – Journal of Educational and Behavioral Statistics, 2009
In this article, an integrated Bayesian model for differential item functioning (DIF) analysis is proposed. The model is integrated in the sense of modeling the responses along with the DIF analysis. This approach allows DIF detection and explanation in a simultaneous setup. Previous empirical studies and/or subjective beliefs about the item…
Descriptors: Test Bias, Bayesian Statistics, Models, Item Response Theory
Peer reviewedTsutakawa, Robert K.; Soltys, Michael J. – Journal of Educational Statistics, 1988
An approximation procedure is proposed for the posterior means and standard deviation of the ability parameter in an item response model. The method is illustrated for the two-parameter logistic model using data from a 39-item American College Testing mathematics test. The effect of sample size is considered. (SLD)
Descriptors: Ability, Academic Ability, Bayesian Statistics, Equations (Mathematics)
Sinharay, Sandip – ETS Research Report Series, 2004
Assessing fit of psychometric models has always been an issue of enormous interest, but there exists no unanimously agreed upon item fit diagnostic for the models. Bayesian networks, frequently used in educational assessments (see, for example, Mislevy, Almond, Yan, & Steinberg, 2001) primarily for learning about students' knowledge and…
Descriptors: Bayesian Statistics, Networks, Models, Goodness of Fit

Direct link
