Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 5 |
| Since 2007 (last 20 years) | 10 |
Descriptor
| Achievement Tests | 14 |
| Bayesian Statistics | 14 |
| Statistical Analysis | 14 |
| Foreign Countries | 8 |
| Item Response Theory | 5 |
| Correlation | 4 |
| Elementary School Students | 4 |
| Mathematics Tests | 4 |
| International Assessment | 3 |
| Item Analysis | 3 |
| Mathematics Achievement | 3 |
| More ▼ | |
Source
Author
| Huang, Hung-Yu | 2 |
| Almond, Russell | 1 |
| Boyd, Donald | 1 |
| Browne, Dillon T. | 1 |
| Casabianca, Jodi M. | 1 |
| Fifield, Steve | 1 |
| Ford, Danielle | 1 |
| Geary, David C. | 1 |
| Gentry, Marcia | 1 |
| Glutting, Joseoph | 1 |
| Gräfe, Linda | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 10 |
| Journal Articles | 9 |
| Reports - Evaluative | 2 |
| Guides - Classroom - Learner | 1 |
| Numerical/Quantitative Data | 1 |
Education Level
| Elementary Education | 3 |
| Elementary Secondary Education | 2 |
| Grade 4 | 2 |
| Grade 5 | 2 |
| Grade 8 | 2 |
| Intermediate Grades | 2 |
| Secondary Education | 2 |
| Grade 3 | 1 |
| Grade 6 | 1 |
| Grade 7 | 1 |
| Higher Education | 1 |
| More ▼ | |
Audience
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Huang, Hung-Yu – Educational and Psychological Measurement, 2020
In educational assessments and achievement tests, test developers and administrators commonly assume that test-takers attempt all test items with full effort and leave no blank responses with unplanned missing values. However, aberrant response behavior--such as performance decline, dropping out beyond a certain point, and skipping certain items…
Descriptors: Item Response Theory, Response Style (Tests), Test Items, Statistical Analysis
Casabianca, Jodi M.; Lewis, Charles – Journal of Educational and Behavioral Statistics, 2018
The null hypothesis test used in differential item functioning (DIF) detection tests for a subgroup difference in item-level performance--if the null hypothesis of "no DIF" is rejected, the item is flagged for DIF. Conversely, an item is kept in the test form if there is insufficient evidence of DIF. We present frequentist and empirical…
Descriptors: Test Bias, Hypothesis Testing, Bayesian Statistics, Statistical Analysis
Browne, Dillon T.; Wade, Mark; Prime, Heather; Jenkins, Jennifer M. – Journal of Educational Psychology, 2018
There is an ongoing need for literature that identifies the effects of broad contextual risk on school readiness outcomes via family mediating mechanisms. This is especially true amongst diverse and urban samples characterized by variability in immigration history. To address this limitation, family profiles of sociodemographic and contextual risk…
Descriptors: Foreign Countries, School Readiness, Urban Areas, Family Characteristics
Hodges, Jaret; McIntosh, Jason; Gentry, Marcia – Journal of Advanced Academics, 2017
High-potential students from low-income families are at an academic disadvantage compared with their more affluent peers. To address this issue, researchers have suggested novel approaches to mitigate gaps in student performance, including out-of-school enrichment programs. Longitudinal mixed effects modeling was used to analyze the growth of…
Descriptors: After School Programs, Enrichment Activities, Academic Achievement, High Achievement
Geary, David C.; vanMarle, Kristy – Developmental Psychology, 2016
At the beginning of preschool (M = 46 months of age), 197 (94 boys) children were administered tasks that assessed a suite of nonsymbolic and symbolic quantitative competencies as well as their executive functions, verbal and nonverbal intelligence, preliteracy skills, and their parents' education level. The children's mathematics achievement was…
Descriptors: Young Children, Mathematics, Mathematics Achievement, Mathematics Education
Pohl, Steffi; Gräfe, Linda; Rose, Norman – Educational and Psychological Measurement, 2014
Data from competence tests usually show a number of missing responses on test items due to both omitted and not-reached items. Different approaches for dealing with missing responses exist, and there are no clear guidelines on which of those to use. While classical approaches rely on an ignorable missing data mechanism, the most recently developed…
Descriptors: Test Items, Achievement Tests, Item Response Theory, Models
Qian, Xiaoyu; Nandakumar, Ratna; Glutting, Joseoph; Ford, Danielle; Fifield, Steve – ETS Research Report Series, 2017
In this study, we investigated gender and minority achievement gaps on 8th-grade science items employing a multilevel item response methodology. Both gaps were wider on physics and earth science items than on biology and chemistry items. Larger gender gaps were found on items with specific topics favoring male students than other items, for…
Descriptors: Item Analysis, Gender Differences, Achievement Gap, Grade 8
Huang, Hung-Yu; Wang, Wen-Chung – Educational and Psychological Measurement, 2014
In the social sciences, latent traits often have a hierarchical structure, and data can be sampled from multiple levels. Both hierarchical latent traits and multilevel data can occur simultaneously. In this study, we developed a general class of item response theory models to accommodate both hierarchical latent traits and multilevel data. The…
Descriptors: Item Response Theory, Hierarchical Linear Modeling, Computation, Test Reliability
Boyd, Donald; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – Journal of Educational and Behavioral Statistics, 2013
Test-based accountability as well as value-added asessments and much experimental and quasi-experimental research in education rely on achievement tests to measure student skills and knowledge. Yet, we know little regarding fundamental properties of these tests, an important example being the extent of measurement error and its implications for…
Descriptors: Accountability, Educational Research, Educational Testing, Error of Measurement
Kaplan, David; Turner, Alyn – OECD Publishing (NJ1), 2012
The OECD Program for International Student Assessment (PISA) and the OECD Teaching and Learning International Survey (TALIS) constitute two of the largest ongoing international student and teacher surveys presently underway. Data generated from these surveys offer researchers and policy-makers opportunities to identify particular educational…
Descriptors: Outcomes of Education, Teacher Surveys, Policy Analysis, Educational Change
PDF pending restorationvan der Linden, Wim J. – 1986
Differences between traditional linear regression and a Bayesian approach to classification are discussed. Classification consists of assigning subjects to one of several available treatments on the basis of their test scores when the success of each treatment is measured by a different criterion. Formulating this problem as an empirical Bayes…
Descriptors: Achievement Tests, Bayesian Statistics, Classification, Decision Making
Sinharay, Sandip; Almond, Russell; Yan, Duanli – Educational Testing Service, 2004
Model checking is a crucial part of any statistical analysis. As educators tie models for testing to cognitive theory of the domains, there is a natural tendency to represent participant proficiencies with latent variables representing the presence or absence of the knowledge, skills, and proficiencies to be tested (Mislevy, Almond, Yan, &…
Descriptors: Statistical Analysis, Epistemology, Educational Assessment, Item Response Theory
Urry, Vern W. – 1971
Bayesian estimation procedures are summarized and numerically illustrated by means of simulation methods. Procedures of data generation for simulation purposes are also delineated and computationally demonstrated. The logistic model basic to the Bayesian estimation procedures is shown to be explicit with respect to the probability distribution…
Descriptors: Achievement Tests, Adaptive Testing, Bayesian Statistics, Computer Programs
Warm, Thomas A. – 1978
This primer is an introduction to item response theory (also called item characteristic curve theory, or latent trait theory) as it is used most commonly--for scoring multiple choice achievement or aptitude tests. Written for the testing practitioner with minimum training in statistics and psychometrics, it presents and illustrates the basic…
Descriptors: Ability Identification, Achievement Tests, Adaptive Testing, Aptitude Tests

Peer reviewed
Direct link
