Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 4 |
| Since 2017 (last 10 years) | 8 |
| Since 2007 (last 20 years) | 14 |
Descriptor
| Bayesian Statistics | 17 |
| High Stakes Tests | 17 |
| Accountability | 5 |
| Item Response Theory | 5 |
| Reaction Time | 4 |
| Scores | 4 |
| Simulation | 4 |
| Statistical Analysis | 4 |
| Test Items | 4 |
| Accuracy | 3 |
| Cheating | 3 |
| More ▼ | |
Source
Author
| Chun Wang | 2 |
| Foorman, Barbara R. | 2 |
| Jing Lu | 2 |
| Kershaw, Sarah | 2 |
| Koon, Sharon | 2 |
| Petscher, Yaacov | 2 |
| Sinharay, Sandip | 2 |
| Bejar, Isaac I. | 1 |
| Boyd, Donald | 1 |
| Chen, Qi | 1 |
| Feinberg, Richard A. | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 14 |
| Journal Articles | 11 |
| Reports - Evaluative | 2 |
| Dissertations/Theses -… | 1 |
| Speeches/Meeting Papers | 1 |
Education Level
| Elementary Education | 4 |
| Grade 3 | 4 |
| Grade 4 | 4 |
| Elementary Secondary Education | 3 |
| Grade 5 | 3 |
| Grade 6 | 3 |
| Grade 7 | 3 |
| Grade 8 | 3 |
| Secondary Education | 3 |
| Grade 10 | 2 |
| Grade 9 | 2 |
| More ▼ | |
Audience
Laws, Policies, & Programs
| Individuals with Disabilities… | 1 |
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
| Florida Comprehensive… | 2 |
| Dynamic Indicators of Basic… | 1 |
| Graduate Record Examinations | 1 |
| National Assessment of… | 1 |
| Program for International… | 1 |
| SAT (College Admission Test) | 1 |
| Woodcock Reading Mastery Test | 1 |
What Works Clearinghouse Rating
Huan Liu – ProQuest LLC, 2024
In many large-scale testing programs, examinees are frequently categorized into different performance levels. These classifications are then used to make high-stakes decisions about examinees in contexts such as in licensure, certification, and educational assessments. Numerous approaches to estimating the consistency and accuracy of this…
Descriptors: Classification, Accuracy, Item Response Theory, Decision Making
Zhu, Hongyue; Jiao, Hong; Gao, Wei; Meng, Xiangbin – Journal of Educational and Behavioral Statistics, 2023
Change-point analysis (CPA) is a method for detecting abrupt changes in parameter(s) underlying a sequence of random variables. It has been applied to detect examinees' aberrant test-taking behavior by identifying abrupt test performance change. Previous studies utilized maximum likelihood estimations of ability parameters, focusing on detecting…
Descriptors: Bayesian Statistics, Test Wiseness, Behavior Problems, Reaction Time
Feinberg, Richard A. – Educational Measurement: Issues and Practice, 2021
Unforeseen complications during the administration of large-scale testing programs are inevitable and can prevent examinees from accessing all test material. For classification tests in which the primary purpose is to yield a decision, such as a pass/fail result, the current study investigated a model-based standard error approach, Bayesian…
Descriptors: High Stakes Tests, Classification, Decision Making, Bayesian Statistics
Niessen, A. Susan M.; Meijer, Rob R.; Tendeiro, Jorge N. – Educational Measurement: Issues and Practice, 2019
A longstanding concern about admissions to higher education is the underprediction of female academic performance by admission test scores. One explanation for these findings is selection system bias, that is, not all relevant KSAOs that are related to academic performance and gender are included in the prediction model. One solution to this…
Descriptors: College Admission, High Stakes Tests, Gender Differences, Sampling
A Sequential Bayesian Changepoint Detection Procedure for Aberrant Behaviors in Computerized Testing
Jing Lu; Chun Wang; Jiwei Zhang; Xue Wang – Grantee Submission, 2023
Changepoints are abrupt variations in a sequence of data in statistical inference. In educational and psychological assessments, it is pivotal to properly differentiate examinees' aberrant behaviors from solution behavior to ensure test reliability and validity. In this paper, we propose a sequential Bayesian changepoint detection algorithm to…
Descriptors: Bayesian Statistics, Behavior Patterns, Computer Assisted Testing, Accuracy
Jing Lu; Chun Wang; Ningzhong Shi – Grantee Submission, 2023
In high-stakes, large-scale, standardized tests with certain time limits, examinees are likely to engage in either one of the three types of behavior (e.g., van der Linden & Guo, 2008; Wang & Xu, 2015): solution behavior, rapid guessing behavior, and cheating behavior. Oftentimes examinees do not always solve all items due to various…
Descriptors: High Stakes Tests, Standardized Tests, Guessing (Tests), Cheating
Man, Kaiwen; Harring, Jeffery R.; Ouyang, Yunbo; Thomas, Sarah L. – International Journal of Testing, 2018
Many important high-stakes decisions--college admission, academic performance evaluation, and even job promotion--depend on accurate and reliable scores from valid large-scale assessments. However, examinees sometimes cheat by copying answers from other test-takers or practicing with test items ahead of time, which can undermine the effectiveness…
Descriptors: Reaction Time, High Stakes Tests, Test Wiseness, Cheating
Huang, Hung-Yu – Educational and Psychological Measurement, 2017
Mixture item response theory (IRT) models have been suggested as an efficient method of detecting the different response patterns derived from latent classes when developing a test. In testing situations, multiple latent traits measured by a battery of tests can exhibit a higher-order structure, and mixtures of latent classes may occur on…
Descriptors: Item Response Theory, Models, Bayesian Statistics, Computation
Ng, Hui Leng; Koretz, Daniel – Applied Measurement in Education, 2015
Policymakers usually leave decisions about scaling the scores used for accountability to their appointed technical advisory committees and the testing contractors. However, scaling decisions can have an appreciable impact on school ratings. Using middle-school data from New York State, we examined the consistency of school ratings based on two…
Descriptors: School Effectiveness, Scaling, Middle Schools, Accountability
Boyd, Donald; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – Journal of Educational and Behavioral Statistics, 2013
Test-based accountability as well as value-added asessments and much experimental and quasi-experimental research in education rely on achievement tests to measure student skills and knowledge. Yet, we know little regarding fundamental properties of these tests, an important example being the extent of measurement error and its implications for…
Descriptors: Accountability, Educational Research, Educational Testing, Error of Measurement
Petscher, Yaacov; Kershaw, Sarah; Koon, Sharon; Foorman, Barbara R. – Regional Educational Laboratory Southeast, 2014
Districts and schools use progress monitoring to assess student progress, to identify students who fail to respond to intervention, and to further adapt instruction to student needs. Researchers and practitioners often use progress monitoring data to estimate student achievement growth (slope) and evaluate changes in performance over time for…
Descriptors: Reading Comprehension, Reading Achievement, Elementary School Students, Secondary School Students
Petscher, Yaacov; Kershaw, Sarah; Koon, Sharon; Foorman, Barbara R. – Regional Educational Laboratory Southeast, 2014
Districts and schools use progress monitoring to assess student progress, to identify students who fail to respond to intervention, and to further adapt instruction to student needs. Researchers and practitioners often use progress monitoring data to estimate student achievement growth (slope) and evaluate changes in performance over time for…
Descriptors: Response to Intervention, Achievement Gains, High Stakes Tests, Prediction
Hughes, Jan N.; Chen, Qi; Thoemmes, Felix; Kwok, Oi-man – Educational Evaluation and Policy Analysis, 2010
The association between grade retention in first grade and passing the third grade state accountability tests, the Texas Assessment of Knowledge and Skills (TAKS) reading and math, was investigated in a sample of 769 students who were recruited into the study when they were in first grade. Of these 769 students, 165 were retained in first grade…
Descriptors: Grade Repetition, Mathematics Tests, High Stakes Tests, Grade 3
Johnson, David R.; Thurlow, Martha L.; Stout, Karen Evans; Mavis, Ann – Journal of Special Education Leadership, 2007
In response to public demands for better-quality high school graduates and to requirements of No Child Left Behind legislation, states have developed a variety of policies such as high-stakes exit exams and diploma options. Additionally, under the Individuals with Disabilities Education Act Amendments of 1997, students with disabilities must be…
Descriptors: Federal Legislation, High Stakes Tests, High School Graduates, Exit Examinations
Williamson, David M.; Johnson, Matthew S.; Sinharay, Sandip; Bejar, Isaac I. – 2002
This paper explores the application of a technique for hierarchical item response theory (IRT) calibration of complex constructed response tasks that has promise both as a calibration tool and as a means of evaluating the isomorphic equivalence of complex constructed response tasks. Isomorphic tasks are explicitly and rigorously designed to be…
Descriptors: Bayesian Statistics, Constructed Response, Estimation (Mathematics), Evaluation Methods
Previous Page | Next Page ยป
Pages: 1 | 2
Direct link
Peer reviewed
