Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 3 |
| Since 2007 (last 20 years) | 5 |
Descriptor
| Bayesian Statistics | 7 |
| Scores | 7 |
| Test Reliability | 7 |
| Error of Measurement | 4 |
| Comparative Analysis | 3 |
| Test Validity | 3 |
| Accuracy | 2 |
| Correlation | 2 |
| Equations (Mathematics) | 2 |
| High School Students | 2 |
| Item Response Theory | 2 |
| More ▼ | |
Source
| Educational and Psychological… | 2 |
| ETS Research Report Series | 1 |
| Education and Information… | 1 |
| ProQuest LLC | 1 |
| Psychometrika | 1 |
Author
| Carvajal, Jorge | 1 |
| Flore, Paulette C. | 1 |
| Gelbal, Selahattin | 1 |
| Guo, Hongwen | 1 |
| Kim, Jwa K. | 1 |
| Millman, Jason | 1 |
| Nicewander, W. Alan | 1 |
| Oh, Hyeonjoo J. | 1 |
| Ozdemir, Burhanettin | 1 |
| Pei-Hsuan Chiu | 1 |
| Phillips, Lorraine A. T. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 5 |
| Reports - Research | 4 |
| Reports - Evaluative | 2 |
| Dissertations/Theses -… | 1 |
Education Level
| High Schools | 2 |
| Higher Education | 2 |
| Postsecondary Education | 2 |
| Secondary Education | 2 |
Audience
Location
| Netherlands | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| National Merit Scholarship… | 1 |
| Preliminary Scholastic… | 1 |
What Works Clearinghouse Rating
Stoevenbelt, Andrea H.; Wicherts, Jelte M.; Flore, Paulette C.; Phillips, Lorraine A. T.; Pietschnig, Jakob; Verschuere, Bruno; Voracek, Martin; Schwabe, Inga – Educational and Psychological Measurement, 2023
When cognitive and educational tests are administered under time limits, tests may become speeded and this may affect the reliability and validity of the resulting test scores. Prior research has shown that time limits may create or enlarge gender gaps in cognitive and academic testing. On average, women complete fewer items than men when a test…
Descriptors: Timed Tests, Gender Differences, Item Response Theory, Correlation
Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022
The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…
Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency
Pei-Hsuan Chiu – ProQuest LLC, 2018
Evidence of student growth is a primary outcome of interest for educational accountability systems. When three or more years of student test data are available, questions around how students grow and what their predicted growth is can be answered. Given that test scores contain measurement error, this error should be considered in growth and…
Descriptors: Bayesian Statistics, Scores, Error of Measurement, Growth Models
Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010
This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…
Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores
Oh, Hyeonjoo J.; Guo, Hongwen; Walker, Michael E. – ETS Research Report Series, 2009
Issues of equity and fairness across subgroups of the population (e.g., gender or ethnicity) must be seriously considered in any standardized testing program. For this reason, many testing programs require some means for assessing test characteristics, such as reliability, for subgroups of the population. However, often only small sample sizes are…
Descriptors: Standardized Tests, Test Reliability, Sample Size, Bayesian Statistics
Peer reviewedKim, Jwa K.; Nicewander, W. Alan – Psychometrika, 1993
Bias, standard error, and reliability of five ability estimators were evaluated using Monte Carlo estimates of the unknown conditional means and variances of the estimators. Results indicate that estimates based on Bayesian modal, expected a posteriori, and weighted likelihood estimators were reasonably unbiased with relatively small standard…
Descriptors: Ability, Bayesian Statistics, Equations (Mathematics), Error of Measurement
Millman, Jason – 1974
This chapter should not only acquaint the reader with the present state of the art on Criterion-Referenced (CR) measurement but also suggest possible directions for further inquiry. The goal of the first part of this chapter is to deal with the definitional dilemma of CR measurement by proceeding from the more traditional view of CR measurement to…
Descriptors: Analysis of Variance, Bayesian Statistics, Behavioral Objectives, Comparative Analysis

Direct link
