Showing 1 to 15 of 18 results
Peer reviewed
Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022
In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…
Descriptors: Standardized Tests, Test Items, Test Validity, Scores
Peer reviewed
Lu, Jing; Wang, Chun – Journal of Educational Measurement, 2020
Item nonresponses are prevalent in standardized testing. They happen either when students fail to reach the end of a test due to a time limit or quitting, or when students choose to omit some items strategically. Oftentimes, item nonresponses are nonrandom, and hence, the missing data mechanism needs to be properly modeled. In this paper, we…
Descriptors: Item Response Theory, Test Items, Standardized Tests, Responses
Peer reviewed
Kaiser, Henry F. – Educational and Psychological Measurement, 1980
The use of Bayes' estimates for proportions in the Law of Comparative Judgment is suggested to avoid sample proportions of zero and one. (Author)
Descriptors: Bayesian Statistics, Comparative Analysis, Reliability, Statistical Analysis
Wendler, Cathy L. W. – 1982
It is commonly proposed that educational and psychological assessments consist of a multifaceted procedure that incorporates scores from a variety of tests and resources. Little guidance has been offered regarding the method for selecting a combination of testing instruments used in an evaluation. The problem of test selection is discussed in the…
Descriptors: Bayesian Statistics, Cost Effectiveness, Methods, Models
Peer reviewed
Meredith, William; Millsap, Roger E. – Psychometrika, 1992
A unified treatment is presented for conditions that should allow detection of measurement bias using statistical procedures involving only observed or manifest variables. Computational results demonstrate that methods for studying bias that rely exclusively on manifest variables are not generally diagnostic of the presence or absence of…
Descriptors: Bayesian Statistics, Equations (Mathematics), Identification, Item Bias
Sympson, James B. – 1976
Latent trait test score theory is discussed primarily in terms of Birnbaum's three-parameter logistic model, and with some reference to the Rasch model. Equations and graphic illustrations are given for item characteristic curves and item information curves. An example is given for a hypothetical 20-item adaptive test, showing cumulative results…
Descriptors: Adaptive Testing, Bayesian Statistics, Item Analysis, Latent Trait Theory
Wang, Jianjun – 1995
Effects of blind guessing on the success of passing true-false and multiple-choice tests are investigated under a stochastic binomial model. Critical values of guessing are thresholds which signify when the effect of guessing is negligible. By checking a table of critical values assembled in this paper, one can make a decision with 95% confidence…
Descriptors: Bayesian Statistics, Grading, Guessing (Tests), Models
Wilcox, Rand R. – 1979
Three separate papers are included in this report. The first describes a two-stage procedure for choosing from among several instructional programs the one which maximizes the probability of passing the test. The second gives the exact sample sizes required to determine whether a squared multiple correlation coefficient is above or below a known…
Descriptors: Bayesian Statistics, Correlation, Hypothesis Testing, Mathematical Models
PDF pending restoration
van der Linden, Wim J. – 1984
The classification problem in educational testing is a decision problem. One must assign subjects to one of several available treatments on the basis of test scores, where the success of each treatment is measured by a different criterion. Examples of classification decisions include individualized instruction, counseling, and clinical settings.…
Descriptors: Bayesian Statistics, Classification, Cutting Scores, Decision Making
De Ayala, R. J. – 1990
The effect of dimensionality on an adaptive test's ability estimation was examined. Two-dimensional data sets, which differed from one another in the interdimensional ability association, the correlation among the difficulty parameters, and whether the item discriminations were or were not confounded with item difficulty, were generated for 1,600…
Descriptors: Ability Identification, Adaptive Testing, Bayesian Statistics, Computer Assisted Testing
Way, Walter D.; McKinley, Robert L. – 1991
Two procedures were developed to determine whether examinees in a given test center were affected by a testing irregularity on the Listening Comprehension section of the Test of English as a Foreign Language (TOEFL). One approach employed analysis of covariance (ANCOVA) on Listening Comprehension (Section 1) means using scores on Structure and…
Descriptors: Analysis of Covariance, Bayesian Statistics, English (Second Language), Language Proficiency
Rudner, Lawrence M. – 1978
Tailored testing provides the same information as group-administered standardized tests, but can do so using fewer items because the items administered are selected for the ability of the individual student. Thus, tailored testing offers several advantages over traditional methods. Because individual tailored tests are not timed, anxiety is…
Descriptors: Ability, Adaptive Testing, Bayesian Statistics, Computer Assisted Testing
DeAyala, R. J.; Koch, William R. – 1986
A computerized flexilevel test was implemented and its ability estimates were compared with those of a Bayesian estimation based computerized adaptive test (CAT) as well as with known true ability estimates. Results showed that when the flexilevel test was terminated according to Lord's criterion, its ability estimates were highly and…
Descriptors: Ability, Adaptive Testing, Bayesian Statistics, Comparative Analysis
Weiss, David J. – 1983
During 1975-1979 this research into the potential of computerized adaptive testing to reduce errors in the measurement of human capabilities used Marine recruits for a live-testing validity comparison of computerized adaptive and conventional tests. The program purposes were to: (1) identify the most useful computer-based adaptive testing…
Descriptors: Ability, Adaptive Testing, Adults, Bayesian Statistics
Educational Testing Service, Princeton, NJ. – 1971
The conference theme was "The Promise and Perils of Educational Information Systems," defined as collections of test data on knowledges, skills, interests, and attitudes maintained for the purpose of educational decision making. Topics covered were: "Longer Education: Thinner, Broader, or Higher" (Fritz Machlup); "Testing:…
Descriptors: Bayesian Statistics, Bias, Blacks, Conferences