ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	4
Since 2017 (last 10 years)	4
Since 2007 (last 20 years)	4

Descriptor

Bayesian Statistics	13
Comparative Analysis	13
Test Construction	13
Mathematical Models	5
Test Items	5
Test Reliability	5
Adaptive Testing	4
Computer Assisted Testing	4
Estimation (Mathematics)	4
Criterion Referenced Tests	3
Higher Education	3
Item Analysis	3
Latent Trait Theory	3
Maximum Likelihood Statistics	3
Scores	3
Simulation	3
Computer Simulation	2
Decision Making	2
Difficulty Level	2
Individual Differences	2
Language Proficiency	2
Language Tests	2
Measurement Techniques	2
Monte Carlo Methods	2
Scoring	2
More ▼

Source

Applied Measurement in…	1
Education and Information…	1
International Educational…	1
International Journal of…	1
Journal of Educational…	1
Physical Review Physics…	1

Publication Type

Reports - Research	8
Journal Articles	5
Reports - Evaluative	3
Speeches/Meeting Papers	3
Information Analyses	1
Reports - Descriptive	1

Education Level

Elementary Secondary Education

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Michigan Test of English…	1
School and College Ability…	1

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Comparison of Item Response Theory Ability and Item Parameters According to Classical and Bayesian Estimation Methods

Peer reviewed
PDF on ERIC

Download full text

Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024

This research aims to compare the ability and item parameter estimations of Item Response Theory according to Maximum likelihood and Bayesian approaches in different Monte Carlo simulation conditions. For this purpose, depending on the changes in the priori distribution type, sample size, test length, and logistics model, the ability and item…

Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues

Peer reviewed
PDF on ERIC

Download full text

Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022

How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…

Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making

Theoretical Model and Quantitative Assessment of Scientific Thinking and Reasoning

Peer reviewed

Direct link

Bao, Lei; Koenig, Kathleen; Xiao, Yang; Fritchman, Joseph; Zhou, Shaona; Chen, Cheng – Physical Review Physics Education Research, 2022

Abilities in scientific thinking and reasoning have been emphasized as core areas of initiatives, such as the Next Generation Science Standards or the College Board Standards for College Success in Science, which focus on the skills the future will demand of today's students. Although there is rich literature on studies of how these abilities…

Descriptors: Physics, Science Instruction, Teaching Methods, Thinking Skills

Computerized Mastery Testing Using Fuzzy Set Decision Theory.

Peer reviewed

Du, Yi; And Others – Applied Measurement in Education, 1993

A new computerized mastery test is described that builds on the Lewis and Sheehan procedure (sequential testlets) (1990), but uses fuzzy set decision theory to determine stopping rules and the Rasch model to calibrate items and estimate abilities. Differences between fuzzy set and Bayesian methods are illustrated through an example. (SLD)

Descriptors: Bayesian Statistics, Comparative Analysis, Computer Assisted Testing, Estimation (Mathematics)

Bayesian Estimation in the Rasch Model.

Peer reviewed

Swaminathan, Hariharan; Gifford, Janice A. – Journal of Educational Statistics, 1982

Bayesian estimation procedures based on a hierarchical model for estimating parameters in the Rasch model are described. It is shown that the Bayesian procedures result in estimates with superior statistical characteristics. (Author/JKS)

Descriptors: Bayesian Statistics, Comparative Analysis, Estimation (Mathematics), Item Analysis

A Comparison of Bayesian and Traditional Indexes of Test Item Effectiveness.

Download full text

Helmstadter, Gerald C. – 1974

Bayes Theorem leads to three indexes of item effectiveness: 1) probability that an examinee knows the content given that the correct response was selected; 2) probability that an examinee does not know that content given that an incorrect response was selected; and 3) probability of making a correct decision about the examinee's knowledge given…

Descriptors: Achievement Tests, Bayesian Statistics, Comparative Analysis, Criterion Referenced Tests

Assessing the Reliability of Computer Adaptive Testing Branching Algorithms Using HyperCAT.

Shermis, Mark D.; And Others – 1992

The reliability of four branching algorithms commonly used in computer adaptive testing (CAT) was examined. These algorithms were: (1) maximum likelihood (MLE); (2) Bayesian; (3) modal Bayesian; and (4) crossover. Sixty-eight undergraduate college students were randomly assigned to one of the four conditions using the HyperCard-based CAT program,…

Descriptors: Adaptive Testing, Algorithms, Bayesian Statistics, Comparative Analysis

A Comparison of Instructional Sensitivity Indices.

Perkins, Kyle – 1984

Instructional sensitivity indices are compared from four perspectives: (1) criterion-referenced testing; (2) classical test theory; (3) item response theory; and (4) Bayesian theory. Instructional sensitivity is defined as the tendency of a test item to vary in difficulty as a function of instruction. The instructional sensitivity of the items in…

Descriptors: Bayesian Statistics, Comparative Analysis, Criterion Referenced Tests, Difficulty Level

A Comparison of a Bayesian and a Maximum Likelihood Tailored Testing Procedure.

Download full text

McKinley, Robert L.; Reckase, Mark D. – 1981

A study was conducted to compare tailored testing procedures based on a Bayesian ability estimation technique and on a maximum likelihood ability estimation technique. The Bayesian tailored testing procedure selected items so as to minimize the posterior variance of the ability estimate distribution, while the maximum likelihood tailored testing…

Descriptors: Academic Ability, Adaptive Testing, Bayesian Statistics, Comparative Analysis

Estimation of Ability Level by Using Only Observable Quantities in Adaptive Testing.

Download full text

Kirisci, Levent; Hsu, Tse-Chi – 1992

A predictive adaptive testing (PAT) strategy was developed based on statistical predictive analysis, and its feasibility was studied by comparing PAT performance to those of the Flexilevel, Bayesian modal, and expected a posteriori (EAP) strategies in a simulated environment. The proposed adaptive test is based on the idea of using item difficulty…

Descriptors: Adaptive Testing, Bayesian Statistics, Comparative Analysis, Computer Assisted Testing

Computerized Ability Testing, 1972-1975. Final Report.

Download full text

Weiss, David J. – 1976

Three and one-half years of research on computerized ability testing are summarized. The original objectives of the research were: (1) to develop and implement the stratified computer-based ability test; (2) to compare, on psychometric criteria, the various approaches to computer-based ability testing, including the stratified computerized test,…

Descriptors: Adaptive Testing, Bayesian Statistics, Branching, Comparative Analysis

Criterion-Referenced Measurement.

Millman, Jason – 1974

This chapter should not only acquaint the reader with the present state of the art on Criterion-Referenced (CR) measurement but also suggest possible directions for further inquiry. The goal of the first part of this chapter is to deal with the definitional dilemma of CR measurement by proceeding from the more traditional view of CR measurement to…

Descriptors: Analysis of Variance, Bayesian Statistics, Behavioral Objectives, Comparative Analysis

Bao, Lei	1
Chen, Cheng	1
Du, Yi	1
Eray Selçuk	1
Ergül Demir	1
Fritchman, Joseph	1
Gelbal, Selahattin	1
Gifford, Janice A.	1
Helmstadter, Gerald C.	1
Hsu, Tse-Chi	1
Kirisci, Levent	1
Koenig, Kathleen	1
McKinley, Robert L.	1
Millman, Jason	1
Ozdemir, Burhanettin	1
Perkins, Kyle	1
Piech, Chris	1
Reckase, Mark D.	1
Shermis, Mark D.	1
Swaminathan, Hariharan	1
Tack, Anaïs	1
Weiss, David J.	1
Xiao, Yang	1
Zhou, Shaona	1
More ▼