ERIC - Search Results

Showing all 3 results

The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues

Peer reviewed

Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022

How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…

Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making

Specifying and Refining a Measurement Model for a Simulation-Based Assessment. CSE Report 619.

Levy, Roy; Mislevy, Robert J. – US Department of Education, 2004

The challenges of modeling students' performance in simulation-based assessments include accounting for multiple aspects of knowledge and skill that arise in different situations and the conditional dependencies among multiple aspects of performance in a complex assessment. This paper describes a Bayesian approach to modeling and estimating…

Descriptors: Probability, Markov Processes, Monte Carlo Methods, Bayesian Statistics

A Simulation-Based Comparison of Several Stochastic Linear Regression Methods in the Presence of Outliers.

Rule, David L. – 1993

Several regression methods were examined within the framework of weighted structural regression (WSR), comparing their regression weight stability and score estimation accuracy in the presence of outlier contamination. The methods compared are: (1) ordinary least squares; (2) WSR ridge regression; (3) minimum risk regression; (4) minimum risk 2;…

Descriptors: Analysis of Covariance, Bayesian Statistics, Comparative Analysis, Computer Simulation