ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	2
Since 2017 (last 10 years)	3
Since 2007 (last 20 years)	6

Descriptor

Bayesian Statistics	10
Comparative Analysis	10
Test Reliability	10
Test Construction	5
Test Validity	5
Adaptive Testing	3
Computer Assisted Testing	3
Higher Education	3
Maximum Likelihood Statistics	3
Scores	3
Item Response Theory	2
Mathematical Models	2
Probability	2
Psychometrics	2
Simulation	2
Statistical Analysis	2
Test Format	2
Test Items	2
Test Length	2
Undergraduate Students	2
21st Century Skills	1
Abstract Reasoning	1
Academic Ability	1
Academic Libraries	1
Accuracy	1
More ▼

Source

British Journal of Guidance &…	1
College & Research Libraries	1
Education and Information…	1
Educational and Psychological…	1
IEEE Transactions on Learning…	1
Physical Review Physics…	1

Publication Type

Journal Articles	6
Reports - Research	6
Reports - Descriptive	2
Reports - Evaluative	2
Speeches/Meeting Papers	2

Education Level

Higher Education	2
Postsecondary Education	2
Elementary Secondary Education	1

Audience

Location

South Africa	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

School and College Ability…

What Works Clearinghouse Rating

Showing all 10 results Save | Export

Measuring Language Ability of Students with Compensatory Multidimensional CAT: A Post-Hoc Simulation Study

Peer reviewed

Direct link

Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022

The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…

Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency

Theoretical Model and Quantitative Assessment of Scientific Thinking and Reasoning

Peer reviewed

Direct link

Bao, Lei; Koenig, Kathleen; Xiao, Yang; Fritchman, Joseph; Zhou, Shaona; Chen, Cheng – Physical Review Physics Education Research, 2022

Abilities in scientific thinking and reasoning have been emphasized as core areas of initiatives, such as the Next Generation Science Standards or the College Board Standards for College Success in Science, which focus on the skills the future will demand of today's students. Although there is rich literature on studies of how these abilities…

Descriptors: Physics, Science Instruction, Teaching Methods, Thinking Skills

How Large Is the "Public Domain"? A Comparative Analysis of Ringer's 1961 Copyright Renewal Study and HathiTrust CRMS Data

Peer reviewed

Direct link

Wilkin, John P. – College & Research Libraries, 2017

The 1961 Copyright Office study on renewals, authored by Barbara Ringer, has cast an outsized influence on discussions of the U.S. 1923-1963 public domain. As more concrete data emerge from initiatives such as the large-scale determination process in the Copyright Review Management System (CRMS) project, questions are raised about the reliability…

Descriptors: Comparative Analysis, Copyrights, Misconceptions, Test Reliability

Item Response Theory for Peer Assessment

Peer reviewed

Direct link

Uto, Masaki; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2016

As an assessment method based on a constructivist approach, peer assessment has become popular in recent years. However, in peer assessment, a problem remains that reliability depends on the rater characteristics. For this reason, some item response models that incorporate rater parameters have been proposed. Those models are expected to improve…

Descriptors: Item Response Theory, Peer Evaluation, Bayesian Statistics, Simulation

Student Wellbeing at a University in Post-Apartheid South Africa: A Comparison with a British University Sample Using the GP-CORE Measure

Peer reviewed

Direct link

Young, Charles; Campbell, Megan – British Journal of Guidance & Counselling, 2014

This article provides GP-CORE norms for a South African university sample, which are compared to published data obtained from a United Kingdom university sample. The measure appears to be both reliable and valid for this multilingual and multicultural South African sample. The profiles of the psychological distress reported by white South African…

Descriptors: Foreign Countries, Well Being, Comparative Analysis, Psychological Needs

A Comparison of Approaches for Improving the Reliability of Objective Level Scores

Peer reviewed

Direct link

Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010

This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…

Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores

A Comparison of a Maximum Likelihood and a Bayesian Ability Estimation Procedure for Tailored Testing.

Download full text

Rosso, Martin A.; Reckase, Mark D. – 1981

The overall purpose of this research was to compare a maximum likelihood based tailored testing procedure to a Bayesian tailored testing procedure. The results indicated that both tailored testing procedures produced equally reliable ability estimates. Also an analysis of test length indicated that reasonable ability estimates could be obtained…

Descriptors: Adaptive Testing, Bayesian Statistics, Comparative Analysis, Computer Assisted Testing

Assessing the Reliability of Computer Adaptive Testing Branching Algorithms Using HyperCAT.

Shermis, Mark D.; And Others – 1992

The reliability of four branching algorithms commonly used in computer adaptive testing (CAT) was examined. These algorithms were: (1) maximum likelihood (MLE); (2) Bayesian; (3) modal Bayesian; and (4) crossover. Sixty-eight undergraduate college students were randomly assigned to one of the four conditions using the HyperCard-based CAT program,…

Descriptors: Adaptive Testing, Algorithms, Bayesian Statistics, Comparative Analysis

A Comparison of a Bayesian and a Maximum Likelihood Tailored Testing Procedure.

Download full text

McKinley, Robert L.; Reckase, Mark D. – 1981

A study was conducted to compare tailored testing procedures based on a Bayesian ability estimation technique and on a maximum likelihood ability estimation technique. The Bayesian tailored testing procedure selected items so as to minimize the posterior variance of the ability estimate distribution, while the maximum likelihood tailored testing…

Descriptors: Academic Ability, Adaptive Testing, Bayesian Statistics, Comparative Analysis

Criterion-Referenced Measurement.

Millman, Jason – 1974

This chapter should not only acquaint the reader with the present state of the art on Criterion-Referenced (CR) measurement but also suggest possible directions for further inquiry. The goal of the first part of this chapter is to deal with the definitional dilemma of CR measurement by proceeding from the more traditional view of CR measurement to…

Descriptors: Analysis of Variance, Bayesian Statistics, Behavioral Objectives, Comparative Analysis

Reckase, Mark D.	2
Bao, Lei	1
Campbell, Megan	1
Carvajal, Jorge	1
Chen, Cheng	1
Fritchman, Joseph	1
Gelbal, Selahattin	1
Koenig, Kathleen	1
McKinley, Robert L.	1
Millman, Jason	1
Ozdemir, Burhanettin	1
Rosso, Martin A.	1
Shermis, Mark D.	1
Skorupski, William P.	1
Ueno, Maomi	1
Uto, Masaki	1
Wilkin, John P.	1
Xiao, Yang	1
Young, Charles	1
Zhou, Shaona	1
More ▼