Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 4 |
Descriptor
| Hypothesis Testing | 7 |
| Test Length | 7 |
| Sample Size | 3 |
| Cutting Scores | 2 |
| Item Response Theory | 2 |
| Mastery Tests | 2 |
| Mathematical Models | 2 |
| Probability | 2 |
| Simulation | 2 |
| Academic Ability | 1 |
| Access to Information | 1 |
Author
| Cohen, Allan S. | 1 |
| Doebler, Anna | 1 |
| Doebler, Philipp | 1 |
| Downey, W. Scott | 1 |
| Epstein, Kenneth I. | 1 |
| Finch, Holmes | 1 |
| Frick, Theodore W. | 1 |
| Hambleton, Ronald K. | 1 |
| Holling, Heinz | 1 |
| Maxwell, Scott E. | 1 |
| Monahan, Patrick O. | 1 |
Publication Type
| Reports - Research | 6 |
| Journal Articles | 4 |
| Reports - Evaluative | 2 |
| Speeches/Meeting Papers | 2 |
Education Level
| Higher Education | 1 |
Audience
| Researchers | 1 |
Location
| Germany | 1 |
Doebler, Anna; Doebler, Philipp; Holling, Heinz – Psychometrika, 2013
The common way to calculate confidence intervals for item response theory models is to assume that the standardized maximum likelihood estimator for the person parameter [theta] is normally distributed. However, this approximation is often inadequate for short and medium test lengths. As a result, the coverage probabilities fall below the given…
Descriptors: Foreign Countries, Item Response Theory, Computation, Hypothesis Testing
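The inadequate coverage of the normal approximation for short tests, as discussed by Doebler, Doebler, and Holling, can be illustrated with a small simulation. This is a minimal sketch under simplifying assumptions not taken from the paper: a Rasch (1PL) model, hypothetical item difficulties, a grid-search MLE, and the standard Wald interval built from the Fisher information.

```python
import numpy as np

rng = np.random.default_rng(0)

def rasch_p(theta, b):
    # probability of a correct response under the Rasch model
    return 1.0 / (1.0 + np.exp(-(theta - b)))

def mle_theta(x, b, grid=np.linspace(-6.0, 6.0, 1201)):
    # maximum-likelihood estimate of theta by grid search (fine for a sketch)
    p = rasch_p(grid[:, None], b[None, :])
    ll = (x * np.log(p) + (1.0 - x) * np.log(1.0 - p)).sum(axis=1)
    return grid[np.argmax(ll)]

def wald_ci(theta_hat, b, z=1.96):
    # normal-approximation CI: theta_hat +/- z / sqrt(Fisher information)
    p = rasch_p(theta_hat, b)
    se = 1.0 / np.sqrt(np.sum(p * (1.0 - p)))
    return theta_hat - z * se, theta_hat + z * se

n_items, theta_true = 10, 0.0            # a short test
b = rng.uniform(-2.0, 2.0, n_items)      # hypothetical item difficulties
covered = trials = 0
for _ in range(2000):
    x = (rng.random(n_items) < rasch_p(theta_true, b)).astype(float)
    if x.sum() in (0.0, float(n_items)):  # MLE is infinite for perfect scores
        continue
    lo, hi = wald_ci(mle_theta(x, b), b)
    trials += 1
    covered += lo <= theta_true <= hi
print(f"empirical coverage: {covered / trials:.3f}  (nominal 0.95)")
```

With only 10 items the empirical coverage typically falls short of the nominal 95%, which is the phenomenon the abstract describes.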
Downey, W. Scott; Schetzsle, Stacey – Marketing Education Review, 2012
Asynchronous assessment, which includes quizzes or exams online or outside class, offers marketing educators an opportunity to make more efficient use of class time and to enhance students' learning experiences by giving them more flexibility and choice in their assessment environment. In this paper, we examine the performance difference between…
Descriptors: Marketing, Lecture Method, Asynchronous Communication, Synchronous Communication
Wells, Craig S.; Cohen, Allan S.; Patton, Jeffrey – International Journal of Testing, 2009
A primary concern with testing differential item functioning (DIF) using a traditional point-null hypothesis is that a statistically significant result does not imply that the magnitude of DIF is of practical interest. Similarly, for a given sample size, a non-significant result does not allow the researcher to conclude the item is free of DIF. To…
Descriptors: Test Bias, Test Items, Statistical Analysis, Hypothesis Testing
Monahan, Patrick O.; Stump, Timothy E.; Finch, Holmes; Hambleton, Ronald K. – Applied Psychological Measurement, 2007
DETECT is a nonparametric "full" dimensionality assessment procedure that clusters dichotomously scored items into dimensions and provides a DETECT index of magnitude of multidimensionality. Four factors (test length, sample size, item response theory [IRT] model, and DETECT index) were manipulated in a Monte Carlo study of bias, standard error,…
Descriptors: Test Length, Sample Size, Monte Carlo Methods, Geometric Concepts
Maxwell, Scott E. – 1979
Arguments have recently been put forth that standard textbook procedures for determining the sample size necessary to achieve a certain level of power in a completely randomized design are incorrect when the dependent variable is fallible because they ignore measurement error. In fact, however, there are several correct procedures, one of which is…
Descriptors: Hypothesis Testing, Mathematical Formulas, Power (Statistics), Predictor Variables
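One standard correction of the kind Maxwell discusses is to attenuate the true standardized effect by the square root of the measure's reliability before computing sample size. This sketch uses the usual normal-approximation formula for a two-sample t-test; the specific numbers are illustrative, not taken from the paper.

```python
from math import ceil, sqrt
from statistics import NormalDist

def n_per_group(d, alpha=0.05, power=0.80):
    # normal-approximation sample size per group for a two-sample t-test
    z = NormalDist().inv_cdf
    za, zb = z(1 - alpha / 2), z(power)
    return ceil(2 * ((za + zb) / d) ** 2)

def n_with_error(d_true, reliability, **kw):
    # measurement error shrinks the observed effect by sqrt(reliability)
    return n_per_group(d_true * sqrt(reliability), **kw)

print(n_per_group(0.5))          # perfectly reliable measure -> 63 per group
print(n_with_error(0.5, 0.7))    # fallible measure (rho = .70) -> 90 per group
```

Ignoring measurement error here understates the required sample size by roughly 40%, which is why the "standard textbook procedures" the abstract mentions can mislead.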
Epstein, Kenneth I.; Steinheiser, Frederick H., Jr. – 1978
A multiparameter, programmable model was developed to examine the interactive influence of certain parameters on the probability of deciding that an examinee had attained a specified degree of mastery. It was applied within the simulated context of performance testing of military trainees. These parameters included: (1) the number of assumed…
Descriptors: Academic Ability, Bayesian Statistics, Cutting Scores, Hypothesis Testing
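Epstein and Steinheiser's model is multiparameter, but the core Bayesian mastery decision can be sketched with a much simpler beta-binomial stand-in: put a Beta prior on the examinee's true proportion-correct and compute the posterior probability that it exceeds the cutting score. This simplification is mine, not the authors' model.

```python
def posterior_mastery(correct, n, cut=0.8, a=1.0, b=1.0, steps=10_000):
    """P(true proportion >= cut | data), with a Beta(a, b) prior.

    The posterior is Beta(a + correct, b + n - correct); the tail
    probability is computed by midpoint-rule numerical integration
    so the sketch needs no external libraries.
    """
    aa, bb = a + correct, b + (n - correct)
    xs = [(i + 0.5) / steps for i in range(steps)]
    dens = [x ** (aa - 1) * (1 - x) ** (bb - 1) for x in xs]
    tail = sum(d for x, d in zip(xs, dens) if x >= cut)
    return tail / sum(dens)

# 9 of 10 items correct, mastery cut at 80% true proportion
print(round(posterior_mastery(9, 10), 3))
```

A decision rule then declares mastery when this posterior probability clears some threshold (e.g. .90); the interacting parameters in the abstract — number of items, cut score, prior — all enter this calculation.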
Frick, Theodore W. – 1986
The sequential probability ratio test (SPRT), developed by Abraham Wald, is one statistical model available for making mastery decisions during computer-based criterion referenced tests. The predictive validity of the SPRT was empirically investigated with two different and relatively large item pools with heterogeneous item parameters. Graduate…
Descriptors: Achievement Tests, Adaptive Testing, Classification, Comparative Analysis
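Wald's SPRT, as applied to mastery decisions in Frick's study, accumulates a log-likelihood ratio over item responses and stops as soon as it crosses either decision bound. The mastery and non-mastery proportions and error rates below are illustrative defaults, not values from the study.

```python
from math import log

def sprt_mastery(responses, p1=0.85, p0=0.60, alpha=0.05, beta=0.05):
    """Wald's SPRT: master (true proportion p1) vs nonmaster (p0).

    Returns (decision, number of items administered).
    """
    upper = log((1 - beta) / alpha)      # accept mastery at/above this
    lower = log(beta / (1 - alpha))      # accept non-mastery at/below this
    llr = 0.0
    for i, x in enumerate(responses, 1):
        llr += log(p1 / p0) if x else log((1 - p1) / (1 - p0))
        if llr >= upper:
            return "master", i
        if llr <= lower:
            return "nonmaster", i
    return "undecided", len(responses)

print(sprt_mastery([1] * 12))   # consistent successes -> early mastery call
print(sprt_mastery([0] * 6))    # consistent failures -> early non-mastery call
```

The attraction for computer-based criterion-referenced testing is the stopping rule: clear-cut examinees are classified after only a handful of items, while borderline cases keep receiving items.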