ERIC - Search Results

Showing all 12 results

Large Sample Confidence Intervals for Item Response Theory Reliability Coefficients

Peer reviewed

Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018

In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…

Descriptors: Item Response Theory, Test Reliability, Test Items, Scores

Variance Difference between Maximum Likelihood Estimation Method and Expected A Posteriori Estimation Method Viewed from Number of Test Items

Peer reviewed

Mahmud, Jumailiyah; Sutikno, Muzayanah; Naga, Dali S. – Educational Research and Reviews, 2016

The aim of this study is to determine the difference in variance between the maximum likelihood and expected a posteriori estimation methods as a function of the number of test items on an aptitude test. The variance indicates the accuracy achieved by the maximum likelihood and Bayes estimation methods. The test consists of three subtests, each with 40 multiple-choice…

Descriptors: Maximum Likelihood Statistics, Computation, Item Response Theory, Test Items

Development and Validation of a Novice Teacher and Supervisor Survey

Finster, Matthew – Online Submission, 2017

This brief presents initial evidence about the reliability and validity of a novice teacher survey and a novice teacher supervisor survey. The novice teacher and novice teacher supervisor surveys assess how well prepared novice teachers are to meet the job requirements of teaching. The surveys are designed to provide educator preparation programs…

Descriptors: Test Construction, Test Validity, Teacher Surveys, Beginning Teachers

The Effects of Rater Severity and Rater Distribution on Examinees' Ability Estimation for Constructed-Response Items. Research Report. ETS RR-13-23

Peer reviewed

Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013

The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…

Descriptors: Test Format, Test Items, Responses, Computation

Developing a Scale of Perception of Sexual Abuse in Youth Sports (SPSAYS)

Peer reviewed

Baker, Thomas A., III.; Byon, Kevin K. – Measurement in Physical Education and Exercise Science, 2014

A scale was developed to measure perceptions of sexual abuse in youth sports by assessing (a) the perceived prevalence of sexual abuse committed by pedophilic youth sport coaches, (b) the perceived likelihood that a coach is a pedophile, (c) perceptions on how youth sport organizations should manage the risk of pedophilia, and (d) media influence…

Descriptors: Sexual Abuse, Test Construction, Attitude Measures, Incidence

The Learning Alliance Inventory: Instrument Development and Initial Validation

Peer reviewed

Rogers, Daniel T. – International Journal for the Scholarship of Teaching and Learning, 2012

Despite potential applications to educational contexts, the working alliance concept has largely been confined to psychotherapy intervention research. Some have explored theoretically related concepts (e.g., immediacy, rapport), but no measure currently exists of the working alliance between a teacher and student within an academic course. The aim…

Descriptors: Test Construction, Test Validity, Test Reliability, Psychometrics

The Impact of Missing Data on Sample Reliability Estimates: Implications for Reliability Reporting Practices

Peer reviewed

Enders, Craig K. – Educational and Psychological Measurement, 2004

A method for incorporating maximum likelihood (ML) estimation into reliability analyses with item-level missing data is outlined. An ML estimate of the covariance matrix is first obtained using the expectation maximization (EM) algorithm, and coefficient alpha is subsequently computed using standard formulae. A simulation study demonstrated that…

Descriptors: Intervals, Simulation, Test Reliability, Computation
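
The second step in the abstract above, computing coefficient alpha from a covariance matrix, can be sketched as follows. This is a minimal illustration, assuming the familiar covariance-based form of alpha, k/(k-1) * (1 - trace(S)/sum(S)); it is not taken from the article. The EM step that produces the covariance matrix is not shown, and the function name and the 3-item matrix are hypothetical.

```python
def coefficient_alpha(cov):
    """Coefficient alpha from a k x k item covariance matrix (list of lists).

    Assumes `cov` has already been estimated, for example by an ML/EM
    routine when item responses are missing; that step is not shown here.
    """
    k = len(cov)
    if k < 2:
        raise ValueError("alpha needs at least two items")
    if any(len(row) != k for row in cov):
        raise ValueError("covariance matrix must be square")
    # Sum of the item variances (the diagonal of the matrix).
    item_variance_sum = sum(cov[i][i] for i in range(k))
    # Variance of the total score (the sum of every matrix element).
    total_variance = sum(sum(row) for row in cov)
    return (k / (k - 1)) * (1 - item_variance_sum / total_variance)


# Hypothetical 3-item covariance matrix (illustrative values, not study data).
example_cov = [
    [1.0, 0.5, 0.5],
    [0.5, 1.0, 0.5],
    [0.5, 0.5, 1.0],
]
alpha = coefficient_alpha(example_cov)
```

Because the function takes only a covariance matrix, the same call would work whether that matrix comes from complete data or from a missing-data estimator.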

Multidimensional Adaptive Testing.

Peer reviewed

Segall, Daniel O. – Psychometrika, 1996

Maximum likelihood and Bayesian procedures are presented for item selection and scoring of multidimensional adaptive tests. A demonstration with simulated response data illustrates that multidimensional adaptive testing can provide equal or higher reliabilities with fewer items than are required in one-dimensional adaptive testing. (SLD)

Descriptors: Adaptive Testing, Bayesian Statistics, Computer Assisted Testing, Equations (Mathematics)

Factors Influencing the Psychometric Characteristics of an Adaptive Testing Strategy for Test Batteries.

Maurelli, Vincent A.; Weiss, David J. – 1981

A Monte Carlo simulation was conducted to assess the effects in an adaptive testing strategy for test batteries of varying subtest order, subtest termination criterion, and variable versus fixed entry on the psychometric properties of an existing achievement test battery. Comparisons were made among conventionally administered tests and adaptive…

Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Latent Trait Theory

A Comparison of a Bayesian and a Maximum Likelihood Tailored Testing Procedure.

McKinley, Robert L.; Reckase, Mark D. – 1981

A study was conducted to compare tailored testing procedures based on a Bayesian ability estimation technique and on a maximum likelihood ability estimation technique. The Bayesian tailored testing procedure selected items so as to minimize the posterior variance of the ability estimate distribution, while the maximum likelihood tailored testing…

Descriptors: Academic Ability, Adaptive Testing, Bayesian Statistics, Comparative Analysis

Operational Characteristics of a One-Parameter Tailored Testing Procedure. Research Report 79-2.

Patience, Wayne M.; Reckase, Mark D. – 1979

An experiment was performed with computer-generated data to investigate some of the operational characteristics of tailored testing as they are related to various provisions of the computer program and item pool. With respect to the computer program, two characteristics were varied: the size of the step of increase or decrease in item difficulty…

Descriptors: Adaptive Testing, Computer Assisted Testing, Difficulty Level, Error of Measurement

The CDC AIDS Survey: A Psychometric Critique.

Volkan, Kevin – 1989

The latent structure, reliability, and item discrimination of 33 items on a Centers for Disease Control (CDC) instrument representing knowledge, attitudes, and beliefs about the acquired immune deficiency syndrome (AIDS) were assessed. The study sample included 311 adolescents ranging from ages 12 to 19 years. Demographic characteristics of the…

Descriptors: Acquired Immune Deficiency Syndrome, Adolescents, At Risk Persons, Attitude Measures
