Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 2 |
| Since 2007 (last 20 years) | 6 |
Descriptor
Source
| Educational and Psychological… | 2 |
| ETS Research Report Series | 1 |
| Educational Research and… | 1 |
| International Journal for the… | 1 |
| Measurement in Physical… | 1 |
| Online Submission | 1 |
| Psychometrika | 1 |
Author
Publication Type
| Reports - Research | 10 |
| Journal Articles | 7 |
| Reports - Descriptive | 1 |
| Reports - Evaluative | 1 |
| Speeches/Meeting Papers | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Higher Education | 2 |
| Postsecondary Education | 2 |
Audience
Location
| Iowa | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| School and College Ability… | 1 |
What Works Clearinghouse Rating
Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018
In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…
Descriptors: Item Response Theory, Test Reliability, Test Items, Scores
Mahmud, Jumailiyah; Sutikno, Muzayanah; Naga, Dali S. – Educational Research and Reviews, 2016
The aim of this study is to determine variance difference between maximum likelihood and expected A posteriori estimation methods viewed from number of test items of aptitude test. The variance presents an accuracy generated by both maximum likelihood and Bayes estimation methods. The test consists of three subtests, each with 40 multiple-choice…
Descriptors: Maximum Likelihood Statistics, Computation, Item Response Theory, Test Items
Finster, Matthew – Online Submission, 2017
This brief presents initial evidence about the reliability and validity of a novice teacher survey and a novice teacher supervisor survey. The novice teacher and novice teacher supervisor surveys assess how well prepared novice teachers are to meet the job requirements of teaching. The surveys are designed to provide educator preparation programs…
Descriptors: Test Construction, Test Validity, Teacher Surveys, Beginning Teachers
Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013
The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…
Descriptors: Test Format, Test Items, Responses, Computation
Baker, Thomas A., III.; Byon, Kevin K. – Measurement in Physical Education and Exercise Science, 2014
A scale was developed to measure perceptions of sexual abuse in youth sports by assessing (a) the perceived prevalence of sexual abuse committed by pedophilic youth sport coaches, (b) the perceived likelihood that a coach is a pedophile, (c) perceptions on how youth sport organizations should manage the risk of pedophilia, and (d) media influence…
Descriptors: Sexual Abuse, Test Construction, Attitude Measures, Incidence
Rogers, Daniel T. – International Journal for the Scholarship of Teaching and Learning, 2012
Despite potential applications to educational contexts, the working alliance concept has largely been confined to psychotherapy intervention research. Some have explored theoretically related concepts (e.g., immediacy, rapport), but no measure currently exists of the working alliance between a teacher and student within an academic course. The aim…
Descriptors: Test Construction, Test Validity, Test Reliability, Psychometrics
Enders, Craig K. – Educational and Psychological Measurement, 2004
A method for incorporating maximum likelihood (ML) estimation into reliability analyses with item-level missing data is outlined. An ML estimate of the covariance matrix is first obtained using the expectation maximization (EM) algorithm, and coefficient alpha is subsequently computed using standard formulae. A simulation study demonstrated that…
Descriptors: Intervals, Simulation, Test Reliability, Computation
Peer reviewedSegall, Daniel O. – Psychometrika, 1996
Maximum likelihood and Bayesian procedures are presented for item selection and scoring of multidimensional adaptive tests. A demonstration with simulated response data illustrates that multidimensional adaptive testing can provide equal or higher reliabilities with fewer items than are required in one-dimensional adaptive testing. (SLD)
Descriptors: Adaptive Testing, Bayesian Statistics, Computer Assisted Testing, Equations (Mathematics)
Maurelli, Vincent A.; Weiss, David J. – 1981
A monte carlo simulation was conducted to assess the effects in an adaptive testing strategy for test batteries of varying subtest order, subtest termination criterion, and variable versus fixed entry on the psychometric properties of an existent achievement test battery. Comparisons were made among conventionally administered tests and adaptive…
Descriptors: Achievement Tests, Adaptive Testing, Computer Assisted Testing, Latent Trait Theory
McKinley, Robert L.; Reckase, Mark D. – 1981
A study was conducted to compare tailored testing procedures based on a Bayesian ability estimation technique and on a maximum likelihood ability estimation technique. The Bayesian tailored testing procedure selected items so as to minimize the posterior variance of the ability estimate distribution, while the maximum likelihood tailored testing…
Descriptors: Academic Ability, Adaptive Testing, Bayesian Statistics, Comparative Analysis
Patience, Wayne M.; Reckase, Mark D. – 1979
An experiment was performed with computer-generated data to investigate some of the operational characteristics of tailored testing as they are related to various provisions of the computer program and item pool. With respect to the computer program, two characteristics were varied: the size of the step of increase or decrease in item difficulty…
Descriptors: Adaptive Testing, Computer Assisted Testing, Difficulty Level, Error of Measurement
Volkan, Kevin – 1989
The latent structure, reliability, and item discrimination of 33 items on a Centers for Disease Control (CDC) instrument representing knowledge, attitudes, and beliefs about the acquired immune deficiency syndrome (AIDS) were assessed. The study sample included 311 adolescents ranging from ages 12 to 19 years. Demographic characteristics of the…
Descriptors: Acquired Immune Deficiency Syndrome, Adolescents, At Risk Persons, Attitude Measures

Direct link
