ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	4
Since 2007 (last 20 years)	10

Descriptor

Reliability	17
Sample Size	17
Test Items	17
Statistical Analysis	7
Sampling	6
Correlation	5
Item Analysis	5
Simulation	5
Estimation (Mathematics)	4
Classification	3
Comparative Analysis	3
Computation	3
Difficulty Level	3
Foreign Countries	3
Goodness of Fit	3
Latent Trait Theory	3
Mathematical Models	3
Psychometrics	3
Test Construction	3
Accuracy	2
Achievement Tests	2
Computer Software	2
Diagnostic Tests	2
Equated Scores	2
Higher Education	2
More ▼

Source

Educational and Psychological…	2
ProQuest LLC	2
American Journal of…	1
ETS Research Report Series	1
International Journal of…	1
International Journal of…	1
Journal of Education and…	1
Measurement:…	1
Psychometrika	1

Publication Type

Reports - Research	12
Journal Articles	9
Reports - Evaluative	3
Dissertations/Theses -…	2
Speeches/Meeting Papers	2
Numerical/Quantitative Data	1

Education Level

Higher Education	2
Postsecondary Education	2
Elementary Education	1

Audience

Researchers

Location

Australia	1
California	1
India	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

A Simulation Study on the Performance of Different Reliability Estimation Methods

Peer reviewed

Direct link

Edwards, Ashley A.; Joyner, Keanan J.; Schatschneider, Christopher – Educational and Psychological Measurement, 2021

The accuracy of certain internal consistency estimators have been questioned in recent years. The present study tests the accuracy of six reliability estimators (Cronbach's alpha, omega, omega hierarchical, Revelle's omega, and greatest lower bound) in 140 simulated conditions of unidimensional continuous data with uncorrelated errors with varying…

Descriptors: Reliability, Computation, Accuracy, Sample Size

There Are Many Greater Lower Bounds than Cronbach's [alpha]: A Monte Carlo Simulation Study

Peer reviewed

Direct link

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

Evaluating the Effectiveness of the Expectation-Maximization (EM) Algorithm for Bayesian Network Calibration

Direct link

Tingir, Seyfullah – ProQuest LLC, 2019

Educators use various statistical techniques to explain relationships between latent and observable variables. One way to model these relationships is to use Bayesian networks as a scoring model. However, adjusting the conditional probability tables (CPT-parameters) to fit a set of observations is still a challenge when using Bayesian networks. A…

Descriptors: Bayesian Statistics, Statistical Analysis, Scoring, Probability

Diagnostic Classification Models: Recent Developments, Practical Issues, and Prospects

Peer reviewed

Direct link

Ravand, Hamdollah; Baghaei, Purya – International Journal of Testing, 2020

More than three decades after their introduction, diagnostic classification models (DCM) do not seem to have been implemented in educational systems for the purposes they were devised. Most DCM research is either methodological for model development and refinement or retrofitting to existing nondiagnostic tests and, in the latter case, basically…

Descriptors: Classification, Models, Diagnostic Tests, Test Construction

Use of Jackknifing to Evaluate Effects of Anchor Item Selection on Equating with the Nonequivalent Groups with Anchor Test (NEAT) Design. Research Report. ETS RR-15-10

Peer reviewed
PDF on ERIC

Download full text

Lu, Ru; Haberman, Shelby; Guo, Hongwen; Liu, Jinghua – ETS Research Report Series, 2015

In this study, we apply jackknifing to anchor items to evaluate the impact of anchor selection on equating stability. In an ideal world, the choice of anchor items should have little impact on equating results. When this ideal does not correspond to reality, selection of anchor items can strongly influence equating results. This influence does not…

Descriptors: Test Construction, Equated Scores, Test Items, Sampling

Peer reviewed
PDF on ERIC

Download full text

Kalender, Ilker – International Journal of Higher Education, 2015

Student evaluations of teaching (SET) have been the principal instrument to elicit students' opinions in higher education institutions. Many decisions, including high-stake ones, are made based on SET scores reported by students. In this respect, reliability of SET scores is of considerable importance. This paper has an argument that there are…

Descriptors: Higher Education, Reliability, Test Items, Measurement

Coefficient Omega Bootstrap Confidence Intervals: Nonnormal Distributions

Peer reviewed

Direct link

Padilla, Miguel A.; Divers, Jasmin – Educational and Psychological Measurement, 2013

The performance of the normal theory bootstrap (NTB), the percentile bootstrap (PB), and the bias-corrected and accelerated (BCa) bootstrap confidence intervals (CIs) for coefficient omega was assessed through a Monte Carlo simulation under conditions not previously investigated. Of particular interests were nonnormal Likert-type and binary items.…

Descriptors: Sampling, Statistical Inference, Computation, Statistical Analysis

Performance Assessment of High and Low Income Families through "Online RAW Achievement Battery Test" of Primary Grade Students

Peer reviewed
PDF on ERIC

Download full text

Ahmed, Tamim; Hanif, Maria – Journal of Education and Practice, 2016

This study is intended to investigate student's achievement capability among two families i.e. Low and High income families and designed for primary level learners. A Reading, Arithmetic and Writing (RAW) Achievement test that was developed as a part of another research study (Tamim Ahmed Khan, 2015) was adopted for this study. Both English medium…

Descriptors: Low Income, Performance Based Assessment, Elementary School Students, Achievement Tests

Model Choice and Sample Size in Item Response Theory Analysis of Aphasia Tests

Peer reviewed

Direct link

Hula, William D.; Fergadiotis, Gerasimos; Martin, Nadine – American Journal of Speech-Language Pathology, 2012

Purpose: The purpose of this study was to identify the most appropriate item response theory (IRT) measurement model for aphasia tests requiring 2-choice responses and to determine whether small samples are adequate for estimating such models. Method: Pyramids and Palm Trees (Howard & Patterson, 1992) test data that had been collected from…

Descriptors: Sample Size, Guessing (Tests), Aphasia, Item Response Theory

Disability as Diversity: Assessing the Perceptions of Students with Physical Disabilities regarding Access and Equal Opportunity in Postsecondary Education

Direct link

Cooper, Lisa Marie – ProQuest LLC, 2012

The initial purpose of this study was to utilize the Higher Education and Students with Physical Disabilities Survey (HESPDS) to develop a better understanding of the perceptions of students with physical disabilities regarding the extent to which private, residential colleges and universities provide access and equal opportunity. The significance…

Descriptors: Factor Analysis, Higher Education, Student Attitudes, Physical Disabilities

Joint Consistency of Nonparametric Item Characteristic Curve and Ability Estimation.

Peer reviewed

Douglas, Jeff – Psychometrika, 1997

Explores the asymptotic theory of a method of nonparametric item characteristic curve estimation based on kernel smoothing with the theory of obtaining the proper linear ordering of examinees with respect to their true latent abilities. Results support the usefulness of estimates of the type produced by the TESTGRAF program. (SLD)

Descriptors: Ability, Estimation (Mathematics), Nonparametric Statistics, Reliability

The Stability of Four Methods for Estimating Item Bias.

Download full text

Bezruczko, Nikolaus; And Others – 1989

The stability of bias estimates from J. Schueneman's chi-square method, the transformed Delta method, Rasch's one-parameter residual analysis, and the Mantel-Haenszel procedure, were compared across small and large samples for a data set of 30,000 cases. Bias values for 30 samples were estimated for each method, and means and variances of item…

Descriptors: Chi Square, Classification, Estimation (Mathematics), Identification

Accuracy of Estimating Two Parameter Logistic Latent Trait Parameters and Implications for Classroom Tests.

Download full text

Kolen, Michael J.; Whitney, Douglas R. – 1978

The application of latent trait theory to classroom tests necessitates the use of small sample sizes for parameter estimation. Computer generated data were used to assess the accuracy of estimation of the slope and location parameters in the two parameter logistic model with fixed abilities and varying small sample sizes. The maximum likelihood…

Descriptors: Difficulty Level, Item Analysis, Latent Trait Theory, Mathematical Models

A Comparison of the One- and Three-Parameter Logistic Models for Item Calibration.

Download full text

Reckase, Mark D. – 1978

Five comparisons were made relative to the quality of estimates of ability parameters and item calibrations obtained from the one-parameter and three-parameter logistic models. The results indicate: (1) The three-parameter model fit the test data better in all cases than did the one-parameter model. For simulation data sets, multi-factor data were…

Descriptors: Comparative Analysis, Goodness of Fit, Item Analysis, Mathematical Models

Item Characteristic Curve Parameters: Effects of Sample Size on Linear Equating.

Download full text

Ree, Malcom James; Jensen, Harald E. – 1980

By means of computer simulation of test responses, the reliability of item analysis data and the accuracy of equating were examined for hypothetical samples of 250, 500, 1000, and 2000 subjects for two tests with 20 equating items plus 60 additional items on the same scale. Birnbaum's three-parameter logistic model was used for the simulation. The…

Descriptors: Computer Assisted Testing, Equated Scores, Error of Measurement, Item Analysis

Previous Page | Next Page »

Pages: 1 | 2

Ahmed, Tamim	1
Baghaei, Purya	1
Bezruczko, Nikolaus	1
Cooper, Lisa Marie	1
Divers, Jasmin	1
Douglas, Jeff	1
Edwards, Ashley A.	1
Farish, Stephen J.	1
Fergadiotis, Gerasimos	1
Guo, Hongwen	1
Haberman, Shelby	1
Hanif, Maria	1
Hula, William D.	1
Jensen, Harald E.	1
Joyner, Keanan J.	1
Kalender, Ilker	1
Kolen, Michael J.	1
Liu, Jinghua	1
Lu, Ru	1
Martin, Nadine	1
Novak, Josip	1
Padilla, Miguel A.	1
Ravand, Hamdollah	1
Rebernjak, Blaž	1
More ▼