Showing 1 to 15 of 129 results
Peer reviewed
Widaman, Keith F. – Educational and Psychological Measurement, 2023
The import or force of the result of a statistical test has long been portrayed as consistent with deductive reasoning. The simplest form of deductive argument has a first premise with conditional form, such as p → q, which means that "if p is true, then q must be true." Given the first premise, one can either affirm or deny…
Descriptors: Hypothesis Testing, Statistical Analysis, Logical Thinking, Probability
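As a reminder of the two valid conditional argument forms the abstract alludes to (the notation below is standard logic shorthand, not taken from the article):

    modus ponens:   p \rightarrow q,\; p \;\vdash\; q
    modus tollens:  p \rightarrow q,\; \neg q \;\vdash\; \neg p

On one common reading of significance testing (not necessarily the one developed in the article), p plays the role of "the null hypothesis is true" and q the role of "a result this extreme will probably not be observed"; observing an extreme result denies q, so the rejection step mirrors modus tollens only in a probabilistic, not strictly deductive, sense.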
Peer reviewed
Gonzalez, Oscar – Educational and Psychological Measurement, 2023
When scores are used to make decisions about respondents, it is of interest to estimate classification accuracy (CA), the probability of making a correct decision, and classification consistency (CC), the probability of making the same decision across two parallel administrations of the measure. Model-based estimates of CA and CC computed from the…
Descriptors: Classification, Accuracy, Intervals, Probability
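For readers unfamiliar with the two quantities, a minimal simulation sketch can make CA and CC concrete. It assumes a simple classical true-score model with a fixed cut score; it does not reproduce the article's model-based estimators, and all numbers (reliability, cut score) are illustrative.

    import numpy as np

    # Illustrative only: estimate classification accuracy (CA) and
    # consistency (CC) by simulation under an assumed true-score model.
    rng = np.random.default_rng(1)
    n, reliability, cut = 100_000, 0.85, 0.0        # assumed values
    true = rng.normal(0, np.sqrt(reliability), n)   # true scores; observed-score variance scaled to 1
    err_sd = np.sqrt(1 - reliability)
    obs1 = true + rng.normal(0, err_sd, n)          # first administration
    obs2 = true + rng.normal(0, err_sd, n)          # parallel administration

    pass_true, pass1, pass2 = true >= cut, obs1 >= cut, obs2 >= cut
    ca = np.mean(pass1 == pass_true)                # P(decision matches true status)
    cc = np.mean(pass1 == pass2)                    # P(same decision on both administrations)
    print(f"CA ~ {ca:.3f}, CC ~ {cc:.3f}")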
Peer reviewed
Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Educational and Psychological Measurement, 2022
This study offers an approach to testing for differential item functioning (DIF) in a recently developed measurement framework, referred to as "D"-scoring method (DSM). Under the proposed approach, called "P-Z" method of testing for DIF, the item response functions of two groups (reference and focal) are compared by…
Descriptors: Test Bias, Methods, Test Items, Scoring
Peer reviewed
Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024
Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…
Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales
Peer reviewed
Cassiday, Kristina R.; Cho, Youngmi; Harring, Jeffrey R. – Educational and Psychological Measurement, 2021
Simulation studies involving mixture models inevitably aggregate parameter estimates and other output across numerous replications. A primary issue that arises in these methodological investigations is label switching. The current study compares several label switching corrections that are commonly used when dealing with mixture models. A growth…
Descriptors: Probability, Models, Simulation, Mathematics
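One of the simpler corrections usually included in such comparisons is an identifiability constraint applied after estimation: within each replication, the class labels are reordered so that a chosen parameter is ascending. A minimal sketch follows; the array layout and the choice of sorting parameter are assumptions for illustration, not the article's setup.

    import numpy as np

    # Relabel mixture classes in each replication by sorting on an estimated
    # class mean (assumed to be the first parameter in the last axis).
    def relabel_by_mean(estimates):
        # estimates: array of shape (n_replications, n_classes, n_parameters)
        out = estimates.copy()
        for r in range(out.shape[0]):
            order = np.argsort(out[r, :, 0])   # order classes by their mean
            out[r] = out[r, order]
        return out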
Peer reviewed
Levy, Roy; Xia, Yan; Green, Samuel B. – Educational and Psychological Measurement, 2021
A number of psychometricians have suggested that parallel analysis (PA) tends to yield more accurate results in determining the number of factors in comparison with other statistical methods. Nevertheless, all too often PA can suggest an incorrect number of factors, particularly in statistically unfavorable conditions (e.g., small sample sizes and…
Descriptors: Bayesian Statistics, Statistical Analysis, Factor Structure, Probability
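For context, classical (Horn-type) parallel analysis retains a factor whenever its observed eigenvalue exceeds a reference percentile of eigenvalues obtained from random data of the same dimensions. The sketch below shows only that baseline procedure, not the Bayesian refinement this article pursues.

    import numpy as np

    # Horn-type parallel analysis: count factors whose observed eigenvalues
    # exceed the chosen percentile of eigenvalues from random normal data.
    def parallel_analysis(data, n_sims=200, percentile=95, seed=0):
        rng = np.random.default_rng(seed)
        n, p = data.shape
        obs_eigs = np.linalg.eigvalsh(np.corrcoef(data, rowvar=False))[::-1]
        sim_eigs = np.empty((n_sims, p))
        for s in range(n_sims):
            random_data = rng.normal(size=(n, p))
            sim_eigs[s] = np.linalg.eigvalsh(np.corrcoef(random_data, rowvar=False))[::-1]
        threshold = np.percentile(sim_eigs, percentile, axis=0)
        return int(np.sum(obs_eigs > threshold))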
Peer reviewed
Magnus, Brooke E.; Liu, Yang – Educational and Psychological Measurement, 2022
Questionnaires inquiring about psychopathology symptoms often produce data with excess zeros or the equivalent (e.g., none, never, and not at all). This type of zero inflation is especially common in nonclinical samples in which many people do not exhibit psychopathology, and if unaccounted for, can result in biased parameter estimates when…
Descriptors: Symptoms (Individual Disorders), Psychopathology, Research Methodology, Probability
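A generic zero-inflated formulation (the notation here is assumed for illustration, not necessarily the authors' exact model) mixes a point mass at zero with an ordinary response distribution f(y; \theta):

    P(Y = 0) = \pi + (1 - \pi)\, f(0; \theta)
    P(Y = y) = (1 - \pi)\, f(y; \theta), \quad y > 0

where \pi is the probability of belonging to the structural-zero (asymptomatic) group, and ignoring \pi is what produces the bias the abstract warns about.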
Peer reviewed
Kim, Eunsook; von der Embse, Nathaniel – Educational and Psychological Measurement, 2021
Although collecting data from multiple informants is highly recommended, methods to model the congruence and incongruence between informants are limited. Bauer and colleagues suggested the trifactor model that decomposes the variances into common factor, informant perspective factors, and item-specific factors. This study extends their work to the…
Descriptors: Probability, Models, Statistical Analysis, Congruence (Psychology)
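The variance decomposition described in the abstract can be written schematically as follows (the notation is an assumption for illustration, not taken from the article). The response of informant r to item i is modeled as

    y_{ir} = \nu_{ir} + \lambda^{C}_{ir}\,\eta^{C} + \lambda^{P}_{ir}\,\eta^{P}_{r} + \lambda^{S}_{i}\,\eta^{S}_{i} + \varepsilon_{ir}

with \eta^{C} a common (target) factor shared by all informants, \eta^{P}_{r} an informant-perspective factor, and \eta^{S}_{i} an item-specific factor.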
Peer reviewed
Ellis, Jules L. – Educational and Psychological Measurement, 2021
This study develops a theoretical model for the costs of an exam as a function of its duration. Two kinds of costs are distinguished: (1) the costs of measurement errors and (2) the costs of the measurement. Both costs are expressed in terms of the student's time. Based on a classical test theory model, enriched with assumptions on the context, the costs…
Descriptors: Test Length, Models, Error of Measurement, Measurement
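One way to picture such a trade-off (the functional forms below are assumptions for illustration, not the article's model) is a total cost that adds the student's testing time to an error-cost term that shrinks as the exam grows, e.g. under a parallel-parts argument in which error variance scales as 1/t:

    C(t) = t + k\,\sigma_E(t), \qquad \sigma_E(t) \propto \frac{1}{\sqrt{t}}

so that C(t) first falls and then rises, implying an interior optimum for the exam duration.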
Peer reviewed
Sinharay, Sandip – Educational and Psychological Measurement, 2022
Administrative problems such as computer malfunction and power outage occasionally lead to missing item scores and hence to incomplete data on mastery tests such as the AP and U.S. Medical Licensing examinations. Investigators are often interested in estimating the probabilities of passing of the examinees with incomplete data on mastery tests.…
Descriptors: Mastery Tests, Computer Assisted Testing, Probability, Test Wiseness
Peer reviewed
Raykov, Tenko; Marcoulides, George A.; Li, Tenglong – Educational and Psychological Measurement, 2018
This note extends the results in the 2016 article by Raykov, Marcoulides, and Li to the case of correlated errors in a set of observed measures subjected to principal component analysis. It is shown that when at least two measures are fallible, the probability is zero for any principal component--and in particular for the first principal…
Descriptors: Factor Analysis, Error of Measurement, Correlation, Reliability
Peer reviewed
Nam, Yeji; Hong, Sehee – Educational and Psychological Measurement, 2021
This study investigated the extent to which class-specific parameter estimates are biased by the within-class normality assumption in nonnormal growth mixture modeling (GMM). Monte Carlo simulations for nonnormal GMM were conducted to analyze and compare two strategies for obtaining unbiased parameter estimates: relaxing the within-class normality…
Descriptors: Probability, Models, Statistical Analysis, Statistical Distributions
Peer reviewed
Chen, Michelle Y.; Liu, Yan; Zumbo, Bruno D. – Educational and Psychological Measurement, 2020
This study introduces a novel differential item functioning (DIF) method based on propensity score matching that tackles two challenges in analyzing performance assessment data, that is, continuous task scores and lack of a reliable internal variable as a proxy for ability or aptitude. The proposed DIF method consists of two main stages. First,…
Descriptors: Probability, Scores, Evaluation Methods, Test Items
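A rough two-stage sketch in that spirit is shown below; the variable names, the matching rule, and the effect estimate are illustrative assumptions rather than the article's exact procedure.

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    # Stage 1: estimate each examinee's propensity of focal-group membership
    # from matching covariates; Stage 2: compare continuous task scores
    # between focal examinees and their propensity-matched reference peers.
    def psm_dif(scores, group, covariates):
        ps = LogisticRegression(max_iter=1000).fit(covariates, group)
        ps = ps.predict_proba(covariates)[:, 1]

        focal = np.where(group == 1)[0]
        reference = np.where(group == 0)[0]
        # Nearest-neighbor matching on the propensity score (with replacement).
        gaps = np.abs(ps[focal][:, None] - ps[reference][None, :])
        matched_ref = reference[np.argmin(gaps, axis=1)]

        # DIF effect: mean task-score difference within matched pairs.
        return np.mean(scores[focal] - scores[matched_ref])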
Peer reviewed
Liu, Ren; Qian, Hong; Luo, Xiao; Woo, Ada – Educational and Psychological Measurement, 2018
Subscore reporting under item response theory models has always been a challenge partly because the test length of each subdomain is limited for precisely locating individuals on multiple continua. Diagnostic classification models (DCMs), providing a pass/fail decision and associated probability of pass on each subdomain, are promising…
Descriptors: Classification, Probability, Pass Fail Grading, Scores
Peer reviewed
Kalinowski, Steven T. – Educational and Psychological Measurement, 2019
Item response theory (IRT) is a statistical paradigm for developing educational tests and assessing students. IRT, however, currently lacks an established graphical method for examining model fit for the three-parameter logistic model, the most flexible and popular IRT model in educational testing. A method is presented here to do this. The graph,…
Descriptors: Item Response Theory, Educational Assessment, Goodness of Fit, Probability
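A common back-of-the-envelope version of such a graph (not necessarily the one proposed in the article) bins examinees by estimated ability and overlays the empirical proportion correct in each bin on the fitted three-parameter logistic item characteristic curve:

    import numpy as np
    import matplotlib.pyplot as plt

    # Compare an item's empirical proportion-correct curve, computed in
    # ability bins, with the fitted 3PL item characteristic curve (ICC).
    def plot_3pl_fit(theta, responses, a, b, c, n_bins=10):
        bins = np.quantile(theta, np.linspace(0, 1, n_bins + 1))
        centers, observed = [], []
        for lo, hi in zip(bins[:-1], bins[1:]):
            mask = (theta >= lo) & (theta <= hi)
            if mask.any():
                centers.append(theta[mask].mean())
                observed.append(responses[mask].mean())
        grid = np.linspace(theta.min(), theta.max(), 200)
        icc = c + (1 - c) / (1 + np.exp(-a * (grid - b)))   # 3PL model curve
        plt.plot(grid, icc, label="3PL ICC")
        plt.plot(centers, observed, "o", label="empirical proportions")
        plt.xlabel("ability (theta)")
        plt.ylabel("P(correct)")
        plt.legend()
        plt.show()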