Peer reviewed | Marcoulides, George A. – Educational and Psychological Measurement, 1994
Effects of different weighting schemes on selecting the optimal number of observations in multivariate-multifacet generalizability designs are studied when cost constraints are imposed. Comparison of four schemes through simulation indicates that all four produce similar optimal values and comparable reliability. (SLD)
Descriptors: Budgeting, Comparative Analysis, Costs, Factor Analysis
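The design question in this abstract — how many observations per facet to take when each costs money — can be sketched as a constrained search. The variance components, costs, and budget below are hypothetical, not values from the study:

```python
# Sketch: choosing facet sample sizes that maximize a generalizability
# coefficient under a cost constraint. All variance components and unit
# costs are made-up illustrative values.

def g_coefficient(n_i, n_r, var_p=0.50, var_pi=0.20, var_pr=0.10, var_pir=0.30):
    """Generalizability coefficient for a crossed persons x items x raters design."""
    rel_error = var_pi / n_i + var_pr / n_r + var_pir / (n_i * n_r)
    return var_p / (var_p + rel_error)

def optimize(budget=60.0, cost_item=1.0, cost_rater=5.0, max_n=50):
    """Grid search over feasible (n_items, n_raters) pairs within budget."""
    best = None
    for n_i in range(1, max_n + 1):
        for n_r in range(1, max_n + 1):
            if n_i * cost_item + n_r * cost_rater > budget:
                continue
            g = g_coefficient(n_i, n_r)
            if best is None or g > best[0]:
                best = (g, n_i, n_r)
    return best

g, n_i, n_r = optimize()
print(f"best design: {n_i} items, {n_r} raters, G = {g:.3f}")
```

An exhaustive grid is feasible here because the design space is tiny; the study compares weighting schemes for guiding this same trade-off analytically.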
Peer reviewed | Smith, Richard M. – Educational and Psychological Measurement, 1994
Rasch model total-fit statistics and between-item fit statistics were compared for their ability to detect measurement disturbances through the use of simulated data. Results indicate that the between-fit statistic appears more sensitive to systematic measurement disturbances and the total-fit statistic is more sensitive to random measurement…
Descriptors: Comparative Analysis, Goodness of Fit, Item Response Theory, Measurement Techniques
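The total-fit statistics compared here are built from standardized response residuals. A minimal sketch of the unweighted (outfit) and variance-weighted (infit) mean squares on simulated Rasch data — person and item parameters below are illustrative, not from the study:

```python
# Sketch of Rasch residual fit statistics (outfit/infit mean squares)
# computed from simulated dichotomous data.
import numpy as np

rng = np.random.default_rng(0)
n_persons, n_items = 500, 20
theta = rng.normal(0, 1, n_persons)          # person abilities
b = np.linspace(-2, 2, n_items)              # item difficulties

p = 1 / (1 + np.exp(-(theta[:, None] - b[None, :])))   # Rasch probabilities
x = (rng.random((n_persons, n_items)) < p).astype(float)

var = p * (1 - p)
z2 = (x - p) ** 2 / var                      # squared standardized residuals

outfit = z2.mean(axis=0)                                # unweighted MS per item
infit = ((x - p) ** 2).sum(axis=0) / var.sum(axis=0)    # variance-weighted MS

print("outfit range:", outfit.min().round(2), outfit.max().round(2))
print("infit  range:", infit.min().round(2), infit.max().round(2))
```

Both statistics hover near 1 when the data fit the model; outfit weights extreme residuals more heavily, which is one source of the sensitivity differences the abstract reports.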
Peer reviewed | Cohen, Allan S.; And Others – Applied Psychological Measurement, 1993
Three measures of differential item functioning for the dichotomous response model are extended to include Samejima's graded response model. Two are based on area differences between item true score functions, and one is a chi-square statistic for comparing differences in item parameters. (SLD)
Descriptors: Chi Square, Comparative Analysis, Identification, Item Bias
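The area-based DIF measures being extended start from the dichotomous case: integrate the gap between the two groups' item response functions. A sketch for the 2PL case with invented parameters (the article's contribution is extending this to Samejima's graded model):

```python
# Sketch: unsigned "area between item response functions" DIF measure,
# dichotomous 2PL case. Item parameters are made up for illustration.
import numpy as np

def icc_2pl(theta, a, b):
    return 1 / (1 + np.exp(-a * (theta - b)))

theta = np.linspace(-4, 4, 2001)
dt = theta[1] - theta[0]
ref = icc_2pl(theta, a=1.2, b=0.0)     # reference-group item curve
foc = icc_2pl(theta, a=1.2, b=0.5)     # focal-group curve, shifted harder

# Riemann-sum approximation of the unsigned area between the curves.
area = (np.abs(ref - foc) * dt).sum()
print(f"unsigned area = {area:.3f}")
```

With equal discriminations, the exact unsigned area over the whole real line equals the difficulty difference (here 0.5); the finite grid recovers it closely.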
Pommerich, Mary; And Others – 1994
The functioning of two population-based Mantel-Haenszel (MH) common-odds ratios was compared. One ratio is conditioned on the observed test score, while the other is conditioned on a latent trait or true ability score. When the comparison group distributions are incongruent or nonoverlapping to some degree, the observed score represents different…
Descriptors: Ability, Comparative Analysis, Item Bias, Performance
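The observed-score version of the MH common odds ratio can be sketched directly: stratify examinees by total score, form a 2x2 table per stratum, and pool. The simulated data below contain no built-in DIF, so the ratio should land near 1; everything here is illustrative:

```python
# Sketch of the Mantel-Haenszel common odds ratio conditioned on the
# observed total score. Simulated Rasch-like data, no DIF built in.
import numpy as np

rng = np.random.default_rng(1)
n = 2000
group = rng.integers(0, 2, n)                      # 0 = reference, 1 = focal
theta = rng.normal(0, 1, n)                        # identical ability distributions
b = np.linspace(-1.5, 1.5, 10)
p = 1 / (1 + np.exp(-(theta[:, None] - b)))
x = (rng.random((n, 10)) < p).astype(int)

item = x[:, 0]                                     # studied item
score = x.sum(axis=1)                              # stratifying observed score

num = den = 0.0
for s in np.unique(score):
    m = score == s
    a_ = np.sum(m & (group == 0) & (item == 1))    # reference, correct
    b_ = np.sum(m & (group == 0) & (item == 0))    # reference, incorrect
    c_ = np.sum(m & (group == 1) & (item == 1))    # focal, correct
    d_ = np.sum(m & (group == 1) & (item == 0))    # focal, incorrect
    t = a_ + b_ + c_ + d_
    if t == 0:
        continue
    num += a_ * d_ / t
    den += b_ * c_ / t

alpha_mh = num / den
print(f"MH common odds ratio = {alpha_mh:.2f}")
```

The paper's latent-trait variant replaces the observed-score strata with strata on true ability, which matters exactly when the groups' distributions diverge as the abstract describes.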
Peer reviewed | Nandakumar, Ratna – Journal of Educational Measurement, 1994
Using simulated and real data, this study compares the performance of three methodologies for assessing unidimensionality: (1) DIMTEST; (2) the approach of Holland and Rosenbaum; and (3) nonlinear factor analysis. All three models correctly confirm unidimensionality, but they differ in their ability to detect the lack of unidimensionality.…
Descriptors: Ability, Comparative Analysis, Evaluation Methods, Factor Analysis
Beasley, T. Mark; Sheehan, Janet K. – 1994
C. L. Olson (1976, 1979) suggests the Pillai-Bartlett trace (V) as an omnibus multivariate analysis of variance (MANOVA) test statistic for its superior robustness to heterogeneous variances. J. Stevens (1979, 1980) contends that the robustness of V, Wilks's lambda (W), and the Hotelling-Lawley trace (T) are similar, and that their power functions…
Descriptors: Analysis of Covariance, Comparative Analysis, Matrices, Monte Carlo Methods
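The statistic at the center of this debate is easy to compute from the between- and within-group SSCP matrices. A sketch on simulated two-group data (group sizes, dimensions, and the mean shift are invented):

```python
# Sketch: Pillai-Bartlett trace V for a one-way MANOVA on simulated
# two-group data. All design values are illustrative.
import numpy as np

rng = np.random.default_rng(2)
n, p = 60, 3
g = np.repeat([0, 1], n // 2)
y = rng.normal(0, 1, (n, p))
y[g == 1] += np.array([0.8, 0.0, 0.4])             # mean shift for group 1

grand = y.mean(axis=0)
H = np.zeros((p, p))                               # between-groups SSCP
E = np.zeros((p, p))                               # within-groups SSCP
for k in (0, 1):
    yk = y[g == k]
    d = (yk.mean(axis=0) - grand)[:, None]
    H += len(yk) * (d @ d.T)
    E += (yk - yk.mean(axis=0)).T @ (yk - yk.mean(axis=0))

# Pillai's trace: V = tr( H (H + E)^{-1} ), bounded by min(groups-1, p).
V = np.trace(H @ np.linalg.inv(H + E))
print(f"Pillai's trace V = {V:.3f}")
```

Monte Carlo robustness studies like this one repeat such a computation thousands of times under deliberately unequal covariance matrices and count rejections.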
Schumacker, Randall E.; And Others – 1994
Rasch between and total weighted and unweighted fit statistics were compared using varying test lengths and sample sizes. Two test lengths (20 and 50 items) and three sample sizes (150, 500, and 1,000) were crossed. Each of the six combinations was replicated 100 times. In addition, power comparisons were made. Results indicated that there were no…
Descriptors: Comparative Analysis, Goodness of Fit, Item Response Theory, Power (Statistics)
Schumacker, Randall E. – 1994
A population data set was randomly generated from which a random sample was drawn. This sample was randomly divided into two data sets, one of which was used to generate parameter estimates, which were then used in the second data set for cross-validation purposes. The best variable subset models were compared between the two data sets on the…
Descriptors: Comparative Analysis, Criteria, Estimation (Mathematics), Factor Analysis
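The split-sample design described here — estimate on one half, cross-validate subset models on the other — can be sketched with ordinary least squares. The data-generating model and predictor count below are made up:

```python
# Sketch of split-sample cross-validation of competing predictor subsets:
# fit each subset on a calibration half, score it on a validation half.
import numpy as np
from itertools import combinations

rng = np.random.default_rng(3)
n = 400
X = rng.normal(0, 1, (n, 4))
y = 1.0 * X[:, 0] + 0.5 * X[:, 1] + rng.normal(0, 1, n)   # X3, X4 are pure noise

half = n // 2
Xc, yc = X[:half], y[:half]        # calibration sample
Xv, yv = X[half:], y[half:]        # cross-validation sample

def cv_r2(cols):
    """Fit OLS on the calibration data, return R^2 in the validation data."""
    A = np.column_stack([np.ones(half), Xc[:, cols]])
    beta, *_ = np.linalg.lstsq(A, yc, rcond=None)
    Av = np.column_stack([np.ones(n - half), Xv[:, cols]])
    resid = yv - Av @ beta
    return 1 - resid @ resid / ((yv - yv.mean()) @ (yv - yv.mean()))

subsets = [c for r in (1, 2, 3, 4) for c in combinations(range(4), r)]
best = max(subsets, key=cv_r2)
print("best cross-validated subset:", best, f"R^2 = {cv_r2(best):.3f}")
```

Ranking subsets by validation-sample fit rather than calibration-sample fit is the point of the design: it penalizes subsets that merely capitalize on chance.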
Wang, Tianyou; Kolen, Michael J. – 1994
In this paper a quadratic curve equating method for different test forms under a random-group data-collection design is proposed. Procedures for implementing this method and related issues are described and discussed. The quadratic-curve method was evaluated with real test data (from two 30-item subtests for a professional licensure examination…
Descriptors: Comparative Analysis, Data Collection, Equated Scores, Goodness of Fit
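Under a random-groups design the two forms' score distributions can be matched at percentile points and a quadratic fitted through them. A simplified sketch with simulated score data (the paper's actual estimation procedure is more elaborate):

```python
# Sketch of the quadratic-curve equating idea: fit y = a + b*x + c*x^2
# through matched percentile points of two forms. Score data simulated.
import numpy as np

rng = np.random.default_rng(4)
x_scores = rng.binomial(30, 0.62, 3000)            # form X, 30 items
y_scores = rng.binomial(30, 0.55, 3000)            # form Y, slightly harder

q = np.linspace(0.05, 0.95, 19)
xq = np.quantile(x_scores, q)                      # matched percentile points
yq = np.quantile(y_scores, q)

coef = np.polyfit(xq, yq, 2)                       # quadratic equating curve
equate = np.poly1d(coef)

for x in (10, 18, 26):
    print(f"form X score {x} -> form Y equivalent {equate(x):.1f}")
```

Because form Y is harder here, every form X score maps to a lower Y equivalent; the quadratic term lets the conversion bend where the distributions differ in shape, which a linear method cannot do.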
Cope, Ronald T. – 1986
Comparisons were made of three Angoff Design V linear equating methods (two forms equated to a common test, two forms predicted by a common test, or two forms used to predict a common test) and Tucker's and R. Levine's linear methods, under common item linear equating with non-equivalent populations. Forms of a professional certification test…
Descriptors: Certification, Comparative Analysis, Equated Scores, Higher Education
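All of the linear methods compared here share one core relation — matching standardized scores across forms. The group moments below are invented; Tucker's and Levine's methods differ precisely in how they estimate such synthetic-population moments from the common items:

```python
# Sketch: the basic linear equating relation,
#   y(x) = mu_y + (sd_y / sd_x) * (x - mu_x).
# Moments are hypothetical illustration values.
mu_x, sd_x = 21.4, 4.8     # form X mean and SD
mu_y, sd_y = 19.9, 5.1     # form Y mean and SD

def linear_equate(x):
    return mu_y + (sd_y / sd_x) * (x - mu_x)

print(f"form X score 25 -> form Y equivalent {linear_equate(25):.2f}")
```

The relation sends the form X mean to the form Y mean and scales deviations by the ratio of standard deviations; with non-equivalent populations, the whole difficulty lies in estimating those four moments for a common synthetic population.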
Peer reviewed | Tang, K. Linda; Algina, James – Multivariate Behavioral Research, 1993
Type I error rates of four multivariate tests (Pillai-Bartlett trace, Johansen's test, James' first-order test, and James' second-order test) were compared for heterogeneous covariance matrices in 360 simulated experiments. The superior performance of Johansen's test and James' second-order test is discussed. (SLD)
Descriptors: Analysis of Covariance, Analysis of Variance, Comparative Analysis, Equations (Mathematics)
Sarvela, Paul D. – 1986
Four discrimination indices were compared, using score distributions which were normal, bimodal, and negatively skewed. The score distributions were systematically varied to represent the common circumstances of a military training situation using criterion-referenced mastery tests. Three 20-item tests were administered to 110 simulated subjects.…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Mastery Tests
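Two of the classical discrimination indices such a comparison typically includes can be sketched on a toy item-score matrix: the upper-lower (U-L) difference and the point-biserial correlation. The simulated data and group split below are illustrative; mastery-test variants would split on a cut score instead:

```python
# Sketch of two classical discrimination indices on simulated item data.
import numpy as np

rng = np.random.default_rng(5)
n = 110
theta = rng.normal(0, 1, n)
p = 1 / (1 + np.exp(-(theta[:, None] - np.linspace(-1, 1, 20))))
x = (rng.random((n, 20)) < p).astype(float)
total = x.sum(axis=1)

order = np.argsort(total)
lo, hi = order[: n // 4], order[-(n // 4):]        # bottom and top quarters

item = x[:, 0]                                     # studied item
d_ul = item[hi].mean() - item[lo].mean()           # upper-lower index

# Point-biserial: correlation of the item with the total score.
r_pb = np.corrcoef(item, total)[0, 1]
print(f"U-L index = {d_ul:.2f}, point-biserial = {r_pb:.2f}")
```

With skewed or bimodal mastery-test distributions — the conditions varied in the study — these indices can rank the same items quite differently, which is what the comparison is after.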