ERIC - Search Results

Source

Applied Measurement in…

Author

Cohen, Allan S.	2
Bandalos, Deborah L.	1
Bergstrom, Betty A.	1
Crooks, Terence J.	1
Enders, Craig K.	1
Engelhard, George, Jr.	1
Ferrara, Steven	1
Fitzpatrick, Anne R.	1
Kane, Michael T.	1
Kim, Seock-Ho	1
Lane, Suzanne	1
Linn, Robert L.	1
Yen, Wendy M.	1
More ▼

Publication Type

Journal Articles	9
Speeches/Meeting Papers	9
Reports - Evaluative	6
Reports - Research	2
Reports - Descriptive	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 9 results Save | Export

The Effects of Nonnormality and Number of Response Categories on Reliability.

Peer reviewed

Bandalos, Deborah L.; Enders, Craig K. – Applied Measurement in Education, 1996

Computer simulation indicated that reliability increased with the degree of similarity between underlying and observed distributions when the observed categorical distribution was deliberately constructed to match the shape of the underlying distribution of the trait being measured. Reliability also increased with correlation among variables and…

Descriptors: Computer Simulation, Correlation, Likert Scales, Reliability

Contextual Characteristics of Locally Dependent Open-Ended Item Clusters in a Large-Scale Performance Assessment.

Peer reviewed

Ferrara, Steven; And Others – Applied Measurement in Education, 1997

Causes of local item dependence in a large-scale performance assessment were studied using data from the Maryland School Performance Assessment Program. Contextual characteristics (content and response requirements) were identified to differentiate locally independent and dependent item clusters. Hypothesized explanations are offered for high…

Descriptors: Context Effect, Performance Based Assessment, Responses, Test Content

The Effects of Test Length and Sample Size on the Reliability and Equating of Tests Composed of Constructed-Response Items.

Peer reviewed

Fitzpatrick, Anne R.; Yen, Wendy M. – Applied Measurement in Education, 2001

Examined the effects of test length and sample size on the alternate forms reliability and equating of simulated mathematics tests composed of constructed response items scaled using the two-parameter partial credit model. Results suggest that, to obtain acceptable reliabilities and accurate equated scores, tests should have at least 8 6-point…

Descriptors: Constructed Response, Equated Scores, Mathematics Tests, Reliability

Examination of the Assumptions and Properties of the Graded Item Response Model: An Example Using a Mathematics Performance Assessment.

Peer reviewed

Lane, Suzanne; And Others – Applied Measurement in Education, 1995

Over 5,000 students participated in a study of the dimensionality and stability of the item parameter estimates of a mathematics performance assessment developed for the Quantitative Understanding: Amplifying Student Achievement and Reasoning (QUASAR) Project. Results demonstrate the test's dimensionality and illustrate ways to examine use of the…

Descriptors: College Students, Estimation (Mathematics), Higher Education, Item Response Theory

Has Item Response Theory Increased the Validity of Achievement Test Scores?

Peer reviewed

Linn, Robert L. – Applied Measurement in Education, 1990

The contribution of item response theory to the validity of interpretations of achievement test results is reviewed in the context of four applications. The applications include construction of scales for achievement tests, test construction, development of customized tests, and investigation of the influence of instruction on achievement tests.…

Descriptors: Achievement Tests, Elementary Secondary Education, Instructional Effectiveness, Item Response Theory

A Comparison of Lord's Chi-Square, Raju's Area Measures, and the Likelihood Ratio Test on Detection of Differential Item Functioning.

Peer reviewed

Kim, Seock-Ho; Cohen, Allan S. – Applied Measurement in Education, 1995

Three procedures for the detection of differential item functioning under item response theory were compared. Data for 2 forms of a mathematics test taken by 1,490 college students were analyzed through F. M. Lord's chi-square, N. S. Raju's area measures, and the likelihood ratio test. (SLD)

Descriptors: Chi Square, College Students, Comparative Analysis, Higher Education

A Generalized Examinee-Centered Method for Setting Standards on Achievement Tests.

Peer reviewed

Cohen, Allan S.; Kane, Michael T.; Crooks, Terence J. – Applied Measurement in Education, 1999

Describes examinee-centered method for setting multiple cutscores on a test involving both objective and extended-response items. Judges evaluate a representative sample of examinee performance using a rating scale that is defined in terms of performance standards, and these ratings are linked to examinee's test scores to generate a functional…

Descriptors: Academic Standards, Achievement Tests, Constructed Response, Cutting Scores

Altering the Level of Difficulty in Computer Adaptive Testing.

Peer reviewed

Bergstrom, Betty A.; And Others – Applied Measurement in Education, 1992

Effects of altering test difficulty on examinee ability measures and test length in a computer adaptive test were studied for 225 medical technology students in 3 test difficulty conditions. Results suggest that, with an item pool of sufficient depth and breadth, acceptable targeting to test difficulty is possible. (SLD)

Descriptors: Ability, Adaptive Testing, Change, College Students

The Measurement of Writing Ability with a Many-Faceted Rasch Model.

Peer reviewed

Engelhard, George, Jr. – Applied Measurement in Education, 1992

A Many-Faceted Rasch Model (FACETS) for measurement of writing ability is described, and its use in solving measurement problems in large-scale assessment is illustrated with a random sample of 1,000 students from Georgia's Eighth Grade Writing Test. It is a promising approach to assessment through written compositions. (SLD)

Descriptors: Educational Assessment, Essays, Evaluation Problems, Grade 8

Item Response Theory	5
Test Items	4
College Students	3
Higher Education	3
Achievement Tests	2
Constructed Response	2
Elementary Secondary Education	2
Mathematics Tests	2
Performance Based Assessment	2
Reliability	2
Responses	2
Test Construction	2
Test Length	2
Ability	1
Academic Standards	1
Adaptive Testing	1
Change	1
Chi Square	1
Comparative Analysis	1
Computer Assisted Testing	1
Computer Simulation	1
Context Effect	1
Correlation	1
Cutting Scores	1
Difficulty Level	1
More ▼