NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 10 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017
Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…
Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests
Peer reviewed Peer reviewed
Direct linkDirect link
Holster, Trevor A.; Lake, J. – Language Assessment Quarterly, 2016
Stewart questioned Beglar's use of Rasch analysis of the Vocabulary Size Test (VST) and advocated the use of 3-parameter logistic item response theory (3PLIRT) on the basis that it models a non-zero lower asymptote for items, often called a "guessing" parameter. In support of this theory, Stewart presented fit statistics derived from…
Descriptors: Guessing (Tests), Item Response Theory, Vocabulary, Language Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Tiantong, Monchai; Teemuangsai, Sanit – International Education Studies, 2013
One of the benefits of using collaborative learning is enhancing learning achievement and increasing social skills, and the second benefits is as the more students work together in collaborative groups, the more they understand, retain, and feel better about themselves and their peers, moreover working together in a collaborative environment…
Descriptors: Foreign Countries, Cooperative Learning, Teamwork, Integrated Learning Systems
Peer reviewed Peer reviewed
Bledsoe, Joseph; And Others – Perceptual and Motor Skills, 1980
Elementary teacher candidates were pretested and posttested with the Graves Design Judgment Test. Of five approaches to analyzing change, only one, a transformation of posttest divided by pretest expressed in percentage, yielded significance. The hypothesis that a sculpture workshop and field experience would result in greater gains was not…
Descriptors: Art Appreciation, Attitude Change, Design Preferences, Field Experience Programs
Knapp, Thomas R. – Measurement and Evaluation in Guidance, 1980
Supports arguments against general use of change scores and recommends the Lord/McNemar estimates of true change. Provides a numerical example illustrating the reliability problem and the problem of the prediction of true change from various linear composites of initial and final measures. (Author)
Descriptors: Counseling Techniques, Literature Reviews, Pretests Posttests, Research Methodology
Bliss, Leonard B. – 1981
The aim of this study was to show that the superiority of corrected-for-guessing scores over number right scores as true score estimates depends on the ability of examinees to recognize situations where they can eliminate one or more alternatives as incorrect and to omit items where they would only be guessing randomly. Previous investigations…
Descriptors: Algorithms, Guessing (Tests), Intermediate Grades, Multiple Choice Tests
Yap, Kim Onn – 1978
A simulation study was designed to assess the severity of regression effects when a set of selection scores is also used as pretest scores as this pertains to RMC Model A of the Elementary and Secondary Education Act Title I evaluation and reporting system. Data sets were created with various characteristics (varying data reliability and…
Descriptors: Achievement Gains, Analysis of Variance, Elementary Secondary Education, Low Achievement
Richards, James M., Jr. – 1974
A computer simulation procedure was developed to reproduce the overall pattern of results obtained in the Educational Testing Service Growth Study. Then simulated data for seven sets of 10,000 to 15,000 cases were analyzed, and findings compared on the basis of correlations between estimated and true growth scores. Findings showed that growth was…
Descriptors: Computers, Educational Assessment, Educational Research, Educational Testing
Stallings, William M.; Anderson, Frances E. – 1968
The reliability and the predictive and concurrent validity of the MATAP were investigated with the implicit goal of improving the prediction of course grades in the College of Fine and Applied Arts. It was found that reliability and validity coefficients were low, and it was suggested that the scoring system was a source of error variance. (MS)
Descriptors: Art Appreciation, Biographical Inventories, College Students, Correlation
Hubert, John A. – 1978
The Elementary and Secondary Education Act Title I Evaluation and Reporting System is a method for giving a federally funded project in reading or math an overall score on its cognitive effectiveness. This System introduced the Normal Curve Equivalent (NCE) as an aid in aggregating Title I program scores across states and nationwide regardless of…
Descriptors: Achievement Gains, Achievement Tests, Compensatory Education, Control Groups