ERIC - Search Results

Showing all 10 results

Processes and Procedures for Estimating Score Reliability and Precision

Peer reviewed

Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, α), test-retest, alternate forms, interscorer, and…

Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests
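
As an illustration of one of the internal-consistency estimates named in the abstract above, here is a minimal sketch of KR-20 for dichotomously scored items. It is not taken from the article. The function name `kr20`, the list-of-lists input layout, and the use of the population variance of total scores are assumptions made for this example.

```python
from statistics import pvariance


def kr20(responses):
    """KR-20 for a matrix of 0/1 item scores (rows = examinees, columns = items)."""
    n_people = len(responses)
    n_items = len(responses[0])
    if n_items < 2:
        raise ValueError("KR-20 needs at least two items")
    # Sum of p * q over items, where p is the proportion correct and q = 1 - p.
    sum_pq = 0.0
    for j in range(n_items):
        p = sum(row[j] for row in responses) / n_people
        sum_pq += p * (1 - p)
    # Variance of total scores (population form; an assumption of this sketch).
    total_var = pvariance([sum(row) for row in responses])
    if total_var == 0:
        raise ValueError("total scores have zero variance")
    return (n_items / (n_items - 1)) * (1 - sum_pq / total_var)
```

For four examinees with response patterns 111, 110, 100, and 000, this sketch returns 0.75. Using the sample variance instead would give a slightly different value.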

Guessing and the Rasch Model

Peer reviewed

Holster, Trevor A.; Lake, J. – Language Assessment Quarterly, 2016

Stewart questioned Beglar's use of Rasch analysis of the Vocabulary Size Test (VST) and advocated the use of 3-parameter logistic item response theory (3PLIRT) on the basis that it models a non-zero lower asymptote for items, often called a "guessing" parameter. In support of this theory, Stewart presented fit statistics derived from…

Descriptors: Guessing (Tests), Item Response Theory, Vocabulary, Language Tests
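
To make the contrast in the abstract above concrete, the following sketch evaluates the standard 3-parameter logistic response function, whose lower asymptote is the "guessing" parameter, and shows how it reduces to the Rasch model. The function name `p_correct_3pl` and its argument names are assumptions for illustration, and the optional scaling constant used in some parameterizations is omitted.

```python
import math


def p_correct_3pl(theta, b, a=1.0, c=0.0):
    """Probability of a correct response under the 3PL model.

    theta: person ability, b: item difficulty, a: discrimination,
    c: lower asymptote (the "guessing" parameter).
    With a=1 and c=0 this reduces to the Rasch model.
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))
```

With ability equal to item difficulty, the Rasch case gives 0.5, while a lower asymptote of 0.25 raises the probability to 0.625. For very low ability the 3PL probability approaches c rather than zero.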

Student Team Achievement Divisions (STAD) Technique through the Moodle to Enhance Learning Achievement

Peer reviewed

Tiantong, Monchai; Teemuangsai, Sanit – International Education Studies, 2013

One benefit of collaborative learning is that it enhances learning achievement and increases social skills; a second is that the more students work together in collaborative groups, the more they understand, retain, and feel better about themselves and their peers. Moreover, working together in a collaborative environment…

Descriptors: Foreign Countries, Cooperative Learning, Teamwork, Integrated Learning Systems

Comparison of Selected Methods of Measuring Change in Judgment of Art Design.

Peer reviewed

Bledsoe, Joseph; And Others – Perceptual and Motor Skills, 1980

Elementary teacher candidates were pretested and posttested with the Graves Design Judgment Test. Of five approaches to analyzing change, only one, a transformation of posttest divided by pretest expressed in percentage, yielded significance. The hypothesis that a sculpture workshop and field experience would result in greater gains was not…

Descriptors: Art Appreciation, Attitude Change, Design Preferences, Field Experience Programs

The (Un)reliability of Change Scores in Counseling Research.

Knapp, Thomas R. – Measurement and Evaluation in Guidance, 1980

Supports arguments against general use of change scores and recommends the Lord/McNemar estimates of true change. Provides a numerical example illustrating the reliability problem and the problem of the prediction of true change from various linear composites of initial and final measures. (Author)

Descriptors: Counseling Techniques, Literature Reviews, Pretests Posttests, Research Methodology
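
The reliability problem mentioned in the abstract above can be illustrated with the classical test theory formula for the reliability of a simple difference score, posttest minus pretest. This is a sketch under the usual assumption of uncorrelated measurement errors. It is not the article's numerical example, and it does not implement the Lord/McNemar estimates. The function and parameter names are hypothetical.

```python
def difference_score_reliability(sd_x, sd_y, rel_x, rel_y, r_xy):
    """Reliability of D = Y - X under classical test theory.

    sd_x, sd_y: observed standard deviations of pretest and posttest.
    rel_x, rel_y: reliabilities of pretest and posttest.
    r_xy: observed pretest-posttest correlation.
    Assumes measurement errors are uncorrelated across occasions.
    """
    cov_xy = r_xy * sd_x * sd_y
    true_var = rel_x * sd_x ** 2 + rel_y * sd_y ** 2 - 2 * cov_xy
    observed_var = sd_x ** 2 + sd_y ** 2 - 2 * cov_xy
    if observed_var <= 0:
        raise ValueError("difference scores have no observed variance")
    return true_var / observed_var
```

With equal standard deviations, reliabilities of 0.8 on both occasions, and a pretest-posttest correlation of 0.6, the difference score has a reliability of about 0.5, well below that of either measure.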

An Empirical Test of a Strategy for Training Examinees in the Use of Partial Information in Taking Multiple Choice Tests.

Bliss, Leonard B. – 1981

The aim of this study was to show that the superiority of corrected-for-guessing scores over number right scores as true score estimates depends on the ability of examinees to recognize situations where they can eliminate one or more alternatives as incorrect and to omit items where they would only be guessing randomly. Previous investigations…

Descriptors: Algorithms, Guessing (Tests), Intermediate Grades, Multiple Choice Tests
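
For readers unfamiliar with the two scoring rules compared in the abstract above, the conventional correction-for-guessing formula subtracts a fraction of the wrong answers from the number right, leaving omitted items unpenalized. The sketch below assumes that conventional formula. The study's exact scoring procedure is not given in the abstract, and the function name is hypothetical.

```python
def corrected_for_guessing(n_right, n_wrong, n_options):
    """Conventional formula score: R - W / (k - 1).

    n_right: number of items answered correctly (R).
    n_wrong: number answered incorrectly (W); omitted items are not counted.
    n_options: number of answer choices per item (k).
    """
    if n_options < 2:
        raise ValueError("items need at least two options")
    return n_right - n_wrong / (n_options - 1)
```

An examinee with 30 right and 8 wrong on five-option items receives a corrected score of 28.0, compared with a number-right score of 30.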

Can Selection Tests Be Used As Pretest?

Yap, Kim Onn – 1978

A simulation study was designed to assess the severity of regression effects when a set of selection scores is also used as pretest scores as this pertains to RMC Model A of the Elementary and Secondary Education Act Title I evaluation and reporting system. Data sets were created with various characteristics (varying data reliability and…

Descriptors: Achievement Gains, Analysis of Variance, Elementary Secondary Education, Low Achievement
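
The regression effect described in the abstract above can be demonstrated with a small simulation. This is only a sketch and not the study's design. The score distribution, error size, cut fraction, and function name are all assumed values chosen for illustration, and no treatment effect is simulated.

```python
import random
from statistics import mean


def simulate_regression_effect(n=10000, cut_fraction=0.2, seed=0):
    """Compare selection-score and independent-retest means for low scorers.

    Each observed score is a true score plus independent random error.
    Returns (mean selection score, mean retest score) for the selected group.
    """
    rng = random.Random(seed)
    true_scores = [rng.gauss(50, 10) for _ in range(n)]
    selection = [t + rng.gauss(0, 5) for t in true_scores]
    retest = [t + rng.gauss(0, 5) for t in true_scores]
    # Select the lowest-scoring fraction on the selection test.
    order = sorted(range(n), key=lambda i: selection[i])
    chosen = order[: int(n * cut_fraction)]
    return (mean(selection[i] for i in chosen),
            mean(retest[i] for i in chosen))
```

Because the group is chosen for its low observed scores, its selection-test mean is depressed by negative errors. An independent retest of the same students averages higher even with no intervention, so using the selection score as the pretest would overstate any gain.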

A Simulation Study Comparing Procedures for Assessing Individual Educational Growth. Report No. 182.

Richards, James M., Jr. – 1974

A computer simulation procedure was developed to reproduce the overall pattern of results obtained in the Educational Testing Service Growth Study. Then simulated data for seven sets of 10,000 to 15,000 cases were analyzed, and findings compared on the basis of correlations between estimated and true growth scores. Findings showed that growth was…

Descriptors: Computers, Educational Assessment, Educational Research, Educational Testing

A Note on Some Characteristics and Correlates of the Meier Art Test of Aesthetic Perception.

Stallings, William M.; Anderson, Frances E. – 1968

The reliability and the predictive and concurrent validity of the MATAP were investigated with the implicit goal of improving the prediction of course grades in the College of Fine and Applied Arts. It was found that reliability and validity coefficients were low, and it was suggested that the scoring system was a source of error variance. (MS)

Descriptors: Art Appreciation, Biographical Inventories, College Students, Correlation

A Review of the ESEA Title I Evaluation and Reporting System.

Hubert, John A. – 1978

The Elementary and Secondary Education Act Title I Evaluation and Reporting System is a method for giving a federally funded project in reading or math an overall score on its cognitive effectiveness. This System introduced the Normal Curve Equivalent (NCE) as an aid in aggregating Title I program scores across states and nationwide regardless of…

Descriptors: Achievement Gains, Achievement Tests, Compensatory Education, Control Groups
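
As background for the Normal Curve Equivalent mentioned in the abstract above, the sketch below converts a percentile rank to an NCE using the commonly cited definition: a normalized score with mean 50 and standard deviation 21.06. The function name is hypothetical, and the report's own computational details are not shown in the abstract.

```python
from statistics import NormalDist


def percentile_to_nce(percentile_rank):
    """Convert a percentile rank (strictly between 0 and 100) to an NCE."""
    if not 0 < percentile_rank < 100:
        raise ValueError("percentile rank must be strictly between 0 and 100")
    # Normal deviate corresponding to the percentile rank.
    z = NormalDist().inv_cdf(percentile_rank / 100)
    return 50 + 21.06 * z
```

The constant 21.06 makes NCE values of 1, 50, and 99 line up with percentile ranks 1, 50, and 99, while keeping the scale equal-interval so that scores can be averaged.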