ERIC - Search Results

Descriptor

Test Reliability	28
Test Validity	23
Test Use	22
Test Reviews	14
Test Construction	11
Adults	9
Reliability	9
Scoring	9
Elementary Secondary Education	8
Test Content	8
Adolescents	7
Higher Education	6
Measurement Techniques	6
Scores	6
Children	5
Psychometrics	5
Test Norms	5
Career Counseling	4
Comparative Analysis	4
Foreign Countries	4
Secondary Education	4
Test Theory	4
Achievement Gains	3
Aptitude Tests	3
Career Choice	3
More ▼

Source

Applied Psychological…	3
Educational and Psychological…	3
Journal of Reading	3
Psychological Test Bulletin	3
Journal of Educational…	2
Journal of Experimental…	2
Database	1
Education Policy Analysis…	1
Evaluation Practice	1
Gifted Child Quarterly	1
Journal of Educational…	1
Reading Teacher	1
Roeper Review	1
More ▼

Publication Type

Book/Product Reviews	38
Journal Articles	23
Reports - Evaluative	12
Speeches/Meeting Papers	10
Reports - Descriptive	2
Reference Materials -…	1
Reports - Research	1

Education Level

Audience

Practitioners

Location

Australia	3
Massachusetts	1
United Kingdom (Great Britain)	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

Boehm Test of Basic Concepts	1
Computer Attitude Scale	1
Differential Aptitude Test	1
Sixteen Personality Factor…	1
Stanford Binet Intelligence…	1
Strong Interest Inventory	1
Values Scale	1
Wechsler Preschool and…	1
Woodcock Reading Mastery Test	1

What Works Clearinghouse Rating

Book/Product Reviews X

Showing 1 to 15 of 38 results Save | Export

Written Language Assessment (Test Review).

Peer reviewed

Spaulding, Cheryl L. – Journal of Reading, 1989

Reviews "Written Language Assessment" (WLA), a new standardized test to evaluate children's and adolescents' written language competence by having students write essays instead of answer multiple choice questions. Finds problems with the WLA in terms of interrater reliability. (RS)

Descriptors: Elementary Secondary Education, Essay Tests, Interrater Reliability, Standardized Tests

Reliability: Rejoinder to Thompson and Vacha-Haase.

Peer reviewed

Sawilowsky, Shlomo S. – Educational and Psychological Measurement, 2000

B. Thompson and T. Vacha-Haase have examined the statement "the reliability of the test" with emphasis on the following three words: (1) the first "the"; (2) "test"; and (3) the second "the." This discussion focuses instead on the word "reliability." (Author)

Descriptors: Generalization, Meta Analysis, Psychometrics, Reliability

Is Reliability Obsolete? A Commentary on "Are Simple Gain Scores Obsolete?"

Peer reviewed

Collins, Linda M. – Applied Psychological Measurement, 1996

The clarification provided by Williams and Zimmerman on the reliability of gain scores is translated into recognizable patterns of change that tend to produce reliable or unreliable gain scores. The relevance of the traditional idea of reliability to the measurement of change is also discussed. (SLD)

Descriptors: Achievement Gains, Change, Measurement Techniques, Reliability

Measurement Error, Multidimensionality, and Scale Shrinkage: A Reply to Yen and Burket.

Peer reviewed

Camilli, Gregory – Journal of Educational Measurement, 1999

Yen and Burket suggested that shrinkage in vertical equating cannot be understood apart from multidimensionality. Reviews research on reliability, multidimensionality, and scale shrinkage, and explores issues of practical importance to educators. (SLD)

Descriptors: Equated Scores, Error of Measurement, Item Response Theory, Reliability

Burns/Roe Informal Reading Inventory (Test Review).

Peer reviewed

Arno, Kevin S. – Journal of Reading, 1990

Notes that the third edition of the Burns/Roe Informal Reading Inventory takes less time to administer. Reports an absence of data on the inventory's reliability. Concludes that if used to study, evaluate, or diagnose reading behaviors, the Burns and Roe IRI could be a popular and valuable tool. (RS)

Descriptors: Elementary Secondary Education, Informal Reading Inventories, Reading Diagnosis, Test Reliability

Analytic Reading Inventory (ARI) (Fourth Edition) (Test Review).

Peer reviewed

Martin-Rehrmann, James – Journal of Reading, 1990

Reviews the fourth edition of the Analytic Reading Inventory (ARI). Notes the addition of suggestions for diagnostic interpretation and teacher interpretations. Finds the ARI to be a convenient yet reliable diagnostic tool. (RS)

Descriptors: Elementary Secondary Education, Informal Reading Inventories, Reading Diagnosis, Test Reliability

Significance, Effect Sizes, Stepwise Methods, and Other Issues: Strong Arguments Move the Field.

Peer reviewed

Thompson, Bruce – Journal of Experimental Education, 2001

Asserts that editors should declare their expectations publicly and expose the rationale for editorial policies to public scrutiny. Supports effect size reporting and the reporting of score reliabilities. Argues against stepwise methods. Also discusses the interpretation of structure coefficients and the use of confidence intervals. (SLD)

Descriptors: Editing, Effect Size, Reliability, Research Methodology

Strong Arguments: Rejoinder to Thompson.

Peer reviewed

Knapp, Thomas R.; Sawilowsky, Shlomo S. – Journal of Experimental Education, 2001

Replies to Bruce Thompson's positions on research methodology and editorial policy, addressing each of these issues in the ongoing discussion: (1) structure coefficients; (2) stepwise regression; (3) test reliability; (4) effect sizes; and (5) meta-analysis. (SLD)

Descriptors: Editing, Effect Size, Reliability, Research Methodology

Agreement Measure Comparisons between Two Independent Sets of Raters.

Peer reviewed

Berry, Kenneth J.; Mielke, Paul W., Jr. – Educational and Psychological Measurement, 1997

Describes a FORTRAN software program that calculates the probability of an observed difference between agreement measures obtained from two independent sets of raters. An example illustrates the use of the DIFFER program in evaluating undergraduate essays. (Author/SLD)

Descriptors: Comparative Analysis, Computer Software, Evaluation Methods, Higher Education

Linear Dependence on Gain Scores in Their Components Imposes Constraints on Their Use and Interpretation: Comment on "Are Simple Gain Scores Obsolete?"

Peer reviewed

Humphreys, Lloyd G. – Applied Psychological Measurement, 1996

The reliability of a gain is determined by the reliabilities of the components, the correlation between them, and their standard deviations. Reliability is not inherently low, but the components of gains in many investigations make low reliability likely and require caution in the use of gain scores. (SLD)

Descriptors: Achievement Gains, Change, Correlation, Error of Measurement

Commentary on the Commentaries of Collins and Humphreys.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Applied Psychological Measurement, 1996

The critiques by L. Collins and L. Humphreys in this issue illustrate problems with the use of gain scores. Collins' examples show that familiar formulas for the reliability of differences do not reflect the precision of measures of change. Additional examples demonstrate flaws in the conventional approach to reliability. (SLD)

Descriptors: Achievement Gains, Change, Correlation, Error of Measurement

Test Pac: A Program for Comprehensive Item and Reliability Analysis.

Peer reviewed

Luecht, Richard M. – Educational and Psychological Measurement, 1987

Test Pac, a test scoring and analysis computer program for moderate-sized sample designs using dichotomous response items, performs comprehensive item analyses and multiple reliability estimates. It also performs single-facet generalizability analysis of variance, single-parameter item response theory analyses, test score reporting, and computer…

Descriptors: Computer Assisted Testing, Computer Software, Computer Software Reviews, Item Analysis

Ability Explorer: A Review and Critique.

Download full text

Hoffman, Anne – 1997

The Ability Explorer (AE) is a newly developed self-report inventory of abilities that is appropriate for group or individual administration. There are machine-scorable and hand-scorable versions of the test, and there are two levels. Level 1 is for students from junior high to high school, and Level 2 is for high school students and adults.…

Descriptors: Ability, Adolescents, Adults, Aptitude Tests

Test Review: Standardized Reading Inventory (SRI).

Peer reviewed

Mathewson, Grover C. – Reading Teacher, 1988

Concludes that the instrument reviewed is a carefully designed test incorporating a new interpretation of standardization and improved definitions of traditional reading levels. (FL)

Descriptors: Elementary Education, Reading Ability, Reading Instruction, Reading Tests

Assessment of Leadership in Children, Youth and Adults.

Peer reviewed

Oakland, Thomas; And Others – Gifted Child Quarterly, 1996

Eleven leadership measures for children, youth, and adults are reviewed in the context of current leadership theories and psychometric standards for test use. Measures for assessing leadership among children are considered inadequately normed and lacking in reliability and validity data, but leadership measures for adults are seen as more…

Descriptors: Adolescents, Adults, Children, Gifted

Previous Page | Next Page »

Pages: 1 | 2 | 3

Sawilowsky, Shlomo S.	2
Alderson, J. Charles, Ed.	1
Arno, Kevin S.	1
Baker, Carl E.	1
Bartlett, Jane Finegan	1
Beaubien, Denise M.	1
Berry, Kenneth J.	1
Boland, Lyn	1
Camilli, Gregory	1
Collins, Linda M.	1
Cook, Allison A.	1
Dolenz, Beverly	1
Ercikan, Kadriye	1
Fitzpatrick, Anne R.	1
Hoffman, Anne	1
Humphreys, Lloyd G.	1
Ito, Kyoko	1
Knapp, Thomas R.	1
Leyva, Collette	1
Luecht, Richard M.	1
Martin-Rehrmann, James	1
Mathewson, Grover C.	1
Merschman, Jane	1
Mielke, Paul W., Jr.	1
More ▼