Showing 1 to 15 of 47 results
Peer reviewed
Beglar, David – Language Testing, 2010
The primary purpose of this study was to provide preliminary validity evidence for a 140-item form of the Vocabulary Size Test, which is designed to measure written receptive knowledge of the first 14,000 words of English. Nineteen native speakers of English and 178 native speakers of Japanese participated in the study. Analyses based on the Rasch…
Descriptors: Test Items, Native Speakers, Test Validity, Vocabulary
Peer reviewed
Porter, Andrew C.; Polikoff, Morgan S.; Goldring, Ellen; Murphy, Joseph; Elliott, Stephen N.; May, Henry – Educational Administration Quarterly, 2010
Research has consistently shown that principal leadership matters for successful schools. Evaluating principals on the behaviors shown to improve student learning should be an important leverage point for raising leadership quality. Yet principals are often evaluated with the use of instruments with no theoretical background and little, if any,…
Descriptors: Psychometrics, Instructional Leadership, Principals, Test Construction
Vance, Booney; Sabatino, David – Diagnostique, 1991
The issues of construct validity, predictive validity, and item content bias on the Wechsler Intelligence Scale for Children-Revised (WISC-R) are examined. The review concludes that most objective data have not supported the issue of bias of the WISC-R when used with children of different ethnic backgrounds. (JDD)
Descriptors: Construct Validity, Content Validity, Elementary Secondary Education, Ethnic Groups
Peer reviewed
Seol, Hyunsoo – Measurement and Evaluation in Counseling and Development, 2007
The author used Rasch measurement to examine the reliability and validity of 382 Korean university students' scores on the Marlowe-Crowne Social Desirability Scale (MCSDS; D. P. Crowne and D. Marlowe, 1960). Results revealed that item-fit statistics and principal component analysis with standardized residuals provide evidence of MCSDS'…
Descriptors: Social Desirability, Validity, Measures (Individuals), Factor Analysis
Brennan, Mervin M.; Redding, Kenneth R. – 1985
A Teacher Survey was developed for administration to Illinois teachers whose students take the Illinois Inventory of Educational Progress (IIEP). The IIEP are the achievement tests of the state assessment program. The purposes of the Teacher Survey range from determination of a test's curricular validity to investigations of teachers' abilities to…
Descriptors: Academic Achievement, Data Analysis, Elementary Secondary Education, Predictive Validity
Solomon, Alan – 1987
A panel of expert referees from the Philadelphia school district categorized items from secondary-level standardized mathematics tests according to National Assessment of Educational Progress (NAEP) subobjectives for mathematics. The following tests were covered by the study: (1) California Achievement Tests (Levels 19 and 20); (2) Comprehensive…
Descriptors: Content Validity, Educational Objectives, High Schools, Mathematical Concepts
Holden, Ronald R. – 1985
Modern test construction strategies in the areas of personality and psychopathology differ in the use of disguise within test stimulus material. Previous research on the validity of using disguised test item content has favored the rational strategy of test construction which views disguise as a liability under normal test-taking circumstances.…
Descriptors: Adults, Evaluation Methods, Psychopathology, Test Construction
Lin, Miao-Hsiang – 1986
Specific questions addressed in this study include how time limits affect a test's construct and predictive validities, how time limits affect an examinee's time allocation and test performance, and whether the assumption about how examinees answer items is valid. Interactions involving an examinee's sex and age are studied. Two parallel forms of…
Descriptors: Age Differences, Computer Assisted Testing, Construct Validity, Difficulty Level
Holland, Paul W.; Thayer, Dorothy T. – 1985
An alternative definition has been developed of the delta scale of item difficulty used at Educational Testing Service. The traditional delta scale uses an inverse normal transformation based on normal ogive models developed years ago. However, no use is made of this fact in typical uses of item deltas. It is simply one way to make the probability…
Descriptors: Difficulty Level, Error Patterns, Estimation (Mathematics), Item Analysis
Peer reviewed
Antonak, Richard F.; Harth, Robert – Mental Retardation, 1994
Psychometric analyses of data from 230 individuals yielded a 29-item 4-scale revision of the original 50-item 5-scale Mental Retardation Attitude Inventory. Results showed adequate item characteristics; adequate reliability and homogeneity; adequate reliability, homogeneity, specificity, and independence of the four scales; and initial validity…
Descriptors: Attitude Measures, Attitudes toward Disabilities, Mental Retardation, Psychometrics
Peer reviewed
Brambring, M.; Troster, H. – Journal of Visual Impairment and Blindness, 1994
This study evaluated the Bielefeld Developmental Test for Blind Infants and Preschoolers by comparing cognitive performance of blind and sighted children (ages three and four). Results indicated that even this test (with "blind-neutral" items) did not permit a fair comparative assessment, though it did prove suitable for within-group…
Descriptors: Blindness, Cognitive Development, Cognitive Tests, Infants
Furst, Edward J. – 1983
Enough evidence has accumulated on Bloom's "Taxonomy of Educational Objectives" for the cognitive domain to justify a review of its communicability. This article covers both published and unpublished studies as well as certain informal reports that bear on this property. It also examines possibilities for improving agreement among…
Descriptors: Achievement Tests, Classification, Cognitive Processes, Diffusion (Communication)
Korpi, Meg; Haertel, Edward – 1984
The purpose of this paper is to further the cause of clarifying construct interpretations of tests, by proposing that non-metric multidimensional scaling may be more useful than factor analysis or other latent structure models for investigating the internal structure of tests. It also suggests that typical problems associated with scaling…
Descriptors: Correlation, Factor Structure, Intermediate Grades, Item Analysis
Peer reviewed
Collis, Kevin F.; And Others – Journal for Research in Mathematics Education, 1986
Described are procedures followed in developing, administering, and scoring a set of mathematical problem-solving superitems and examining their construct validity through a recently developed evaluation technique associated with a taxonomy of the structure of learned outcomes. Data strongly support the validity of the underlying theoretical…
Descriptors: Educational Research, Elementary Secondary Education, Mathematics Education, Problem Solving
Haladyna, Thomas M. – 1984
The purpose of this study is to examine an option-weighting method as it affects pass-fail decisions in formative and summative evaluation of student achievement for instructional units, certification, advancement, licensure, admissions, placement, and selection. A database was constructed using high school achievement test data where a…
Descriptors: Achievement Tests, Cutting Scores, High Schools, Multiple Choice Tests