Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 1 |
Descriptor
Author
Publication Type
Education Level
| Higher Education | 1 |
| Postsecondary Education | 1 |
Audience
| Researchers | 13 |
| Practitioners | 4 |
| Counselors | 1 |
| Teachers | 1 |
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Wielicki, Tom – International Association for Development of the Information Society, 2016
This paper reports on longitudinal study regarding integrity of testing in an online format as used by e-learning platforms. Specifically, this study explains whether online testing, which implies an open book format is compromising integrity of assessment by encouraging cheating among students. Statistical experiment designed for this study…
Descriptors: Integrity, Online Courses, Statistical Surveys, Longitudinal Studies
Thrash, Susan K.; Porter, Andrew C. – 1974
The purpose of this paper is to prove that one currently recommended method of obtaining the reliability of an instrument defined on a population of aggregate units is invalid. This method randomly splits the aggregate into two halves, correlates the two half unit scores by a Pearson product moment correlation coefficient, and corrects the…
Descriptors: Comparative Analysis, Correlation, Measurement Techniques, Sampling
Osborn, William C. – 1977
Four essential dimensions of a performance test are detailed: directness of test method, type of criterion, standardization of conditions, and objectivity of scoring. For simplicity these factors are described as if each were dichotomous, when in actuality each is a continuum; a test method may be more or less direct, conditions more or less…
Descriptors: Performance Tests, Scoring, Test Reliability, Test Validity
Paradis, Edward; Peterson, Joe – 1977
This study, involving 131 students in grades ten, eleven, and twelve, investigated the effects of order of administration of subtests on scores from the Nelson-Denny Reading Test. Results indicated that order of administration had no significant effect on scores from the vocabulary subtest or on the total test score, but subjects taking the…
Descriptors: Reading Research, Reading Tests, Secondary Education, Test Reliability
Andrich, David – 1984
Both the attenuation paradox of traditional test theory and the assumption of local independence in person-item response theory have caused problems in interpretation. This paper demonstrates that the two are related concepts, and, through this demonstration, both are clarified. It is demonstrated that the breakdown of local independence leads to…
Descriptors: Latent Trait Theory, Test Interpretation, Test Items, Test Reliability
PDF pending restorationCundiff, D.; Schwane, J. – 1977
Observations during research involving the Bruce Treadmill Test (BTMT) indicating that Stage III for females and Stage IV for males represented speeds which are intermediate between comfortable walking and confortable jogging for many subjects, prompted this study to determine ways to obtain more consistent group results. Twenty-eight subjects…
Descriptors: Measurement Instruments, Measurement Techniques, Physical Activities, Predictor Variables
Alliger, R. J.; Harvey, A. L. – 1984
This article discusses practical and theoretical problems related to the measurement of formal operations. The first section of the article discusses problems in measuring formal operations using the clinical interview method. These problems include the lack of both a standardized interview and a uniform scoring procedure. Section two discusses…
Descriptors: Developmental Stages, Group Testing, Interviews, Objective Tests
Vandiver, Richard – 1974
The variable of permissiveness developed and measured by Ira Reiss in the form of a Guttman scale of premarital sexual permissiveness was subjected to critical analysis. Both conceptual analysis and testing of questions regarding methodology and interpretation of the scale were used. Several questions were raised about the Reiss scale including:…
Descriptors: College Students, Data Analysis, Permissive Environment, Research Projects
Hoover, Randy L.; Kadunc, Nancy – 1983
The purpose of this paper is to examine the nature of discrepancy score phenomena of the Myers-Briggs Type Indicator (MBTI), as related to internal consistency and construct validity of the instrument. Data were collected from 140 university research managers. The data suggest internal consistency problems: only 37.3 percent of the subjects…
Descriptors: Adults, Personality Measures, Personality Traits, Sampling
Cahan, Sorel; Cohen, Nora – 1987
Two types of classification error are possible in competency tests: erroneous classification of an individual as a "master" of the subject (Type II error), and erroneous classification of a master as a "nonmaster" of the subject (Type I). If steps are taken to minimize Type II errors, an artificially high number of true masters…
Descriptors: Classification, Cutting Scores, Foreign Countries, Mastery Tests
Ekstrom, Ruth B. – 1979
Three areas of concern related to test bias and validity should be considered during the revision of the Standards for Educational and Psychological Tests. The first area concerns the sources and consequences of test bias. Five sources of bias have been identified: numerical bias, role bias, status bias, stereotypic bias, and familiarity bias. The…
Descriptors: Evaluation Criteria, Psychometrics, Test Bias, Test Construction
Schumacker, Randall E.; Harris, Mark J. – 1991
Designing a test using three-parameter item response theory (IRT) is discussed. A brief review of IRT is followed by a discussion of two types of test design: (1) selecting items using confidence envelopes (confidence envelope method); and (2) using item characteristic curves and their confidence intervals (test envelope method). The confidence…
Descriptors: Ability, Equations (Mathematics), Item Banks, Item Response Theory
Johnsen, Susan K. – 1989
This paper rates the technical aspects of 41 tests that match characteristics associated with giftedness, using criteria developed in "A Consumer's Guide to Tests in Print" (Hammill, Brown, Bryant, 1989). A summary of validity studies showing these tests' use in identifying gifted youngsters for special programs is also provided. In this sample of…
Descriptors: Aptitude Tests, Elementary Secondary Education, Gifted, Screening Tests
Vernon, Philip E. – 1979
Attention is drawn to the ways in which current conceptions of intelligence and its measurement differ from those which were generally accepted in 1928. The following principles underlying intelligence testing were generally agreed upon in 1928: (1) the assumption of intelligence as a recognizable attribute, responsible for differences among…
Descriptors: Cognitive Development, Educational History, Intelligence, Intelligence Quotient
Jacko, Edward J.; Huck, Schuyler W. – 1974
The Alpert-Haber Achievement Anxiety Test was developed to measure the extent to which individuals experience test anxiety. In at least two published studies, the authors claim to have used the test when in fact the response format was changed from that used in the original instrument and the "buffer" items were omitted. To investigate…
Descriptors: Achievement Tests, Anxiety, College Students, Comparative Analysis


