Showing 1 to 15 of 35 results
Peer reviewed
Yu, Guoxing; He, Lianzhen; Rea-Dickins, Pauline; Kiely, Richard; Lu, Yanbin; Zhang, Jing; Zhang, Yan; Xu, Shasha; Fang, Lin – ETS Research Report Series, 2017
Language test preparation has often been studied within the consequential validity framework in relation to ethics, equity, fairness, and washback of assessment. The use of independent and integrated speaking tasks in the "TOEFL iBT"® test represents a significant development and innovation in assessing speaking ability in academic…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Oral Language
Stout, William – 1984
An important problem in psychological test theory is the development of a sound method for determining whether a test which purports to measure the level of a certain ability is, in reality, significantly contaminated by one or more other abilities displayed by persons taking the test. Because of the large number of private and governmental…
Descriptors: Latent Trait Theory, Statistical Analysis, Statistical Distributions, Test Validity
Peer reviewed
Echternacht, Gary – Educational and Psychological Measurement, 1974
Descriptors: Evaluation Criteria, Probability, Statistical Analysis, Test Bias
Frederiksen, Norman – 1976
A number of different ways of ascertaining whether or not a test measures the same thing in different cultures are examined. Methods range from some that are obvious and simple to those requiring statistical and psychological sophistication. Simpler methods include such things as having candidates "think aloud" and interviewing them about how they…
Descriptors: Analysis of Covariance, Culture Fair Tests, Factor Analysis, Item Analysis
Kapes, Jerome T. – 1975
Two independent studies were conducted to investigate possible differences in General Aptitude Test Battery (GATB) aptitude M resulting from the use of different test equipment (wooden vs. plastic apparatus). As part of a ten-year longitudinal study of Vocational Development being conducted in the Department of Vocational Education at The…
Descriptors: Aptitude Tests, Comparative Analysis, Elementary Secondary Education, Scores
Larsson, Bernt – Didakometry, 1974
Subjects are asked to answer six questions, partly with a frequency and partly by marking a verbally anchored scale with five categories. Some univariate and multivariate analyses are performed to elucidate the relations between variables with the two different modes of response. Although there are similarities in results for the two types of…
Descriptors: Measurement Techniques, Measures (Individuals), Rating Scales, Responses
Peer reviewed
Tittle, Carol Kehr – Education and Urban Society, 1975
The purpose of this paper is to describe a set of procedures that, when carried out, permit the conclusion that a test is a fair measure from the standpoint of specific sub-groups within a test population. A fair test is defined as a test for which a set of data-collection procedures has been carried out and the results reported. (Author/JM)
Descriptors: Academic Achievement, Achievement Tests, Evaluation Criteria, Measurement Techniques
Miller, M. David; Burstein, Leigh – 1981
Two studies are presented in this report. The first is titled "Empirical Studies of Multilevel Approaches to Test Development and Interpretation: Measuring Between-Group Differences in Instruction." Because of a belief that schooling does affect student achievement, researchers have questioned the empirical and measurement techniques…
Descriptors: Error Patterns, Evaluation Methods, Item Analysis, Models
Morse, David T.; Morse, Linda W. – 1976
Performance testing often entails the usage of expensive, time-consuming measures in the quest for determining the level of performance on some desired behavior. It is concluded that a generalizability theory approach to dealing with departures from reality in testing can aid in the establishment of empirically-based choices of measurement…
Descriptors: Cost Effectiveness, Decision Making, Mathematical Models, Measurement Techniques
Green, Donald Ross – 1976
During the past few years the problem of bias in testing has become an increasingly important issue. In most research, bias refers to the fair use of tests and has thus been defined in terms of an outside criterion measure of the performance being predicted by the test. Recently, however, there has been growing interest in assessing bias when such…
Descriptors: Achievement Tests, Item Analysis, Mathematical Models, Minority Groups
Kuntz, Patricia – 1982
The quality of mathematics multiple choice items and their susceptibility to test wiseness were examined. Test wiseness was defined as "a subject's capacity to utilize the characteristics and formats of the test and/or test taking situation to receive a high score." The study used results of the Graduate Record Examinations Aptitude Test (GRE) and…
Descriptors: Cues, Item Analysis, Multiple Choice Tests, Psychometrics
Scheuneman, Janice – 1975
In order to screen out items which may be biased against some ethnic group prior to the final selection of items in test construction, a statistical technique for assessing item bias was developed. Based on a theoretical formulation of R. B. Darlington, the method compares the performance of individuals who belong to different ethnic groups, but…
Descriptors: Achievement Tests, Content Analysis, Cultural Influences, Ethnic Groups
Peer reviewed
Lord, Frederic M. – Psychometrika, 1974
Omitted items cannot properly be treated as wrong when estimating ability and item parameters. A convenient method for utilizing the information provided by omissions is presented. Theoretical and empirical justifications are presented for the estimates obtained by the new method. (Author)
Descriptors: Academic Ability, Guessing (Tests), Item Analysis, Latent Trait Theory
Ebel, Robert L.; Livingston, Samuel A. – NCME Measurement in Education, 1981
This issue of Measurement in Education is presented in the form of a dialogue between Dr. Robert L. Ebel, Distinguished Professor of Educational Measurement at Michigan State University, and Dr. Samuel A. Livingston, Program Research Scientist at the Educational Testing Service. Alternative views on some aspects of the use of tests in assessing…
Descriptors: Competence, Criterion Referenced Tests, Multiple Choice Tests, Norm Referenced Tests
Theunissen, Phiel J. J. M. – 1983
Any systematic approach to the assessment of students' ability implies the use of a model. The more explicit the model is, the more its users know about what they are doing and what the consequences are. The Rasch model is a strong model where measurement is a bonus of the model itself. It is based on four ideas: (1) separation of observable…
Descriptors: Ability Grouping, Difficulty Level, Evaluation Criteria, Item Sampling