ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	1

Descriptor

Statistical Analysis	35
Test Validity	35
Testing Problems	35
Test Construction	13
Test Reliability	13
Test Bias	10
Item Analysis	9
Scores	8
Testing	7
Achievement Tests	6
Elementary Secondary Education	6
Measurement Techniques	6
Test Interpretation	6
Test Items	6
Multiple Choice Tests	5
Correlation	4
Culture Fair Tests	4
Evaluation Criteria	4
Language Tests	4
Mathematical Models	4
Psychometrics	4
Research Methodology	4
Standardized Tests	4
Test Results	4
Aptitude Tests	3
More ▼

Source

Didakometry	1
ETS Research Report Series	1
Education and Urban Society	1
Educational and Psychological…	1
NCME Measurement in Education	1
Psychometrika	1

Publication Type

Reports - Research	19
Speeches/Meeting Papers	8
Reports - Evaluative	3
Information Analyses	2
Collected Works - Proceedings	1
Collected Works - Serials	1
Journal Articles	1
Opinion Papers	1
Reference Materials -…	1
Tests/Questionnaires	1

Education Level

Audience

Researchers

Location

Netherlands	2
California (Stanford)	1
Canada	1
China	1
Colorado (Denver)	1
Minnesota	1
Sweden	1

Laws, Policies, & Programs

Assessments and Surveys

General Aptitude Test Battery	2
Armed Services Vocational…	1
Metropolitan Achievement Tests	1
Metropolitan Readiness Tests	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 35 results Save | Export

Preparing for the Speaking Tasks of the "TOEFL iBT"® Test: An Investigation of the Journeys of Chinese Test Takers. "TOEFL iBT"® Research Report. TOEFL iBT-28. ETS Research Report. RR-17-19

Peer reviewed
PDF on ERIC

Download full text

Yu, Guoxing; He, Lianzhen; Rea-Dickins, Pauline; Kiely, Richard; Lu, Yanbin; Zhang, Jing; Zhang, Yan; Xu, Shasha; Fang, Lin – ETS Research Report Series, 2017

Language test preparation has often been studied within the consequential validity framework in relation to ethics, equity, fairness, and washback of assessment. The use of independent and integrated speaking tasks in the "TOEFL iBT"® test represents a significant development and innovation in assessing speaking ability in academic…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Oral Language

A Statistical Procedure for Assessing Test Dimensionality. Measurement Series 84-2.

Stout, William – 1984

An important problem in psychological test theory is the development of a sound method for determining whether a test which purports to measure the level of a certain ability is, in reality, significantly contaminated by one or more other abilities displayed by persons taking the test. Because of the large number of private and governmental…

Descriptors: Latent Trait Theory, Statistical Analysis, Statistical Distributions, Test Validity

A Quick Method for Determining Test Bias

Peer reviewed

Echternacht, Gary – Educational and Psychological Measurement, 1974

Descriptors: Evaluation Criteria, Probability, Statistical Analysis, Test Bias

How to Tell if a Test Measures the Same Thing in Different Cultures.

Download full text

Frederiksen, Norman – 1976

A number of different ways of ascertaining whether or not a test measures the same thing in different cultures are examined. Methods range from some that are obvious and simple to those requiring statistical and psychological sophistication. Simpler methods include such things as having candidates "think aloud" and interviewing them about how they…

Descriptors: Analysis of Covariance, Culture Fair Tests, Factor Analysis, Item Analysis

GATB: Does the Apparatus Make a Difference?

Download full text

Kapes, Jerome T. – 1975

Two independent studies were conducted to investigate possible differences in General Aptitude Test Battery (GATB) aptitude M resulting from the use of different test equipment (wooden vs. plastic apparatus.) As part of a ten-year longitudinal study of Vocational Development being conducted in the Department of Vocational Education at The…

Descriptors: Aptitude Tests, Comparative Analysis, Elementary Secondary Education, Scores

Frequency Words and Frequencies: A Pilot Study on Relations Between Differently Anchored Scales. Didakometry; No. 44, November 1974.

Download full text

Larsson, Bernt – Didakometry, 1974

Subjects are asked to answer six questions, partly with a frequency and partly by marking a verbally anchored scale with five categories. Some univariate and multivariate analyses are performed to elucidate the relations between variables with the two different modes of response. Although there are similarities in results for the two types of…

Descriptors: Measurement Techniques, Measures (Individuals), Rating Scales, Responses

Fairness in Educational Achievement Testing

Peer reviewed

Tittle, Carol Kehr – Education and Urban Society, 1975

The purpose of this paper is to describe a set of procedures, that, when carried out, permit the conclusion that a test is a fair measure from the standpoint of specific sub-groups within a test population. A fair test is defined as a test for which a set of data-collection procedures have been carried out and the results reported. (Author/JM)

Descriptors: Academic Achievement, Achievement Tests, Evaluation Criteria, Measurement Techniques

Evaluation Design Project: Multilevel Interpretation of Evaluation Data Study.

Download full text

Miller, M. David; Burstein, Leigh – 1981

Two studies are presented in this report. The first is titled "Empirical Studies of Multilevel Approaches to Test Development and Interpretation: Measuring Between-Group Differences in Instruction." Because of a belief that schooling does affect student achievement, researchers have questioned the empirical and measurement techniques…

Descriptors: Error Patterns, Evaluation Methods, Item Analysis, Models

A Model for Assessing the Effects of Departures from Reality in Performance Testing.

Download full text

Morse, David T.; Morse, Linda W. – 1976

Performance testing often entails the usage of expensive, time-consuming measures in the quest for determining the level of performance on some desired behavior. It is concluded that a generalizability theory approach to dealing with departures from reality in testing can aid in the establishment of empirically-based choices of measurement…

Descriptors: Cost Effectiveness, Decision Making, Mathematical Models, Measurement Techniques

Reducing Bias in Achievement Tests.

Download full text

Green, Donald Ross – 1976

During the past few years the problem of bias in testing has become an increasingly important issue. In most research, bias refers to the fair use of tests and has thus been defined in terms of an outside criterion measure of the performance being predicted by the test. Recently however, there has been growing interest in assessing bias when such…

Descriptors: Achievement Tests, Item Analysis, Mathematical Models, Minority Groups

Test-Wiseness Cues in the Options of Mathematics Items.

Kuntz, Patricia – 1982

The quality of mathematics multiple choice items and their susceptibility to test wiseness were examined. Test wiseness was defined as "a subject's capacity to utilize the characteristics and formats of the test and/or test taking situation to receive a high score." The study used results of the Graduate Record Examinations Aptitude Test (GRE) and…

Descriptors: Cues, Item Analysis, Multiple Choice Tests, Psychometrics

A New Method of Assessing Bias in Test Items.

Download full text

Scheuneman, Janice – 1975

In order to screen out items which may be biased against some ethnic group prior to the final selection of items in test construction, a statistical technique for assessing item bias was developed. Based on a theoretical formulation of R. B. Darlington, the method compares the performance of individuals who belong to different ethnic groups, but…

Descriptors: Achievement Tests, Content Analysis, Cultural Influences, Ethnic Groups

Estimation of Latent Ability and Item Parameters When There Are Omitted Responses

Peer reviewed

Lord, Frederic M. – Psychometrika, 1974

Omitted items cannot properly be treated as wrong when estimating ability and item parameters. A convenient method for utilizing the information provided by omissions is presented. Theoretical and empirical justifications are presented for the estimates obtained by the new method. (Author)

Descriptors: Academic Ability, Guessing (Tests), Item Analysis, Latent Trait Theory

Issues in Testing for Competency.

Download full text

Ebel, Robert L.; Livingston, Samuel A. – NCME Measurement in Education, 1981

This issue of Measurement in Education is presented in the form of a dialogue between Dr. Robert L. Ebel, Distinguished Professor of Educational Measurement at Michigan State University, and Dr. Samual A. Livingston, Program Research Scientist at the Educational Testing Service. Alternative views on some aspects of the use of tests in assessing…

Descriptors: Competence, Criterion Referenced Tests, Multiple Choice Tests, Norm Referenced Tests

Introduction to Rasch Measurement: Some Implications for Languages.

Theunissen, Phiel J. J. M. – 1983

Any systematic approach to the assessment of students' ability implies the use of a model. The more explicit the model is, the more its users know about what they are doing and what the consequences are. The Rasch model is a strong model where measurement is a bonus of the model itself. It is based on four ideas: (1) separation of observable…

Descriptors: Ability Grouping, Difficulty Level, Evaluation Criteria, Item Sampling

Previous Page | Next Page »

Pages: 1 | 2 | 3

Hurley, Christine	2
Spicuzza, Richard	2
Thurlow, Martha	2
ANDRADE, MANUEL	1
Barker, Pierce	1
Bleistein, Carole A.	1
Bormuth, John R.	1
Broussard, Rolland L.	1
Burstein, Leigh	1
Ebel, Robert L.	1
Echternacht, Gary	1
El Sawaf, Hamdy	1
Erickson, Ronald	1
Fang, Lin	1
Frederiksen, Norman	1
Gordon, Howard R. D.	1
Green, Donald Ross	1
Hambleton, Ronald K.	1
He, Lianzhen	1
Hendrickson, Gerry F.	1
Kane, Robert B.	1
Kapes, Jerome T.	1
Kiely, Richard	1
More ▼