ERIC - Search Results

Publication Date

In 2026	0
Since 2025	8
Since 2022 (last 5 years)	16
Since 2017 (last 10 years)	31
Since 2007 (last 20 years)	48

Descriptor

Test Length	113
Test Validity	113
Test Reliability	63
Test Construction	47
Test Items	32
Test Format	23
Foreign Countries	20
Computer Assisted Testing	18
Testing Problems	17
Psychometrics	15
Factor Structure	14
Comparative Analysis	13
Factor Analysis	13
Higher Education	12
Language Tests	12
Scores	12
Adaptive Testing	11
Testing	11
Criterion Referenced Tests	10
Item Response Theory	10
Measures (Individuals)	10
Correlation	9
English (Second Language)	9
Intelligence Tests	9
Item Analysis	9
More ▼

Publication Type

Reports - Research	77
Journal Articles	67
Reports - Evaluative	17
Speeches/Meeting Papers	17
Reports - Descriptive	5
Guides - Non-Classroom	3
Information Analyses	3
Opinion Papers	2
Reference Materials -…	2
Tests/Questionnaires	2
Dissertations/Theses -…	1
Guides - General	1
Numerical/Quantitative Data	1
More ▼

Education Level

Higher Education	15
Postsecondary Education	15
Elementary Education	7
Secondary Education	7
Middle Schools	4
Grade 6	3
High Schools	3
Intermediate Grades	3
Junior High Schools	3
Early Childhood Education	2
Elementary Secondary Education	2
Grade 5	2
Primary Education	2
Grade 2	1
Grade 3	1
Grade 4	1
Grade 7	1
Grade 8	1
More ▼

Audience

Researchers	5
Practitioners	2
Community	1
Support Staff	1

Location

Turkey	5
China	3
United Kingdom	3
Japan	2
California	1
Canada	1
Germany	1
Italy	1
Kenya	1
Michigan	1
New Jersey	1
Pennsylvania	1
Peru	1
Portugal	1
Singapore	1
Spain	1
Vermont	1
More ▼

Laws, Policies, & Programs

Job Training Partnership Act…

What Works Clearinghouse Rating

Test Validity X

Showing 91 to 105 of 113 results Save | Export

Evaluating Writing: Linking Large-Scale Testing and Classroom Assessment. Occasional Paper No. 27.

Download full text

Freedman, Sarah Warshauer – 1991

Writing teachers and educators can add to information from large-scale testing and teachers can strengthen classroom assessment by creating a tight fit between large-scale testing and classroom assessment. Across the years, large-scale testing programs have struggled with a difficult problem: how to evaluate student writing reliably and…

Descriptors: Elementary Secondary Education, Foreign Countries, Informal Assessment, Portfolios (Background Materials)

Comparative Racial Analysis of Enlisted Advancement Exams: Item Differentiation. Final Report.

Download full text

Robertson, David W.; And Others – 1977

A comparative study of item analysis was conducted on the basis of race to determine whether alternative test construction or processing might increase the proportion of black enlisted personnel among those passing various military technical knowledge examinations. The study used data from six specialists at four grade levels and investigated item…

Descriptors: Difficulty Level, Enlisted Personnel, Item Analysis, Occupational Tests

A Preliminary Version of a Scale to Measure Sex-Role Attitudes in the Army. Research Memorandum 76-3.

Download full text

Woelfel, John C.; And Others – 1976

To measure the sex role attitudes of Army personnel, an initial set of 174 items was developed. These items were administered to 721 soldiers at three Army installations; the sample consisted of 540 men and 181 women--401 of these were officers and 320 were enlisted personnel. Factor analysis of these 174 items indicated one strong…

Descriptors: Adults, Attitude Measures, Factor Structure, Females

The Assessment of Writing Proficiency via Qualitative Ratings of Writing Samples.

Steele, Joe M. – 1979

The College Outcome Measures Project/American College Testing Program (COMP/ACT) Writing Assessment is described, and issues of validity and reliability in the assessment of writing samples using qualitative rating scales are explored. COMP/ACT is composed of three role-playing tasks in the social sciences, natural sciences, and arts, which are…

Descriptors: Adults, Essay Tests, Evaluators, Higher Education

Determining Optimal Test Lengths with a Fixed Total Testing Time.

Download full text

Hambleton, Ronald K. – 1986

The problem of determining optimal test lengths with fixed total testing time has proved to be a difficult one for criterion-referenced test developers. An algorithm is needed which can be used by test developers to allocate available testing time to maximize the validity of their total criterion-referenced tests or testing programs. To be…

Descriptors: Algorithms, Criterion Referenced Tests, Elementary Secondary Education, Psychometrics

A Validity Comparison of Adaptive and Conventional Strategies for Mastery Testing.

Download full text

Kingsbury, G. Gage; Weiss, David J. – 1981

Conventional mastery tests designed to make optimal mastery classifications were compared with fixed-length and variable-length adaptive mastery tests. Comparisons between the testing procedures were made across five content areas in an introductory biology course from tests administered to volunteers. The criterion was the student's standing in…

Descriptors: Achievement Tests, Adaptive Testing, Biology, Comparative Analysis

Customized Tests and Customized Norms.

Peer reviewed

Linn, Robert L.; Hambleton, Ronald K. – Applied Measurement in Education, 1991

Four main approaches to customized testing are described, and their resulting scores' valid uses and interpretations are discussed. Customized testing can yield valid normative and curriculum-specific information, although cautious application is needed to avoid misleading inferences about student achievement. (SLD)

Descriptors: Academic Achievement, Accountability, Criterion Referenced Tests, Curriculum

The Second Century of Ability Testing: Some Predictions and Speculations

Peer reviewed

Direct link

Embretson, Susan E. – Measurement: Interdisciplinary Research and Perspectives, 2004

The last century was marked by dazzling changes in many areas, such as technology and communications. Predictions into the second century of testing are seemingly difficult in such a context. Yet, looking back to the turn of the last century, Kirkpatrick (1900), in his American Psychological Association presidential address, presented fundamental…

Descriptors: Ability, Testing, Futures (of Society), Psychometrics

Response Length and Quality in the Grading of Essay Tests.

Tollefson, Nona; Tracy, D. B. – 1979

The validity and reliability of essay scores were examined by comparing the mean scores assigned to good and poor quality essay responses of different lengths written by high school sophomores. In-service and pre-service social studies teachers graded essay responses to a test question requiring knowledge of the Constitutional provisions for…

Descriptors: Essay Tests, Essays, Evaluation Criteria, High Schools

The Factorial Validity of the Study Attitudes and Methods Scale (SAMS).

Peer reviewed

Michael, William B.; And Others – Educational and Psychological Measurement, 1985

A shortened Study Attitudes and Methods Survey (SAMS) was administered to 181 community college students. Four original factors remained in the new version: academic interest--love of learning; study anxiety; manipulation; and alienation toward authority. Academic drive--conformity and study methods were dropped, while facilitative study behaviors…

Descriptors: Attitude Measures, Factor Structure, Item Analysis, School Attitudes

WISC-R Short Forms: Long on Problems.

Boyd, Thomas A.; Tramontana, Michael G. – 1984

To examine the validity of short forms of the Wechsler Intelligence Scale for Children-Revised (WISC-R), the WISC-R was first administered to 106 hospitalized psychiatric patients, aged 8-16. No subjects had a primary diagnosis of mental retardation or learning disability, and one-third were receiving psychotropic medication. WISC-R IQ scores…

Descriptors: Adolescents, Children, Correlation, Elementary Secondary Education

Predictive Validity of Short Form Placement Tests under Two Scoring Systems.

Hisama, Kay K.; And Others – 1977

The optimal test length, using predictive validity as a criterion, depends on two major conditions: the appropriate item-difficulty rather than the total number of items, and the method used in scoring the test. These conclusions were reached when responses to a 100-item multi-level test of reading comprehension from 136 non-native speakers of…

Descriptors: College Students, Difficulty Level, English (Second Language), Foreign Students

Listening, a Single Trait in First and Second Language Learning.

Download full text

de Jong, John H. A. L. – Toegepaste taalwetenschap in artikelen 20, 1984

A study investigated the validity of an English listening skills test by comparing the results of native American and British English speakers with those of Dutch students of English as a second language. A hypothesis suggested that two-thirds of the items would test listening skills and the remaining third would test other knowledge. Test results…

Descriptors: Age Differences, Comparative Analysis, Correlation, Educational Background

Confidence in Pass/Fail Decisions for Computer Adaptive and Paper and Pencil Examinations.

Bergstrom, Betty A.; Lunz, Mary E. – 1991

The level of confidence in pass/fail decisions obtained with computer adaptive tests (CATs) was compared to decisions based on paper-and-pencil tests. Subjects included 645 medical technology students from 238 educational programs across the country. The tests used in this study constituted part of the subjects' review for the certification…

Descriptors: Adaptive Testing, Certification, Comparative Testing, Computer Assisted Testing

Vocational Assessment Instruments Reference Guide. A Review of Interest, Aptitude & Pre-Employment/Job Readiness Tests.

Download full text

New York State Div. for Youth, Albany. – 1985

This guide is designed to serve as a reference to assist providers of Job Training Partnership Act-funded programs in selecting appropriate interest, aptitude, and pre-employment and job readiness tests. Descriptions of 53 interest tests, 38 aptitude tests, and 37 pre-employment and job readiness tests are provided. Each description contains…

Descriptors: Aptitude Tests, Employment Potential, Evaluation Criteria, Guidelines

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

Journal of Psychoeducational…	10
Educational and Psychological…	7
Journal of Educational…	4
Language Testing	4
Psychological Assessment	4
International Journal of…	3
Physical Review Physics…	3
Applied Measurement in…	2
Assessment	2
Journal of Clinical Psychology	2
ACT Education Corp.	1
African Educational Research…	1
Applied Psychological…	1
British Journal of Guidance &…	1
British Journal of Learning…	1
Education 3-13	1
Education and Information…	1
Educational Assessment	1
Educational Research and…	1
Eurasian Journal of…	1
Grantee Submission	1
International Educational…	1
International Journal of…	1
International Journal of…	1
Journal of Career Assessment	1
More ▼

Hambleton, Ronald K.	6
Wainer, Howard	3
Michael, William B.	2
Abrams, Matthew	1
Acar, Selcuk	1
Almeida, Leandro S.	1
Alonso, Jordi	1
Andrea Fuster	1
Andy Rick Sánchez-Villena	1
Anthony, Christopher J.	1
Arbet, Scott E.	1
Arens, A. Katrin	1
Aydin, Selami	1
Bao, Lei	1
Basman, Munevver	1
Bergstrom, Betty A.	1
Boer, Marian	1
Bond, Mark	1
Boyd, Lenore A.	1
Boyd, Thomas A.	1
Brown, Steven D.	1
Browne, Janet	1
Bruce, K.	1
Bullick, Stephanie	1
More ▼

Minnesota Multiphasic…	4
Wechsler Adult Intelligence…	4
Test of English as a Foreign…	3
Wechsler Intelligence Scale…	3
Peabody Picture Vocabulary…	2
ACT Assessment	1
Academic Motivation Scale	1
Adaptive Behavior Scale	1
Bar Examinations	1
Developmental Indicators for…	1
Force Concept Inventory	1
General Educational…	1
International English…	1
Kaufman Brief Intelligence…	1
Marlowe Crowne Social…	1
McCarthy Scales of Childrens…	1
Multidimensional…	1
NEO Five Factor Inventory	1
National Assessment of…	1
Positive and Negative Affect…	1
Self Description Questionnaire	1
Sensation Seeking Scale	1
Stanford Achievement Tests	1
Stanford Binet Intelligence…	1
Wechsler Intelligence Scales…	1
More ▼