ERIC - Search Results

Publication Date

In 2026	0
Since 2025	8
Since 2022 (last 5 years)	16
Since 2017 (last 10 years)	31
Since 2007 (last 20 years)	48

Descriptor

Test Length	113
Test Validity	113
Test Reliability	63
Test Construction	47
Test Items	32
Test Format	23
Foreign Countries	20
Computer Assisted Testing	18
Testing Problems	17
Psychometrics	15
Factor Structure	14
Comparative Analysis	13
Factor Analysis	13
Higher Education	12
Language Tests	12
Scores	12
Adaptive Testing	11
Testing	11
Criterion Referenced Tests	10
Item Response Theory	10
Measures (Individuals)	10
Correlation	9
English (Second Language)	9
Intelligence Tests	9
Item Analysis	9
More ▼

Publication Type

Reports - Research	77
Journal Articles	67
Reports - Evaluative	17
Speeches/Meeting Papers	17
Reports - Descriptive	5
Guides - Non-Classroom	3
Information Analyses	3
Opinion Papers	2
Reference Materials -…	2
Tests/Questionnaires	2
Dissertations/Theses -…	1
Guides - General	1
Numerical/Quantitative Data	1
More ▼

Education Level

Higher Education	15
Postsecondary Education	15
Elementary Education	7
Secondary Education	7
Middle Schools	4
Grade 6	3
High Schools	3
Intermediate Grades	3
Junior High Schools	3
Early Childhood Education	2
Elementary Secondary Education	2
Grade 5	2
Primary Education	2
Grade 2	1
Grade 3	1
Grade 4	1
Grade 7	1
Grade 8	1
More ▼

Audience

Researchers	5
Practitioners	2
Community	1
Support Staff	1

Location

Turkey	5
China	3
United Kingdom	3
Japan	2
California	1
Canada	1
Germany	1
Italy	1
Kenya	1
Michigan	1
New Jersey	1
Pennsylvania	1
Peru	1
Portugal	1
Singapore	1
Spain	1
Vermont	1
More ▼

Laws, Policies, & Programs

Job Training Partnership Act…

What Works Clearinghouse Rating

Test Validity X

Showing 76 to 90 of 113 results Save | Export

Test Length and Validity: An Application of Test Theory to a Finite World.

Myers, Charles T. – 1978

The viewpoint is expressed that adding to test reliability by either selecting a more homogeneous set of items, restricting the range of item difficulty as closely as possible to the most efficient level, or increasing the number of items will not add to test validity and that there is considerable danger that efforts to increase reliability may…

Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Test Construction

A Comparison of the Performance of Simulated Hierarchical and Linear Testlets.

Peer reviewed

Wainer, Howard; And Others – Journal of Educational Measurement, 1992

Computer simulations were run to measure the relationship between testlet validity and factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Making a testlet adaptive yields only modest increases in aggregate validity because of the peakedness of the typical proficiency distribution. (Author/SLD)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation

Testing the Robustness of DIMTEST on Nonnormal Ability Distributions.

Download full text

Nandakumar, Ratna; Yu, Feng – 1994

DIMTEST is a statistical test procedure for assessing essential unidimensionality of binary test item responses. The test statistic T used for testing the null hypothesis of essential unidimensionality is a nonparametric statistic. That is, there is no particular parametric distribution assumed for the underlying ability distribution or for the…

Descriptors: Ability, Content Validity, Correlation, Nonparametric Statistics

A Comparison of an Expert Systems Approach to Computerized Adaptive Testing and an Item Response Theory Model.

Download full text

Frick, Theodore W. – 1991

Expert systems can be used to aid decisionmaking. A computerized adaptive test is one kind of expert system, although not commonly recognized as such. A new approach, termed EXSPRT, was devised that combines expert systems reasoning and sequential probability ratio test stopping rules. Two versions of EXSPRT were developed, one with random…

Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Expert Systems

A Method for Determining the Length of Criterion-Referenced Tests Using Reliability and Validity Indices.

Download full text

Mills, Craig N.; Simon, Robert – 1981

When criterion-referenced tests are used to assign examinees to states reflecting their performance level on a test, the better known methods for determining test length, which consider relationships among domain scores and errors of measurement, have their limitations. The purpose of this paper is to present a computer system named TESTLEN, which…

Descriptors: Computer Assisted Testing, Criterion Referenced Tests, Cutting Scores, Error of Measurement

An Investigation of Proposed Revisions to Section 3 of the TOEFL Test. TOEFL Research Report 47.

Download full text

Schedl, Mary; And Others – 1995

The Test of English as a Foreign Language (TOEFL) program is exploring a change in Section 3 of the TOEFL test that would replace the vocabulary subpart with additional reading comprehension questions. This study investigated the proposed revision in terms of the length and timing that would be necessary to address concerns of test speededness of…

Descriptors: Adult Students, English (Second Language), Language Tests, Psychometrics

Practical Procedures for Constructing Mastery Tests to Minimize Errors of Classification and to Maximize or Optimize Decision Reliability.

Byars, Alvin Gregg – 1980

The objectives of this investigation are to develop, describe, assess, and demonstrate procedures for constructing mastery tests to minimize errors of classification and to maximize decision reliability. The guidelines are based on conditions where item exchangeability is a reasonable assumption and the test constructor can control the number of…

Descriptors: Cutting Scores, Difficulty Level, Grade 4, Intermediate Grades

An Investigation of the Differential Effort Received by Items on a Low-Stakes Computer-Based Test

Peer reviewed

Direct link

Wise, Steven L. – Applied Measurement in Education, 2006

In low-stakes testing, the motivation levels of examinees are often a matter of concern to test givers because a lack of examinee effort represents a direct threat to the validity of the test data. This study investigated the use of response time to assess the amount of examinee effort received by individual test items. In 2 studies, it was found…

Descriptors: Computer Assisted Testing, Motivation, Test Validity, Item Response Theory

Preschool Assessment Instrument Ratings Guide.

Metropolitan Atlanta Consortium of Consultants and Lead Speech-Language Pathologists, GA. – 1990

This guide presents ratings of assessment instruments for use by speech-language pathologists with preschool students. Tests are reviewed in alphabetical order on forms filled out by practicing speech-language pathologists, including data on speech components covered by each test, age range, factors of norms where norms are used, reliability,…

Descriptors: Diagnostic Tests, Examiners, Preschool Education, Preschool Tests

The Effect of Test Speededness and Random Guessing on the Validity of Reading Comprehension Scores.

Jolly, S. Jean; And Others – 1985

Scores from the Stanford Achievement Tests administered to 50,000 students in Palm Beach County, Florida, were studied in order to determine whether the speeded nature of the reading comprehension subtest was related to inconsistencies in the score profiles. Specifically, the probable effect of random guessing was examined. Reading scores were…

Descriptors: Achievement Tests, Elementary Secondary Education, Guessing (Tests), Item Analysis

Comparison of Difficulties and Reliabilities of Math-Completion and Multiple-Choice Item Formats.

Download full text

Oosterhof, Albert C.; Coats, Pamela K. – 1981

Instructors who develop classroom examinations that require students to provide a numerical response to a mathematical problem are often very concerned about the appropriateness of the multiple-choice format. The present study augments previous research relevant to this concern by comparing the difficulty and reliability of multiple-choice and…

Descriptors: Comparative Analysis, Difficulty Level, Grading, Higher Education

Effects of Test Length and Advancement Score on Several Criterion-Referenced Test Reliability and Validity Indices. Laboratory of Psychometric and Evaluation Research Report No. 86.

Download full text

Eignor, Daniel R.; Hambleton, Ronald K. – 1979

The purpose of the investigation was to obtain some relationships among (1) test lengths, (2) shape of domain-score distributions, (3) advancement scores, and (4) several criterion-referenced test score reliability and validity indices. The study was conducted using computer simulation methods. The values of variables under study were set to be…

Descriptors: Comparative Analysis, Computer Assisted Testing, Criterion Referenced Tests, Cutting Scores

Criterion-Referenced Testing and Measurement: A Review of Technical Issues and Developments

Peer reviewed

Hambleton, Ronald K.; And Others – Review of Educational Research, 1978

Reviewing psychometric and statistical developments in criterion- referenced testing, this paper presents six sections: uses of criterion- referenced test scores, reliability of criterion-referenced test scores, determination of test length, determination of cut-off scores, test development and validation, and summary and suggestions for further…

Descriptors: Criterion Referenced Tests, Cutting Scores, Mastery Tests, Mathematical Models

An Adaptive Algebra Test: A Testlet-Based, Hierarchically-Structured Test with Validity-Based Scoring. Technical Report No. 90-92.

Download full text

Wainer, Howard; And Others – 1990

The initial development of a testlet-based algebra test was previously reported (Wainer and Lewis, 1990). This account provides the details of this excursion into the use of hierarchical testlets and validity-based scoring. A pretest of two 15-item hierarchical testlets was carried out in which examinees' performance on a 4-item subset of each…

Descriptors: Adaptive Testing, Algebra, Comparative Analysis, Computer Assisted Testing

The Implications of Accommodations in Testing Students with Disabilities.

Download full text

Hopper, Margaret F. – 2001

This paper provides an overview of the types of testing accommodations used for students with disabilities and presents arguments for and against their use. It begins by discussing student participation in educational assessments and federal requirements concerning the participation of students with disabilities. The types of accommodations are…

Descriptors: Academic Accommodations (Disabilities), Academic Standards, Disabilities, Educational Assessment

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

Journal of Psychoeducational…	10
Educational and Psychological…	7
Journal of Educational…	4
Language Testing	4
Psychological Assessment	4
International Journal of…	3
Physical Review Physics…	3
Applied Measurement in…	2
Assessment	2
Journal of Clinical Psychology	2
ACT Education Corp.	1
African Educational Research…	1
Applied Psychological…	1
British Journal of Guidance &…	1
British Journal of Learning…	1
Education 3-13	1
Education and Information…	1
Educational Assessment	1
Educational Research and…	1
Eurasian Journal of…	1
Grantee Submission	1
International Educational…	1
International Journal of…	1
International Journal of…	1
Journal of Career Assessment	1
More ▼

Hambleton, Ronald K.	6
Wainer, Howard	3
Michael, William B.	2
Abrams, Matthew	1
Acar, Selcuk	1
Almeida, Leandro S.	1
Alonso, Jordi	1
Andrea Fuster	1
Andy Rick Sánchez-Villena	1
Anthony, Christopher J.	1
Arbet, Scott E.	1
Arens, A. Katrin	1
Aydin, Selami	1
Bao, Lei	1
Basman, Munevver	1
Bergstrom, Betty A.	1
Boer, Marian	1
Bond, Mark	1
Boyd, Lenore A.	1
Boyd, Thomas A.	1
Brown, Steven D.	1
Browne, Janet	1
Bruce, K.	1
Bullick, Stephanie	1
More ▼

Minnesota Multiphasic…	4
Wechsler Adult Intelligence…	4
Test of English as a Foreign…	3
Wechsler Intelligence Scale…	3
Peabody Picture Vocabulary…	2
ACT Assessment	1
Academic Motivation Scale	1
Adaptive Behavior Scale	1
Bar Examinations	1
Developmental Indicators for…	1
Force Concept Inventory	1
General Educational…	1
International English…	1
Kaufman Brief Intelligence…	1
Marlowe Crowne Social…	1
McCarthy Scales of Childrens…	1
Multidimensional…	1
NEO Five Factor Inventory	1
National Assessment of…	1
Positive and Negative Affect…	1
Self Description Questionnaire	1
Sensation Seeking Scale	1
Stanford Achievement Tests	1
Stanford Binet Intelligence…	1
Wechsler Intelligence Scales…	1
More ▼