ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	12
Since 2017 (last 10 years)	28
Since 2007 (last 20 years)	48

Descriptor

Test Construction	150
Test Length	150
Test Items	66
Test Validity	47
Test Reliability	42
Computer Assisted Testing	32
Test Format	27
Adaptive Testing	26
Item Banks	21
Psychometrics	20
Testing Problems	20
Higher Education	19
Item Analysis	19
Item Response Theory	19
Criterion Referenced Tests	16
Foreign Countries	16
Scores	15
Cutting Scores	14
Mastery Tests	14
Measurement Techniques	13
Sample Size	13
Achievement Tests	12
Mathematical Models	12
Difficulty Level	11
Factor Structure	11
More ▼

Publication Type

Reports - Research	100
Journal Articles	78
Reports - Evaluative	25
Speeches/Meeting Papers	25
Reports - Descriptive	9
Guides - Non-Classroom	7
Numerical/Quantitative Data	7
Opinion Papers	6
Information Analyses	4
Tests/Questionnaires	4
Dissertations/Theses -…	3
Collected Works - Serials	1
Guides - General	1
Historical Materials	1
Reports - General	1
More ▼

Education Level

Higher Education	9
Postsecondary Education	8
Secondary Education	5
Elementary Education	4
High Schools	4
Elementary Secondary Education	2
Grade 3	2
Grade 6	2
Intermediate Grades	2
Middle Schools	2
Early Childhood Education	1
Grade 4	1
Grade 5	1
Junior High Schools	1
Primary Education	1
More ▼

Audience

Researchers	5
Practitioners	2
Administrators	1

Location

Australia	2
Canada	2
China	2
Ireland	2
Singapore	2
United Kingdom	2
Asia	1
Germany	1
Israel	1
Italy	1
Japan	1
Netherlands	1
New Jersey	1
New Zealand	1
Pennsylvania	1
Poland	1
Rhode Island	1
South Korea	1
Turkey	1
United Kingdom (England)	1
United States	1
More ▼

Laws, Policies, & Programs

Race to the Top

What Works Clearinghouse Rating

Test Construction X

Showing 91 to 105 of 150 results Save | Export

Passing Score and Length of a Mastery Test.

van der Linden, Wim J. – Evaluation in Education: International Progress, 1982

In mastery testing a linear relationship between an optimal passing score and test length is presented with a new optimization criterion. The usual indifference zone approach, a binomial error model, decision errors, and corrections for guessing are discussed. Related results in sequential testing and the latent class approach are included. (CM)

Descriptors: Cutting Scores, Educational Testing, Mastery Tests, Mathematical Models

Discriminability in Multidimensional Performance Evaluations.

Peer reviewed

Kafry, Ditsa; And Others – Applied Psychological Measurement, 1979

A series of behavioral expectation scale applications were analyzed in an attempt to point out an appropriate number of dimensions to be included in such studies. Results reflected the problems of dimension interdependence when the number of dimensions exceeds nine. (Author/JKS)

Descriptors: Behavior Rating Scales, Expectation, Factor Analysis, Higher Education

The Tridimensional Personality Questionnaire: Reliability and Validity Studies and Derivation of a Short Form.

Peer reviewed

Sher, Kenneth J.; And Others – Psychological Assessment, 1995

Interrelated analyses were conducted with more than 4,000 college students to examine the reliability and validity of the Tridimensional Personality Questionnaire (TPQ) and to develop and validate a short version of the scale. Results provide moderate support for the reliability and validity of both the TPQ and the short form. (SLD)

Descriptors: College Students, Factor Analysis, Higher Education, Personality Assessment

Investigating the Effects of Increased SAT Reasoning Test™ Length and Time on Performance of Regular SAT® Examinees. Research Report No. 2006-9

Download full text

Wang, Xiang Bo – College Board, 2007

This research examines the effect of increased testing time by comparing the four performance indices of randomly equivalent examinee subpopulations on sections of similar content and difficulty administered at different times on three SAT administrations. A variety of analyses were used in this study and found no evidence that the current SAT…

Descriptors: College Entrance Examinations, Thinking Skills, High School Students, Test Length

Issues of Candidate Perception in a Performance Test for Lawyers.

Download full text

Kunce, Charles S.; Arbet, Scott E. – 1994

The National Conference of Bar Examiners commissioned American College Testing, Inc., to help them in the development and evaluation of a performance test for use in bar admissions decisions. Because it was recognized that candidate perceptions would provide valuable information, a candidate-perception questionnaire was developed to be…

Descriptors: Attitudes, Demography, Languages, Lawyers

A Comparison of Two Item Selection Procedures for Building Criterion-Referenced Tests.

Download full text

Haladyna, Tom; Roid, Gale – 1981

Two approaches to criterion-referenced test construction are compared. Classical test theory is based on the practice of random sampling from a well-defined domain of test items; latent trait theory suggests that the difficulty of the items should be matched to the achievement level of the student. In addition to these two methods of test…

Descriptors: Criterion Referenced Tests, Error of Measurement, Latent Trait Theory, Test Construction

Test Length and Validity: An Application of Test Theory to a Finite World.

Myers, Charles T. – 1978

The viewpoint is expressed that adding to test reliability by either selecting a more homogeneous set of items, restricting the range of item difficulty as closely as possible to the most efficient level, or increasing the number of items will not add to test validity and that there is considerable danger that efforts to increase reliability may…

Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Test Construction

Application of Item Response Models to Criterion-Referenced Test Item Selection.

Peer reviewed

Hambleton, Ronald K.; De Gruijter, Dato N. M. – Journal of Educational Measurement, 1983

Addressing the shortcomings of classical item statistics for selecting criterion-referenced test items, this paper describes an optimal item selection procedure utilizing item response theory (IRT) and offers examples in which random selection and optimal item selection methods are compared. Theoretical advantages of optimal selection based upon…

Descriptors: Criterion Referenced Tests, Cutting Scores, Item Banks, Latent Trait Theory

Pretesting alongside an Operational CAT.

Download full text

Davey, Tim; Pommerich, Mary; Thompson, Tony D. – 1999

In computerized adaptive testing (CAT), new or experimental items are frequently administered alongside operational tests to gather the pretest data needed to replenish and replace item pools. The two basic strategies used to combine pretest and operational items are embedding and appending. Variable-length CATs are preferred because of the…

Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Measurement Techniques

A Comparison of the Performance of Simulated Hierarchical and Linear Testlets.

Peer reviewed

Wainer, Howard; And Others – Journal of Educational Measurement, 1992

Computer simulations were run to measure the relationship between testlet validity and factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Making a testlet adaptive yields only modest increases in aggregate validity because of the peakedness of the typical proficiency distribution. (Author/SLD)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation

A Comparison of an Expert Systems Approach to Computerized Adaptive Testing and an Item Response Theory Model.

Download full text

Frick, Theodore W. – 1991

Expert systems can be used to aid decisionmaking. A computerized adaptive test is one kind of expert system, although not commonly recognized as such. A new approach, termed EXSPRT, was devised that combines expert systems reasoning and sequential probability ratio test stopping rules. Two versions of EXSPRT were developed, one with random…

Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Expert Systems

A Method for Determining the Length of Criterion-Referenced Tests Using Reliability and Validity Indices.

Download full text

Mills, Craig N.; Simon, Robert – 1981

When criterion-referenced tests are used to assign examinees to states reflecting their performance level on a test, the better known methods for determining test length, which consider relationships among domain scores and errors of measurement, have their limitations. The purpose of this paper is to present a computer system named TESTLEN, which…

Descriptors: Computer Assisted Testing, Criterion Referenced Tests, Cutting Scores, Error of Measurement

Some Results on the Robustness of Latent Trait Models.

Download full text

Hambleton, Ronald K.; Cook, Linda L. – 1978

The purpose of the present research was to study, systematically, the "goodness-of-fit" of the one-, two-, and three-parameter logistic models. We studied, using computer-simulated test data, the effects of four variables: variation in item discrimination parameters, the average value of the pseudo-chance level parameters, test length,…

Descriptors: Career Development, Difficulty Level, Goodness of Fit, Item Analysis

Test Anxiety: A Major Educational Problem and What Can Be Done about It.

Peer reviewed

Hill, Kennedy T.; Wigfield, Allan – Elementary School Journal, 1984

Discusses the problem of and solution to anxiety in school testing situations. Focuses on Hill and his colleagues' long term program of research. Describes school intervention studies where new evaluation procedures and teaching programs have been developed to help students perform better in evaluative situations. (CB)

Descriptors: Elementary School Students, Elementary Secondary Education, Grades (Scholastic), Intervention

An Investigation of Proposed Revisions to Section 3 of the TOEFL Test. TOEFL Research Report 47.

Download full text

Schedl, Mary; And Others – 1995

The Test of English as a Foreign Language (TOEFL) program is exploring a change in Section 3 of the TOEFL test that would replace the vocabulary subpart with additional reading comprehension questions. This study investigated the proposed revision in terms of the length and timing that would be necessary to address concerns of test speededness of…

Descriptors: Adult Students, English (Second Language), Language Tests, Psychometrics

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10

Educational and Psychological…	21
Journal of Educational…	6
Applied Measurement in…	5
Applied Psychological…	5
Journal of Psychoeducational…	4
International Journal of…	3
ProQuest LLC	3
College Board	2
ETS Research Report Series	2
Education and Information…	2
Grantee Submission	2
Journal of Experimental…	2
Psychological Assessment	2
AERA Online Paper Repository	1
Assessment	1
British Journal of Guidance &…	1
British Journal of Learning…	1
Center on Children and…	1
Contemporary Education	1
Education Week	1
Educational Sciences: Theory…	1
Elementary School Journal	1
Evaluation in Education:…	1
Intelligence	1
International Educational…	1
More ▼

Hambleton, Ronald K.	12
Wainer, Howard	5
Reckase, Mark D.	4
Berk, Ronald A.	3
Wilcox, Rand R.	3
Sijtsma, Klaas	2
Thissen, David	2
Abrams, Matthew	1
Ang, Cheng	1
Anil, Duygu	1
Arbet, Scott E.	1
Arens, A. Katrin	1
Arthur, Winfred, Jr.	1
Bandalos, Deborah L.	1
Bao, Lei	1
Batinic, Bernad	1
Beltyukova, Svetlana A.	1
Benson, Jeri	1
Bergstrom, Betty	1
Boulton, Elicia	1
Boyd, Thomas A.	1
Bradshaw, Laine	1
Braun, Virginia	1
Brown, Anna	1
More ▼

Graduate Record Examinations	2
Program for International…	2
SAT (College Admission Test)	2
Test of English as a Foreign…	2
Academic Motivation Scale	1
Advanced Placement…	1
Bar Examinations	1
COMPASS (Computer Assisted…	1
California Achievement Tests	1
Draw a Person Test	1
Fennema Sherman Mathematics…	1
Minnesota Multiphasic…	1
Minnesota Teacher Attitude…	1
National Assessment of…	1
Preliminary Scholastic…	1
Profile of Mood States	1
Raven Advanced Progressive…	1
School and College Ability…	1
Self Description Questionnaire	1
Trends in International…	1
Wechsler Adult Intelligence…	1
Wechsler Intelligence Scale…	1
Wechsler Intelligence Scales…	1
More ▼