ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	1

Descriptor

Comparative Testing	4
Test Construction	4
Test Length	4
Ability	1
Abstract Reasoning	1
Adaptive Testing	1
Aptitude Tests	1
Cognitive Processes	1
College Entrance Examinations	1
College Mathematics	1
Computer Assisted Testing	1
Computer Simulation	1
Correlation	1
Critical Thinking	1
Educational Policy	1
Evaluation Criteria	1
Foreign Countries	1
Graduate Study	1
High Stakes Tests	1
Higher Education	1
Item Banks	1
Item Response Theory	1
Mathematical Models	1
Power (Statistics)	1
Psychometrics	1
More ▼

Source

Journal of Educational…	1
Research Matters	1

Author

Ang, Cheng	1
Miller, M. David	1
Tom Benton	1
Wainer, Howard	1
Wild, Cheryl L.	1

Publication Type

Journal Articles	2
Reports - Evaluative	2
Reports - Research	2
Speeches/Meeting Papers	1

Education Level

Audience

Location

Australia	1
Canada	1
China	1
Ireland	1
Japan	1
New Zealand	1
Poland	1
Singapore	1
South Korea	1
United Kingdom (England)	1
United States	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations

What Works Clearinghouse Rating

Showing all 4 results Save | Export

How Long Should a High Stakes Test Be?

Download full text

Tom Benton – Research Matters, 2024

Educational assessment is used throughout the world for a range of different formative and summative purposes. Wherever an assessment is developed, whether by a teacher creating a quiz for their class, or by a testing company creating a high stakes assessment, it is necessary to decide how long the test should be. Specifically, how many questions…

Descriptors: Foreign Countries, High Stakes Tests, Test Length, Test Construction

An Investigation of the Power of Stout's Test of Essential Unidimensionality.

Download full text

Ang, Cheng; Miller, M. David – 1993

The power of the procedure of W. Stout to detect deviations from essential unidimensionality in two-dimensional data was investigated for minor, moderate, and large deviations from unidimensionality using criteria for deviations from unidimensionality based on prior research. Test lengths of 20 and 40 items and sample sizes of 700 and 1,500 were…

Descriptors: Ability, Comparative Testing, Correlation, Item Response Theory

A Comparison of the Performance of Simulated Hierarchical and Linear Testlets.

Peer reviewed

Wainer, Howard; And Others – Journal of Educational Measurement, 1992

Computer simulations were run to measure the relationship between testlet validity and factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Making a testlet adaptive yields only modest increases in aggregate validity because of the peakedness of the typical proficiency distribution. (Author/SLD)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation

Summary of Research on Restructuring the Graduate Record Examinations Aptitude Test.

Download full text

Wild, Cheryl L. – 1979

Three sections of the Graduate Record Examinations (GRE) Aptitude Test were reviewed before the introduction of the restructured test in October, 1977: research on (1) the GRE-Verbal section; (2) the GRE-Quantitative section; and (3) a planned third section, measuring analytical thinking skills. Research in all three areas focused on test…

Descriptors: Abstract Reasoning, Aptitude Tests, Cognitive Processes, College Entrance Examinations