Publication Date
| In 2026 | 0 |
| Since 2025 | 52 |
| Since 2022 (last 5 years) | 194 |
| Since 2017 (last 10 years) | 494 |
| Since 2007 (last 20 years) | 742 |
Descriptor
| Test Items | 1186 |
| Test Reliability | 1186 |
| Test Validity | 684 |
| Test Construction | 565 |
| Foreign Countries | 348 |
| Difficulty Level | 279 |
| Item Analysis | 252 |
| Psychometrics | 233 |
| Item Response Theory | 219 |
| Factor Analysis | 183 |
| Multiple Choice Tests | 172 |
| More ▼ | |
Source
Author
| Schoen, Robert C. | 12 |
| LaVenia, Mark | 5 |
| Liu, Ou Lydia | 5 |
| Anderson, Daniel | 4 |
| Bauduin, Charity | 4 |
| DiLuzio, Geneva J. | 4 |
| Farina, Kristy | 4 |
| Haladyna, Thomas M. | 4 |
| Huck, Schuyler W. | 4 |
| Petscher, Yaacov | 4 |
| Stansfield, Charles W. | 4 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 39 |
| Researchers | 30 |
| Teachers | 24 |
| Administrators | 13 |
| Support Staff | 3 |
| Counselors | 2 |
| Students | 2 |
| Community | 1 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 68 |
| Indonesia | 37 |
| Germany | 20 |
| Canada | 17 |
| Florida | 17 |
| China | 16 |
| Australia | 15 |
| California | 12 |
| Iran | 11 |
| India | 10 |
| New York | 9 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Myers, Charles T. – 1978
The viewpoint is expressed that adding to test reliability by either selecting a more homogeneous set of items, restricting the range of item difficulty as closely as possible to the most efficient level, or increasing the number of items will not add to test validity and that there is considerable danger that efforts to increase reliability may…
Descriptors: Achievement Tests, Item Analysis, Multiple Choice Tests, Test Construction
Karmos, Joseph S.; Karmos, Ann H. – 1980
The attitude of students toward taking achievement tests, as well as the relationship between this attitude and student performance on achievement tests, was studied in three southern Illinois schools. Students in grades six through nine were administered the 1973 Stanford Achievement Tests, followed by an attitudinal test developed for the study…
Descriptors: Achievement Tests, Attitude Measures, Correlation, Elementary Secondary Education
Saunders, Joseph C.; Huynh, Huynh – 1980
In most reliability studies, the precision of a reliability estimate varies inversely with the number of examinees (sample size). Thus, to achieve a given level of accuracy, some minimum sample size is required. An approximation for this minimum size may be made if some reasonable assumptions regarding the mean and standard deviation of the test…
Descriptors: Cutting Scores, Difficulty Level, Error of Measurement, Mastery Tests
Peer reviewedEbel, Robert L. – Educational and Psychological Measurement, 1978
A multiple true-false item is one where a testee has to identify statements as true or false within a cluster (of two or more) of such statements. Clusters are then scored as items. This study showed such a procedure to yield less reliable results than traditional true-false items. (JKS)
Descriptors: Guessing (Tests), Higher Education, Item Analysis, Multiple Choice Tests
Schuessler, Karl; And Others – Social Psychology, 1978
The feasibility of measuring responding desirably with attitude-opinion items is discussed, and an index based on 16 such items is presented. Estimates of reliability and validity for this index, and examples of its use as a covariate (control) in attitude research are presented. Similarities and differences from related scales are discussed.…
Descriptors: Adults, Attitude Measures, Measurement Techniques, Response Style (Tests)
Peer reviewedKifer, Edward – Journal of Youth and Adolescence, 1977
An heuristic method for constructing affective tests emphasizes a set of key questions which are used to define an affective construct, decide on a measurement procedure, and develop a framework on which to base items. Affective tests should be based upon prior specifications of constructs, and then manipulated empirically. (Author/MV)
Descriptors: Affective Behavior, Affective Measures, Affective Objectives, Elementary Secondary Education
Peer reviewedFrary, Robert B. – Journal of Educational Measurement, 1985
Responses to a sample test were simulated for examinees under free-response and multiple-choice formats. Test score sets were correlated with randomly generated sets of unit-normal measures. The extent of superiority of free response tests was sufficiently small so that other considerations might justifiably dictate format choice. (Author/DWH)
Descriptors: Comparative Analysis, Computer Simulation, Essay Tests, Guessing (Tests)
Peer reviewedAngoff, William H.; Schrader, William B. – Journal of Educational Measurement, 1984
The reported data provide a basis for evaluating the formula-scoring versus rights-scoring issue and for assessing the effects of directions on the reliability and parallelism of scores for sophisticated examinees taking professionally developed tests. Results support the invariance hypothesis rather than the differential effects hypothesis.…
Descriptors: College Entrance Examinations, Guessing (Tests), Higher Education, Hypothesis Testing
Peer reviewedBaldauf, Richard B., Jr. – Educational and Psychological Measurement, 1982
A Monte Carlo design examined how the effects of guessing and item dependence influence test characteristics and student scores. Although validity for cloze variants was high, multiple-choice cloze had significantly lower reliabilities than did true score equivalents. (Author/PN)
Descriptors: Cloze Procedure, Elementary Education, Guessing (Tests), Reading Comprehension
Peer reviewedKeogh, Barbara K.; And Others – Journal of Educational Measurement, 1982
This paper assesses the reliabilities of the items and dimensions, determines the extent of agreement among raters, identifies the factor structure, and assesses the influence of sex and age of children on raters' perceptions on a short form of the Teacher Temperament Questionnaire. (Author/PN)
Descriptors: Early Childhood Education, Factor Structure, Individual Differences, Measures (Individuals)
Peer reviewedNimmer, Donald N. – Clearing House, 1983
Outlines the benefits associated with true-false and multiple-choice tests and sets forth rules for writing effective items for such tests. (FL)
Descriptors: Elementary Secondary Education, Evaluation Methods, Multiple Choice Tests, Objective Tests
Schnipke, Deborah L. – 1995
Time limits on tests often prevent some examinees from finishing all of the items on the test; the extent of this effect has been called the "speededness" of the test. Traditional speededness indices focus on the number of unreached items. Other examinees in the same situation rapidly fill in answers in the hope of getting some of the…
Descriptors: Computer Assisted Testing, Educational Assessment, Evaluation Methods, Guessing (Tests)
Wasem, Jim – 1993
"Pickleball" is a new racquet sport which is one of the fastest growing educational activities in the Northwest. This paper describes the development of a test battery designed to measure students' pickleball skills for purposes of classification; to determine improvement of playing skills; and to aid in grading of individual…
Descriptors: Higher Education, Physical Education, Preservice Teacher Education, Racquet Sports
Perkins, Kyle; And Others – 1994
This paper reports the results of using a three-layer backpropagation artificial neural network to predict item difficulty in a reading comprehension test. Two network structures were developed, one with and one without a sigmoid function in the output processing unit. The data set, which consisted of a table of coded test items and corresponding…
Descriptors: Artificial Intelligence, Computer Assisted Testing, Expert Systems, Item Analysis
Herman, Joan – 1984
Diagnostic testing can provide specific information about student skills as a decision-making aid to teachers in prescribing instruction, identifying needs for remediation, determining effective instructional materials and methods, and ultimately, improving student learning. Diagnostic testing, as viewed here, includes individual and group…
Descriptors: Diagnostic Tests, Elementary Secondary Education, Skill Analysis, Student Evaluation


