Publication Date
| In 2026 | 0 |
| Since 2025 | 62 |
| Since 2022 (last 5 years) | 388 |
| Since 2017 (last 10 years) | 831 |
| Since 2007 (last 20 years) | 1345 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 195 |
| Teachers | 161 |
| Researchers | 93 |
| Administrators | 50 |
| Students | 34 |
| Policymakers | 15 |
| Parents | 12 |
| Counselors | 2 |
| Community | 1 |
| Media Staff | 1 |
| Support Staff | 1 |
| More ▼ | |
Location
| Canada | 63 |
| Turkey | 59 |
| Germany | 41 |
| United Kingdom | 37 |
| Australia | 36 |
| Japan | 35 |
| China | 33 |
| United States | 32 |
| California | 25 |
| Iran | 25 |
| United Kingdom (England) | 25 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
van der Linden, Wim J.; Vos, Hans J.; Chang, Lei – 2000
In judgmental standard setting experiments, it may be difficult to specify subjective probabilities that adequately take the properties of the items into account. As a result, these probabilities are not consistent with each other in the sense that they do not refer to the same borderline level of performance. Methods to check standard setting…
Descriptors: Interrater Reliability, Judges, Probability, Standard Setting
Peer reviewedTorabi-Parizi, Rosa; Campbell, Noma Jo – Elementary School Journal, 1982
Investigates the effects of varying the placement of blanks and the number of options available in multiple-choice items on the reliability of fifth-grade students' scores. Results indicate that scores on three-choice item tests were not less reliable than scores on four-choice item tests. A similar finding was found regarding the placement of…
Descriptors: Elementary Education, Elementary School Students, Scores, Test Format
Peer reviewedSilverstein, A. B. – Journal of Consulting and Clinical Psychology, 1982
Proposes Vocabulary and Block Design as a two-subtest short form of the Wechsler Adult Intelligence Scale-Revised; the addition of Arithmetic and Picture Arrangement provides a four-subtest short form of the scale. Presents tables giving Full Scale IQs for each of nine age groups for both short forms. (Author)
Descriptors: Age Differences, Intelligence Quotient, Intelligence Tests, Tables (Data)
Peer reviewedSchriesheim, Chester A.; Denisi, Angelo S. – Educational and Psychological Measurement, 1980
Two types of questionnaire formats' measuring leadership variables were examined: one with items measuring the same dimensions grouped together and the second with items measuring the same dimensions distributed randomly. The random condition showed superior convergent and discriminant validity, as assessed by multitrait-multimethod and analysis…
Descriptors: Adults, Leadership Qualities, Personality Measures, Questionnaires
Peer reviewedWang, Tianyou; Kolen, Michael J. – Applied Psychological Measurement, 1996
A quadratic curve test equating method for equating different test forms under a random-groups data collection design is proposed that equates the first three central moments of the test forms. When applied to real test data, the method performs as well as other equating methods. Procedures from implementing the test are described. (SLD)
Descriptors: Data Collection, Equated Scores, Standardized Tests, Test Construction
Peer reviewedMaggi, Stefania – International Journal of Testing, 2001
Developed an Italian version of the Self-Description Questionnaire (SDQ-III) and studied the reliability and factorial validity of this translated instrument. Results show that the translated version has psychometric properties similar to those of the original English version. (SLD)
Descriptors: Factor Structure, Foreign Countries, Psychometrics, Reliability
Peer reviewedToppino, Thomas C.; Brochin, H. Ann – Journal of Educational Research, 1989
Study findings indicate that exposure to a statement on a true-false test increased college students' (N=64) tendency to believe the statement was true, regardless of whether the statement actually was true or false. In contrast to previous research, these findings support existence of a negative suggestion effect for true-false exams. (IAH)
Descriptors: Higher Education, Learning Processes, Objective Tests, Test Format
Peer reviewedWallace, Randall R.; And Others – Reading Improvement, 1995
Finds no evidence that the standard procedure for administering a spelling test (in which the examiner pronounces the word, uses it in a sentence, then pronounces it again) is any less effective than those utilizing additional visualization and vocalization components. (RS)
Descriptors: Grade 3, Primary Education, Spelling, Spelling Instruction
Peer reviewedBors, Douglas A.; Stokes, Tonya L. – Educational and Psychological Measurement, 1998
First-year college students (n=506) completed Raven's Advanced Progressive Matrices (J. Raven, J. Court, and J. Raven, 1988). Data were used to contribute to the normative database for American college students. A short form developed from 12 items of the original 36 was found to possess acceptable psychometric properties. (SLD)
Descriptors: College Freshmen, Higher Education, Norms, Psychometrics
Peer reviewedArmstrong, Ronald D.; Jones, Douglas H.; Kunce, Charles S. – Applied Psychological Measurement, 1998
Investigated the use of mathematical programming techniques to generate parallel test forms with passages and items based on item-response theory (IRT) using the Fundamentals of Engineering Examination. Generated four parallel test forms from the item bank of almost 1,100 items. Comparison with human-generated forms supports the mathematical…
Descriptors: Engineering, Item Banks, Item Response Theory, Test Construction
Peer reviewedPershing, James A.; Pershing, Jana L. – Human Resource Development Quarterly, 2001
Question dimensions, construction, and response formats of 50 reactionnaire forms completed by participants in medical school programs were analyzed. Numerous problems in 30 forms and shortcomings in 20 others were identified. Ways to improve layout, appearance, anonymity protection, and questions were suggested. (Contains 53 references.) (SK)
Descriptors: Attitude Measures, Evaluation Problems, Privacy, Surveys
Peer reviewedKobayashi, Miyoko – Language Testing, 2002
Investigates the effects of text organization and response format on second language learners' performance on reading comprehension tests. Analyzes the results of reading comprehension tests that were delivered to Japanese University students. Found that text organization and test format had a significant impact on students' performance.…
Descriptors: College Students, Language Tests, Second Language Learning, Test Format
Peer reviewedAnderson, Gary L. – Educational Leadership, 2002
Argues that the School Leaders Licensure Assessment required for administrator certification in several states promotes a narrow, mainstream concept of instructional leadership. (PKP)
Descriptors: Criticism, Elementary Secondary Education, Instructional Leadership, National Standards
Peer reviewedLaukkanen, Eila; Halonen, Pirjo; Viinamaki, Heimo – Journal of Youth and Adolescence, 1999
Studied the reliability of a translation of the Offer Self-Image Questionnaire (OSIQ) (D. Offer and others, 1984) with 268 Finnish 13-year-olds, 83 of whom were tested a second time. Results support the reliability of the OSIQ for only four subscales. (SLD)
Descriptors: Adolescents, Finnish, Foreign Countries, Reliability
Peer reviewedPommerich, Mary; Nicewander, W. Alan; Hanson, Bradley A. – Journal of Educational Measurement, 1999
Studied whether a group's average percent correct in a content domain could be accurately estimated for groups taking a single test form and not the entire domain of items. Evaluated six Item Response Theory-based domain score estimation methods through simulation and concluded they performed better than observed score on the form taken. (SLD)
Descriptors: Estimation (Mathematics), Groups, Item Response Theory, Scores


