Peer reviewed: Wang, Tianyou; Kolen, Michael J. – Applied Psychological Measurement, 1996
A quadratic curve test equating method for equating different test forms under a random-groups data collection design is proposed that equates the first three central moments of the test forms. When applied to real test data, the method performs as well as other equating methods. Procedures for implementing the method are described. (SLD)
Descriptors: Data Collection, Equated Scores, Standardized Tests, Test Construction
Peer reviewed: Maggi, Stefania – International Journal of Testing, 2001
Developed an Italian version of the Self-Description Questionnaire (SDQ-III) and studied the reliability and factorial validity of this translated instrument. Results show that the translated version has psychometric properties similar to those of the original English version. (SLD)
Descriptors: Factor Structure, Foreign Countries, Psychometrics, Reliability
Peer reviewed: Toppino, Thomas C.; Brochin, H. Ann – Journal of Educational Research, 1989
Study findings indicate that exposure to a statement on a true-false test increased college students' (N=64) tendency to believe the statement was true, regardless of whether the statement actually was true or false. In contrast to previous research, these findings support the existence of a negative suggestion effect for true-false exams. (IAH)
Descriptors: Higher Education, Learning Processes, Objective Tests, Test Format
Peer reviewed: Wallace, Randall R.; And Others – Reading Improvement, 1995
Finds no evidence that the standard procedure for administering a spelling test (in which the examiner pronounces the word, uses it in a sentence, then pronounces it again) is any less effective than those utilizing additional visualization and vocalization components. (RS)
Descriptors: Grade 3, Primary Education, Spelling, Spelling Instruction
Peer reviewed: Bors, Douglas A.; Stokes, Tonya L. – Educational and Psychological Measurement, 1998
First-year college students (n=506) completed Raven's Advanced Progressive Matrices (J. Raven, J. Court, and J. Raven, 1988). Data were used to contribute to the normative database for American college students. A short form developed from 12 items of the original 36 was found to possess acceptable psychometric properties. (SLD)
Descriptors: College Freshmen, Higher Education, Norms, Psychometrics
Peer reviewed: Armstrong, Ronald D.; Jones, Douglas H.; Kunce, Charles S. – Applied Psychological Measurement, 1998
Investigated the use of mathematical programming techniques to generate parallel test forms with passages and items based on item-response theory (IRT) using the Fundamentals of Engineering Examination. Generated four parallel test forms from the item bank of almost 1,100 items. Comparison with human-generated forms supports the mathematical…
Descriptors: Engineering, Item Banks, Item Response Theory, Test Construction
Peer reviewed: Pershing, James A.; Pershing, Jana L. – Human Resource Development Quarterly, 2001
Question dimensions, construction, and response formats of 50 reactionnaire forms completed by participants in medical school programs were analyzed. Numerous problems in 30 forms and shortcomings in 20 others were identified. Ways to improve layout, appearance, anonymity protection, and questions were suggested. (Contains 53 references.) (SK)
Descriptors: Attitude Measures, Evaluation Problems, Privacy, Surveys
Peer reviewed: Kobayashi, Miyoko – Language Testing, 2002
Investigates the effects of text organization and response format on second language learners' performance on reading comprehension tests. Analyzes the results of reading comprehension tests that were administered to Japanese university students. Found that text organization and test format had a significant impact on students' performance.…
Descriptors: College Students, Language Tests, Second Language Learning, Test Format
Peer reviewed: Anderson, Gary L. – Educational Leadership, 2002
Argues that the School Leaders Licensure Assessment required for administrator certification in several states promotes a narrow, mainstream concept of instructional leadership. (PKP)
Descriptors: Criticism, Elementary Secondary Education, Instructional Leadership, National Standards
Peer reviewed: Laukkanen, Eila; Halonen, Pirjo; Viinamaki, Heimo – Journal of Youth and Adolescence, 1999
Studied the reliability of a translation of the Offer Self-Image Questionnaire (OSIQ) (D. Offer and others, 1984) with 268 Finnish 13-year-olds, 83 of whom were tested a second time. Results support the reliability of the OSIQ for only four subscales. (SLD)
Descriptors: Adolescents, Finnish, Foreign Countries, Reliability
Peer reviewed: Pommerich, Mary; Nicewander, W. Alan; Hanson, Bradley A. – Journal of Educational Measurement, 1999
Studied whether a group's average percent correct in a content domain could be accurately estimated for groups taking a single test form and not the entire domain of items. Evaluated six Item Response Theory-based domain score estimation methods through simulation and concluded they performed better than observed score on the form taken. (SLD)
Descriptors: Estimation (Mathematics), Groups, Item Response Theory, Scores
Abedi, Jamal; Leon, Seth; Kao, Jenny C. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2008
This study examines performance differences between students with disabilities and students without disabilities using differential item functioning (DIF) analyses in a high-stakes reading assessment. Results indicated that for Grade 9, many items exhibited DIF. Items that exhibited DIF were more likely to be located in the second half…
Descriptors: Test Bias, Test Items, Student Evaluation, Disabilities
Ministerial Council for Education, Early Childhood Development and Youth Affairs (NJ1), 2008
The information and assessment materials in these resources have been designed to assist teachers to gauge their own students' proficiency in Information and Communication Technologies (ICT) literacy. By examining modules from the National Year 6 and Year 10 ICT Literacy Assessment teachers may be able to design similar tasks and to judge their…
Descriptors: Foreign Countries, National Programs, Testing Programs, National Competency Tests
Liu, Kimy; Ketterlin-Geller, Leanne R.; Yovanoff, Paul; Tindal, Gerald – Behavioral Research and Teaching, 2008
BRT Math Screening Measures focus on students' mathematics performance in grade-level standards for students in grades 1-8. A total of 24 test forms are available with three test forms per grade corresponding to fall, winter, and spring testing periods. Each form contains computation problems and application problems. BRT Math Screening Measures…
Descriptors: Test Items, Test Format, Test Construction, Item Response Theory
Liu, Kristin K.; Anderson, Michael – Assessment for Effective Intervention, 2008
This article studies the application of accessible assessment design to large-scale English language proficiency assessments that are now mandatory for elementary and secondary English language learners in public schools. Using a modified Delphi approach, a panel of 33 experts from the areas of assessment, English as a second language or bilingual education, and…
Descriptors: Delphi Technique, Test Items, Sign Language, Bilingual Education

