Publication Date
| In 2026 | 0 |
| Since 2025 | 62 |
| Since 2022 (last 5 years) | 388 |
| Since 2017 (last 10 years) | 831 |
| Since 2007 (last 20 years) | 1345 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 195 |
| Teachers | 161 |
| Researchers | 93 |
| Administrators | 50 |
| Students | 34 |
| Policymakers | 15 |
| Parents | 12 |
| Counselors | 2 |
| Community | 1 |
| Media Staff | 1 |
| Support Staff | 1 |
| More ▼ | |
Location
| Canada | 63 |
| Turkey | 59 |
| Germany | 41 |
| United Kingdom | 37 |
| Australia | 36 |
| Japan | 35 |
| China | 33 |
| United States | 32 |
| California | 25 |
| Iran | 25 |
| United Kingdom (England) | 25 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Liu, Jinghua; Low, Albert C. – ETS Research Report Series, 2007
This study applied kernel equating (KE) in two scenarios: equating to a very similar population and equating to a very different population, referred to as a distant population, using SAT® data. The KE results were compared to the results obtained from analogous classical equating methods in both scenarios. The results indicate that KE results are…
Descriptors: College Entrance Examinations, Equated Scores, Comparative Analysis, Evaluation Methods
Haynie, W. J., III – Journal of Technology Education, 2007
Commencing in 1985, a small body of experimental studies on the effects of test taking on delayed retention learning of technical subject matter has been completed in technology education settings. Much of the learning in technology education courses, especially the hands-on aspects, are best assessed via instruments and techniques other than…
Descriptors: Meta Analysis, Retention (Psychology), Technology Education, Tests
National Center for Education Statistics, 2007
The purpose of this document is to provide background information that will be useful in interpreting the 2007 results from the Trends in International Mathematics and Science Study (TIMSS) by comparing its design, features, framework, and items with those of the U.S. National Assessment of Educational Progress and another international assessment…
Descriptors: National Competency Tests, Comparative Analysis, Achievement Tests, Test Items
Deborah Elizabeth Fox – ProQuest LLC, 2007
The purpose of this study was the development and testing of a novel method for assessment of white blood cell (WBC) identification skills used in the field of Clinical Laboratory Sciences (CLS). A dual format exam was administered to both novices (students) and experts (laboratory professionals). Format 1 was similar to current assessment…
Descriptors: Evaluation, Evaluation Methods, Health Sciences, Metabolism
Hol, A. Michiel; Vorst, Harrie C. M.; Mellenbergh, Gideon J. – Applied Psychological Measurement, 2007
In a randomized experiment (n = 515), a computerized and a computerized adaptive test (CAT) are compared. The item pool consists of 24 polytomous motivation items. Although items are carefully selected, calibration data show that Samejima's graded response model did not fit the data optimally. A simulation study is done to assess possible…
Descriptors: Student Motivation, Simulation, Adaptive Testing, Computer Assisted Testing
Kim, Seonghoon; Kolen, Michael J. – Applied Measurement in Education, 2006
Four item response theory linking methods (2 moment methods and 2 characteristic curve methods) were compared to concurrent (CO) calibration with the focus on the degree of robustness to format effects (FEs) when applying the methods to multidimensional data that reflected the FEs associated with mixed-format tests. Based on the quantification of…
Descriptors: Item Response Theory, Robustness (Statistics), Test Format, Comparative Analysis
Singelis, Theodore M.; Yamada, Ann Marie; Barrio, Concepcion; Laney, Joshua Harrison; Her, Pa; Ruiz-Anaya, Alejandrina; Lennertz, Sara Terwilliger – Hispanic Journal of Behavioral Sciences, 2006
The metric equivalence of translated scales is often in question but seldom examined. This study presents test-retest data that support the metric equivalence of the Spanish and English language versions of three measures: the Bidimensional Acculturation Scale, the Satisfaction with Life Scale, and the Self-Construal Scale. Participants were…
Descriptors: Acculturation, Life Satisfaction, English, Test Format
McKinley, Robert L.; Way, Walter D. – 1992
An analysis of the skills necessary for performance on the Test of English as a Foreign Language (TOEFL) tends to support the view that there are important, although subtle, secondary dimensions present in the test. This research explored the feasibility of an item response theory (IRT) based method of modeling examinee performance on these…
Descriptors: Ability, Goodness of Fit, Identification, Item Response Theory
Eignor, Daniel R.; And Others – 1995
Two recent simulation studies were conducted to aid in the diagnosis and interpretation of equating differences found between random and matched (nonrandom) samples for four commonly used equating procedures: (1) Tucker; (2) Levine equally reliable; (3) Chained equipercentile observed-score; and (4) three-parameter, item response theory true-score…
Descriptors: Criteria, Equated Scores, Item Response Theory, Raw Scores
College Board, Washington, DC. Washington Office. – 1996
The National Assessment of Educational Progress (NAEP) is a congressionally mandated project of the U.S. Department of Education's National Center for Education Statistics. It assesses what students in the United States should know and be able to do in geography, reading, writing, mathematics, science, U.S. history, the arts, civics, and other…
Descriptors: Academic Achievement, Elementary Secondary Education, Evaluation, Mathematical Concepts
Hensley, Wayne E. – 1992
Two studies among U.S. college students (n=88 and n=329) examined the relationships between the order in which responses are offered on a questionnaire and the ranked importance of those responses. Study 1 included 36 males and 52 females, and Study 2 included 127 males and 202 females. Both studies found that approximately one-third (32 percent…
Descriptors: Classification, College Students, Higher Education, Questionnaires
Sireci, Stephen G.; Bastari, B. – 1998
In many cross-cultural research studies, assessment instruments are translated or adapted for use in multiple languages. However, it cannot be assumed that different language versions of an assessment are equivalent across languages. A fundamental issue to be addressed is the comparability or equivalence of the construct measured by each language…
Descriptors: Construct Validity, Cross Cultural Studies, Evaluation Methods, Multidimensional Scaling
Council of Chief State School Officers, Washington, DC. – 1992
The Reading Framework for the 1992 National Assessment of Educational Progress (NAEP) contains the rationale for the aspects of reading assessed in 1992 and criteria for development of the assessment. Developed through a national consensus process as a part of an effort to move assessment forward, the framework presented in the booklet is more…
Descriptors: Elementary Secondary Education, Literacy, Reading Skills, Reading Tests
Svinicki, Marilla; Koch, Bill – Innovation Abstracts, 1984
The decision of whether to use essay tests or multiple choice tests depends on several qualifiers related to the different characteristics of the tests and the needs of the situation. The most important qualifier involves matching the type of test to the instructional objectives being tested, with multiple choice tests being used to measure a…
Descriptors: Comparative Analysis, Essay Tests, Multiple Choice Tests, Test Format
Paul, Peter V.; And Others – 1990
A study was conducted to compare the results of three test formats in relation to assessing the depth of vocabulary knowledge of fourth-grade students. Students (117) were tested on their knowledge of the primary and secondary meanings of 44 multimeaning words using multiple-choice, yes/no, and interview formats. Information on the test-taking…
Descriptors: Definitions, Grade 4, Intermediate Grades, Reading Research

Peer reviewed
Direct link
