Publication Date
In 2025 | 34 |
Since 2024 | 128 |
Since 2021 (last 5 years) | 467 |
Since 2016 (last 10 years) | 873 |
Since 2006 (last 20 years) | 1353 |
Audience
Practitioners | 195 |
Teachers | 159 |
Researchers | 92 |
Administrators | 49 |
Students | 34 |
Policymakers | 14 |
Parents | 12 |
Counselors | 2 |
Community | 1 |
Media Staff | 1 |
Support Staff | 1 |
Location
Canada | 62 |
Turkey | 59 |
Germany | 40 |
United Kingdom | 36 |
Australia | 35 |
Japan | 35 |
China | 32 |
United States | 32 |
California | 25 |
United Kingdom (England) | 25 |
Netherlands | 24 |
Liu, Ou Lydia; Wilson, Mark – Applied Measurement in Education, 2009
Many efforts have been made to determine and explain differential gender performance on large-scale mathematics assessments. A well-agreed-on conclusion is that gender differences are contextualized and vary across math domains. This study investigated the pattern of gender differences by item domain (e.g., Space and Shape, Quantity) and item type…
Descriptors: Gender Differences, Mathematics Tests, Measurement, Test Format
Kalaycioglu, Dilara Bakan; Berberoglu, Giray – Journal of Psychoeducational Assessment, 2011
This study aimed to detect differential item functioning (DIF) items across gender groups, analyze item content for possible sources of DIF, and investigate the effect of DIF items on the criterion-related validity of the test scores in the quantitative section of the university entrance examination (UEE) in Turkey. The reason…
Descriptors: Test Bias, College Entrance Examinations, Item Analysis, Test Items
Olinghouse, Natalie G.; Colwell, Ryan P. – Intervention in School and Clinic, 2013
This article provides recommendations for teachers to better prepare 3rd through 12th grade students with learning disabilities for large-scale writing assessments. The variation across large-scale writing assessments and the multiple needs of struggling writers indicate the need for test preparation to be embedded within a comprehensive,…
Descriptors: Learning Disabilities, Elementary Secondary Education, Writing Evaluation, Test Wiseness
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Brese, Falk, Ed. – International Association for the Evaluation of Educational Achievement, 2012
The goal for selecting the released set of test items was to have approximately 25% of each of the full item sets for mathematics content knowledge (MCK) and mathematics pedagogical content knowledge (MPCK) that would represent the full range of difficulty, content, and item format used in the TEDS-M study. The initial step in the selection was to…
Descriptors: Preservice Teacher Education, Elementary School Teachers, Secondary School Teachers, Mathematics Teachers
Pae, Tae-Il – Language Testing, 2012
This study tracked gender differential item functioning (DIF) on the English subtest of the Korean College Scholastic Aptitude Test (KCSAT) over a nine-year period across three data points, using both the Mantel-Haenszel (MH) and item response theory likelihood ratio (IRT-LR) procedures. Further, the study identified two factors (i.e. reading…
Descriptors: Aptitude Tests, Academic Aptitude, Language Tests, Test Items
Ventouras, Errikos; Triantis, Dimos; Tsiakas, Panagiotis; Stergiopoulos, Charalampos – Computers & Education, 2011
The aim of the present research was to compare the use of multiple-choice questions (MCQs) as an examination method against the oral examination (OE) method. MCQs are widely used and their importance seems likely to grow, due to their inherent suitability for electronic assessment. However, MCQs are influenced by the tendency of examinees to guess…
Descriptors: Grades (Scholastic), Scoring, Multiple Choice Tests, Test Format
Pagnano-Richardson, Karen; Henninger, Mary L. – Strategies: A Journal for Physical and Sport Educators, 2010
The cognitive domain in physical education is an important component of quality physical education as demonstrated through Standard 2 of the National Standards for Physical Education which state that the physically educated person "demonstrates understanding of movement concepts, principles, strategies, and tactics as they apply to learning and…
Descriptors: Physical Education, Play, Team Sports, Physical Activities
Kim, Jungtae; Craig, Daniel A. – Computer Assisted Language Learning, 2012
Videoconferencing offers new opportunities for language testers to assess speaking ability in low-stakes diagnostic tests. To be considered a trusted testing tool in language testing, a test should be examined employing appropriate validation processes [Chapelle, C.A., Jamieson, J., & Hegelheimer, V. (2003). "Validation of a web-based ESL…
Descriptors: Speech Communication, Testing, Language Tests, Construct Validity
Cho, Hyun-Jeong; Lee, Jaehoon; Kingston, Neal – Applied Measurement in Education, 2012
This study examined the validity of test accommodation for third- through eighth-graders using differential item functioning (DIF) and mixture IRT models. Two data sets were used for these analyses. With the first data set (N = 51,591) we examined whether item type (i.e., story, explanation, straightforward) or item features were associated with item…
Descriptors: Testing Accommodations, Test Bias, Item Response Theory, Validity
Evers, Arne – International Journal of Testing, 2012
In this article, the characteristics of five test review models are described. The five models are the US review system at the Buros Center for Testing, the German Test Review System of the Committee on Tests, the Brazilian System for the Evaluation of Psychological Tests, the European EFPA Review Model, and the Dutch COTAN Evaluation System for…
Descriptors: Program Evaluation, Test Reviews, Trend Analysis, International Education
Eaton, Danice K.; Brener, Nancy D.; Kann, Laura; Roberts, Alice M.; Kyle, Tonja M.; Flint, Katherine H.; Ross, Alexander L. R. – Journal of School Health, 2011
Background: School-based risk behavior surveys traditionally have been administered via paper-and-pencil. This study assessed the feasibility of conducting in-class online surveys in US high schools. Methods: A paper-and-pencil questionnaire assessing computer availability and perceptions of online surveys was mailed to a nationally representative…
Descriptors: High Schools, Intervals, Principals, Mail Surveys
Yun, Jeehwan – Computer Assisted Language Learning, 2011
This study examines (1) the effects of hypertext gloss use on L2 vocabulary acquisition in computerized reading contexts; (2) which specific combination of either text-only (single) or text + visual (multiple) hypertext glosses is more effective on L2 vocabulary acquisition; and (3) potential moderators to systematically account for between study…
Descriptors: Hypermedia, Effect Size, Vocabulary Development, Second Language Learning
Scarpati, Stanley E.; Wells, Craig S.; Lewis, Christine; Jirka, Stephen – Journal of Special Education, 2011
The purpose of this study was to use differential item functioning (DIF) and latent mixture model analyses to explore factors that explain performance differences on a large-scale mathematics assessment between examinees allowed to use a calculator or who were afforded item presentation accommodations versus those who did not receive the same…
Descriptors: Testing Accommodations, Test Items, Test Format, Validity
van Buijsen, Marit; Hendriks, Angelique; Ketelaars, Mieke; Verhoeven, Ludo – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011
Children with communication disorders have problems with both language and social interaction. The theory-of-mind hypothesis provides an explanation for these problems, and different tests have been developed to test this hypothesis. However, different modes of presentation are used in these tasks, which make the results difficult to compare. In…
Descriptors: Autism, Language Impairments, Communication Disorders, Interpersonal Relationship