Publication Date
| In 2026 | 34 |
| Since 2025 | 2433 |
| Since 2022 (last 5 years) | 12948 |
| Since 2017 (last 10 years) | 34073 |
| Since 2007 (last 20 years) | 68564 |
Descriptor
| Foreign Countries | 30631 |
| Test Validity | 21786 |
| Scores | 18282 |
| Academic Achievement | 16944 |
| Test Construction | 16779 |
| Test Reliability | 15055 |
| Achievement Tests | 14883 |
| Standardized Tests | 14734 |
| Comparative Analysis | 14432 |
| Elementary Secondary Education | 13052 |
| Language Tests | 12558 |
Audience
| Practitioners | 5034 |
| Teachers | 3394 |
| Researchers | 2630 |
| Policymakers | 1232 |
| Administrators | 979 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
Location
| Turkey | 2831 |
| Australia | 2433 |
| Canada | 2273 |
| California | 1857 |
| United States | 1729 |
| Texas | 1618 |
| China | 1583 |
| United Kingdom | 1316 |
| Florida | 1312 |
| United Kingdom (England) | 1205 |
| Germany | 1125 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Forthmann, Boris; Paek, Sue Hyeon; Dumas, Denis; Barbot, Baptiste; Holling, Heinz – British Journal of Educational Psychology, 2020
Background: The originality of divergent thinking (DT) production is one of the most critical indicators of creative potential. It is commonly scored using the statistical infrequency of responses relative to all responses provided in a given sample. Aims: Response frequency estimates vary in terms of measurement precision. This issue has been…
Descriptors: Creative Thinking, Creativity Tests, Item Response Theory, Scores
Monfort-Pañego, Manuel; Miñana-Signes, Vicente – Measurement in Physical Education and Exercise Science, 2020
The purpose of this study was to develop a questionnaire to assess body posture habits in adolescents' daily activities. To develop and assess the instrument we used the Delphi method, and a test-retest reliability design. The questionnaire consisted of 31 questions with 4-level Likert scale. One hundred and sixty-eight students were studied, 72…
Descriptors: Psychometrics, Content Validity, Questionnaires, Human Body
Hepperlen, Renee A.; Rabaey, Paula; Hearst, Mary O. – Journal of Applied Research in Intellectual Disabilities, 2020
Background: Families of children with disabilities often face unique challenges. Developed in a U.S. context, the Beach Center Family Quality of Life measure assesses the effectiveness of supports and services that families receive. This study examines whether items from three sub-scales of the Beach Center instrument perform similarly for two…
Descriptors: Cross Cultural Studies, Test Validity, Family Life, Quality of Life
Cole, Brian S.; Lima-Walton, Elia; Brunnert, Kim; Vesey, Winona Burt; Raha, Kaushik – Journal of Applied Testing Technology, 2020
Automatic item generation can rapidly generate large volumes of exam items, but this creates challenges for assembly of exams which aim to include syntactically diverse items. First, we demonstrate a diminishing marginal syntactic return for automatic item generation using a saturation detection approach. This analysis can help users of automatic…
Descriptors: Artificial Intelligence, Automation, Test Construction, Test Items
Krach, Shelley Kathleen; McCreery, Michael P.; Dennis, Lindsay; Guerard, Jessika; Harris, Erica L. – Psychology in the Schools, 2020
Pearson now uses a technology-based testing platform, Q-Interactive, to administer tests previously available in paper versions. The same norms are used for both versions; Pearson's in-house equivalency studies indicated that both versions are equated. The goal of the current study is to independently evaluate equivalency findings. For the current…
Descriptors: Preschool Children, Computer Assisted Testing, Test Items, Scores
Aksoy, Bülent; Namal, Remzi – International Education Studies, 2020
The aim of this study is to determine the scope validity of the graph drawing interpretation skill checklist that can be used in social studies teaching. The literature review was made while the draft skill checklist was prepared at first. There have generally been some graph drawing and interpretation skill checklist studies at secondary and…
Descriptors: Check Lists, Test Validity, Graphs, Interpretive Skills
Deng, Ruiqi; Benckendorff, Pierre; Gannaway, Deanne – British Journal of Educational Technology, 2020
The purpose of this study is to contribute to a better understanding of the complexity of conceptualising and measuring learner engagement in Massive Open Online Courses (MOOCs). The paper develops and validates a MOOC engagement scale (MES) to measure learner engagement. The initial questionnaire items of the scale were developed by reviewing…
Descriptors: Online Courses, Learner Engagement, Test Construction, Test Validity
Ismail, Yilmaz – Educational Research and Reviews, 2020
This study draws on the understanding that when the correlation between variables is not known yet the non-linear expectation in the correlation between the variables is present, non-linear measurement tools can be used. In education, possibility measurement tools can be used for non-linear measurement. Multiple-choice possibility measurement…
Descriptors: Multiple Choice Tests, Measurement Techniques, Student Evaluation, Test Items
Hill, Laura G. – International Journal of Behavioral Development, 2020
Retrospective pretests ask respondents to report after an intervention on their aptitudes, knowledge, or beliefs before the intervention. A primary reason to administer a retrospective pretest is that in some situations, program participants may over the course of an intervention revise or recalibrate their prior understanding of program content,…
Descriptors: Pretesting, Response Style (Tests), Bias, Testing Problems
Umucu, Emre; Wu, Jia-Rung; Sanchez, Jennifer; Brooks, Jessica M.; Chiu, Chung-Yi; Tu, Wei-Mo; Chan, Fong – Journal of American College Health, 2020
Objective: The current study aims to validate the PERMA-Profiler, a well-known well-being measure, among a sample of student veterans. Participants: A sample of 205 student veterans were recruited from universities across the United States. Method: Cross-sectional research design was used in this study. Measurement structure of the PERMA-Profiler…
Descriptors: Test Validity, Measures (Individuals), Well Being, Veterans
Kerub, Orly; Haas, Eric J.; Meiri, Gal; Davidovitch, Nadav; Menashe, Idan – Journal of Autism and Developmental Disorders, 2020
Systematic screening of autism spectrum disorder (ASD) can improve early diagnosis of ASD. We compared the efficacy of two ASD screening methods, the Global Developmental Screening (GDS), and the Modified Checklist for Autism in Toddlers-Revised, with Follow-Up (M-CHAT/F) in 1591 toddlers of ages 18-36 months from 35 government-funded clinics in…
Descriptors: Foreign Countries, Screening Tests, Autism, Pervasive Developmental Disorders
Hurley, Annette; Willis, Michelle; Guidry, Megan; Bode, Dan; Corneille, Marissa L.; Mills, Sara – Language, Speech, and Hearing Services in Schools, 2020
Purpose: The purpose of this study was to review quality benchmarks from hearing screening programs conducted at local Head Start centers and preschool and elementary schools associated with our university training programs. Method: Hearing screening results from 6,043 children were reviewed. Hearing screening was accomplished using either…
Descriptors: Auditory Tests, Screening Tests, Preschools, Elementary Schools
Derek Sauder – ProQuest LLC, 2020
The Rasch model is commonly used to calibrate multiple choice items. However, the sample sizes needed to estimate the Rasch model can be difficult to attain (e.g., consider a small testing company trying to pretest new items). With small sample sizes, auxiliary information besides the item responses may improve estimation of the item parameters.…
Descriptors: Item Response Theory, Sample Size, Computation, Test Length
Mark Chapman; Meg Montee; Yangting Wang; Gordon Blaine West; Jason A. Kemp; Ahyoung Alicia Kim – Language Testing, 2026
This paper reports the results of a study designed to explore the relationships between speaking test task variables and linguistic features of spoken responses on a speaking assessment for Grade 7 multilingual English learners (age 12-13) in U.S. public schools. Speaking task responses from 30 high-proficiency test takers were transcribed and…
Descriptors: English Learners, Language Tests, Language Proficiency, Language Fluency
Travis Lemon; Karen Taylor – Mathematics Teacher: Learning and Teaching PK-12, 2026
Collaboration, communication, creativity, and critical thinking are skills that are fundamentally important to student success in today's world. They are often referred to as the 4Cs and are found among other lists of important skills, practices, and characteristics that are desirable for students to learn and incorporate into their lives. In…
Descriptors: Mathematics Education, Secondary School Mathematics, Grade 9, Cooperation

