Publication Date
In 2025 | 39 |
Since 2024 | 162 |
Since 2021 (last 5 years) | 585 |
Since 2016 (last 10 years) | 1221 |
Since 2006 (last 20 years) | 2727 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 169 |
Practitioners | 49 |
Teachers | 32 |
Administrators | 8 |
Policymakers | 8 |
Counselors | 4 |
Students | 4 |
Media Staff | 1 |
Location
Turkey | 172 |
Australia | 81 |
Canada | 79 |
China | 69 |
United States | 55 |
Germany | 43 |
Taiwan | 43 |
Japan | 40 |
United Kingdom | 38 |
Iran | 36 |
Spain | 33 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Does not meet standards | 1 |
Joshua B. Gilbert; Luke W. Miratrix; Mridul Joshi; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2024
Analyzing heterogeneous treatment effects (HTE) plays a crucial role in understanding the impacts of educational interventions. A standard practice for HTE analysis is to examine interactions between treatment status and pre-intervention participant characteristics, such as pretest scores, to identify how different groups respond to treatment.…
Descriptors: Causal Models, Item Response Theory, Statistical Inference, Psychometrics
Slepkov, A. D.; Van Bussel, M. L.; Fitze, K. M.; Burr, W. S. – SAGE Open, 2021
There is a broad literature in multiple-choice test development, both in terms of item-writing guidelines, and psychometric functionality as a measurement tool. However, most of the published literature concerns multiple-choice testing in the context of expert-designed high-stakes standardized assessments, with little attention being paid to the…
Descriptors: Foreign Countries, Undergraduate Students, Student Evaluation, Multiple Choice Tests
Koyuncu, Ilhan; Kilic, Abdullah Faruk – International Journal of Assessment Tools in Education, 2021
In exploratory factor analysis, although the researchers decide which items belong to which factors by considering statistical results, the decisions taken sometimes can be subjective in case of having items with similar factor loadings and complex factor structures. The aim of this study was to examine the validity of classifying items into…
Descriptors: Classification, Graphs, Factor Analysis, Decision Making
Characterizing Reasoning about Fraction Arithmetic of Middle Grades Teachers in Three Latent Classes
Ölmez, Ibrahim Burak; Izsák, Andrew – Mathematical Thinking and Learning: An International Journal, 2021
We analyzed item responses provided by a nationwide sample of 990 in-service U.S. middle grades mathematics teachers to a novel survey focused on fraction arithmetic. The survey targeted four components of reasoning in terms of measured quantities: "Referent units," "Reversibility," "Partitioning and iterating," and…
Descriptors: Fractions, Arithmetic, Mathematics Instruction, Teaching Methods
Susanti, Yuni; Tokunaga, Takenobu; Nishikawa, Hitoshi – Research and Practice in Technology Enhanced Learning, 2020
The present study focuses on the integration of an automatic question generation (AQG) system and a computerised adaptive test (CAT). We conducted two experiments. In the first experiment, we administered sets of questions to English learners to gather their responses. We further used their responses in the second experiment, which is a…
Descriptors: Computer Assisted Testing, Test Items, Simulation, English Language Learners
Jing Chen; Yi Jiang – SAGE Open, 2025
Anticipatory "it" pattern, which encodes interpersonal stance, plays a crucial role in academic writing. While previous studies have been explored the overuse and the underuse of this pattern among English as a Foreign Language (EFL) learners and published writers, there has been limited exploration of how EFL learners use the…
Descriptors: Foreign Countries, Masters Programs, Graduate Students, English (Second Language)
Lambert, Richard G. – Center for Educational Measurement and Evaluation, 2023
This study sought to investigate whether there were performance differences between the children who engaged with the Ignite by Hatch™ educational gaming system using the English- or Spanish-language versions of the games. Differential item functioning methods (DIF) were employed to investigate these differences. Specifically, DIF analyses can…
Descriptors: Comparative Analysis, Educational Games, Spanish, English
Hsu, Liwei – Computer Assisted Language Learning, 2023
This study presents essential findings that would be advantageous to the designers and practitioners of Language Massive Open Online Courses (LMOOCs). It reveals the key elements that contribute to successful LMOOCs, and how they are influenced by English as a foreign language (EFL) learners' personal characteristics (gender, age, and opened to…
Descriptors: MOOCs, English (Second Language), Second Language Learning, Second Language Instruction
Deribo, Tobias; Goldhammer, Frank; Kroehne, Ulf – Educational and Psychological Measurement, 2023
As researchers in the social sciences, we are often interested in studying not directly observable constructs through assessments and questionnaires. But even in a well-designed and well-implemented study, rapid-guessing behavior may occur. Under rapid-guessing behavior, a task is skimmed shortly but not read and engaged with in-depth. Hence, a…
Descriptors: Reaction Time, Guessing (Tests), Behavior Patterns, Bias
Aydin, Utkun; Birgili, Bengi – Educational Assessment, 2023
Internationally, mathematics education reform has been directed toward characterizing educational goals that go beyond topic/content/skill descriptions and develop students' problem solving. The Revised Bloom's Taxonomy and MATH (Mathematical Assessment Task Hierarchy) Taxonomy characterize such goals. University entrance examinations have been…
Descriptors: Critical Thinking, Thinking Skills, Skill Development, Mathematics Instruction
Hartono, Wahyu; Hadi, Samsul; Rosnawati, Raden; Retnawati, Heri – Pegem Journal of Education and Instruction, 2023
Researchers design diagnostic assessments to measure students' knowledge structures and processing skills to provide information about their cognitive attribute. The purpose of this study is to determine the instrument's validity and score reliability, as well as to investigate the use of classical test theory to identify item characteristics. The…
Descriptors: Diagnostic Tests, Test Validity, Item Response Theory, Content Validity
R. Freed; D. H. McKinnon; M. T. Fitzgerald; S. Salimpour – Physical Review Physics Education Research, 2023
This paper presents the results of a confirmatory factor analysis on two self-efficacy scales designed to probe the self-efficacy of college-level introductory astronomy (Astro-101) students (n ¼ 15181) from 22 institutions across the United States of America and Canada. The students undertook a course based on similar curriculum materials, which…
Descriptors: Self Efficacy, Science Instruction, Astronomy, Factor Analysis
Tu, Thuy Thi Minh – ProQuest LLC, 2023
The study aimed to elicit information from Vietnamese EFL university instructors about their knowledge and skills regarding the principles, theory, and practices of language assessment by means of revision and validation of the Language Assessment Literacy--Revised Vietnam (LAL-RV), which was previously developed by Kremmel and Harding (2020). A…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, College Faculty
Ali M. Alodat; Marcia Gentry; Hyeseong Lee – Gifted Child Quarterly, 2024
Exceptionally talented refugee students are often underrepresented in allocating to gifted programs because of inadequate identification methods in Arab countries. This study investigates the Arabic version of the Having Opportunities Promotes Excellence (HOPE) Scale for identifying gifted refugee students. Students (n = 13,598) from refugee camp…
Descriptors: Rating Scales, Refugees, Identification, Arabic
Tim Stoeckel; Tomoko Ishii – Vocabulary Learning and Instruction, 2024
In an upcoming coverage-comprehension study, we plan to assess learners' meaning-recall knowledge of words as they occur in the study's reading passage. As several meaning-recall test formats exist, the purpose of this small-scale study (N = 10) was to determine which of three formats was most similar to a criterion interview regarding mean score…
Descriptors: Vocabulary Development, Language Tests, Second Language Learning, Classification