Publication Date
In 2025 | 39 |
Since 2024 | 162 |
Since 2021 (last 5 years) | 585 |
Since 2016 (last 10 years) | 1221 |
Since 2006 (last 20 years) | 2727 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 169 |
Practitioners | 49 |
Teachers | 32 |
Administrators | 8 |
Policymakers | 8 |
Counselors | 4 |
Students | 4 |
Media Staff | 1 |
Location
Turkey | 172 |
Australia | 81 |
Canada | 79 |
China | 69 |
United States | 55 |
Germany | 43 |
Taiwan | 43 |
Japan | 40 |
United Kingdom | 38 |
Iran | 36 |
Spain | 33 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Does not meet standards | 1 |
Sen, Sedat; Cohen, Allan S. – Measurement: Interdisciplinary Research and Perspectives, 2019
Mixture item response theory (MixIRT) models combine IRT models with latent class model and assume that there exist latent subpopulations in the data. Identification of latent subpopulations via MixIRT models produces more detailed information. Detailed information about the response processing of examinees provides a better understanding of the…
Descriptors: Item Response Theory, Models, Item Analysis, Personality Traits
Eaton, Philip; Frank, Barrett; Johnson, Keith; Willoughy, Shannon – Physical Review Physics Education Research, 2019
While numerous studies have analyzed the conceptions probed by the Force Concept Inventory (FCI), assessments dedicated to electricity and magnetism lack similar analyses. This paper investigated the conceptions explored by the Brief Electricity and Magnetism Assessment (BEMA) and the Conceptual Survey of Electricity and Magnetism (CSEM) using…
Descriptors: Energy, Magnets, Physics, Science Tests
Wang, Wenyi; Song, Lihong; Chen, Ping; Ding, Shuliang – Journal of Educational Measurement, 2019
Most of the existing classification accuracy indices of attribute patterns lose effectiveness when the response data is absent in diagnostic testing. To handle this issue, this article proposes new indices to predict the correct classification rate of a diagnostic test before administering the test under the deterministic noise input…
Descriptors: Cognitive Tests, Classification, Accuracy, Diagnostic Tests
Svetina, Dubravka; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2019
This study investigates the effect of several design and administration choices on item exposure and person/item parameter recovery under a multistage test (MST) design. In a simulation study, we examine whether number-correct (NC) or item response theory (IRT) methods are differentially effective at routing students to the correct next stage(s)…
Descriptors: Measurement, Item Analysis, Test Construction, Item Response Theory
Feuerstahler, Leah; Wilson, Mark – Journal of Educational Measurement, 2019
Scores estimated from multidimensional item response theory (IRT) models are not necessarily comparable across dimensions. In this article, the concept of aligned dimensions is formalized in the context of Rasch models, and two methods are described--delta dimensional alignment (DDA) and logistic regression alignment (LRA)--to transform estimated…
Descriptors: Item Response Theory, Models, Scores, Comparative Analysis
Xu, Jie – ProQuest LLC, 2019
Research has shown that cross-sectional mediation analysis cannot accurately reflect a true longitudinal mediated effect. To investigate longitudinal mediated effects, different longitudinal mediation models have been proposed and these models focus on different research questions related to longitudinal mediation. When fitting mediation models to…
Descriptors: Case Studies, Error of Measurement, Longitudinal Studies, Models
Sarallah Jafaripour; Omid Tabatabaei; Hadi Salehi; Hossein Vahid Dastjerdi – International Journal of Language Testing, 2024
The purpose of this study was to examine gender and discipline-based Differential Item Functioning (DIF) and Differential Distractor Functioning (DDF) on the Islamic Azad University English Proficiency Test (IAUEPT). The study evaluated DIF and DDF across genders and disciplines using the Rasch model. To conduct DIF and DDF analysis, the examinees…
Descriptors: Item Response Theory, Test Items, Language Tests, Language Proficiency
Susu Zhang; Xueying Tang; Qiwei He; Jingchen Liu; Zhiliang Ying – Grantee Submission, 2024
Computerized assessments and interactive simulation tasks are increasingly popular and afford the collection of process data, i.e., an examinee's sequence of actions (e.g., clickstreams, keystrokes) that arises from interactions with each task. Action sequence data contain rich information on the problem-solving process but are in a nonstandard,…
Descriptors: Correlation, Problem Solving, Computer Assisted Testing, Prediction
Pariwat Imsa-ard – LEARN Journal: Language Education and Acquisition Research Network, 2024
This study diverges from conventional psychological perspectives that often concentrate on language learning deficiencies necessitating remedial interventions. Instead, it emphasizes the pivotal role of positive psychological constructs in facilitating optimal language development, particularly in contexts characterized by limited exposure to…
Descriptors: Psychological Patterns, English (Second Language), Second Language Learning, Second Language Instruction
Rujun Xu; James Soland – International Journal of Testing, 2024
International surveys are increasingly being used to understand nonacademic outcomes like math and science motivation, and to inform education policy changes within countries. Such instruments assume that the measure works consistently across countries, ethnicities, and languages--that is, they assume measurement invariance. While studies have…
Descriptors: Surveys, Statistical Bias, Achievement Tests, Foreign Countries
Jin, Kuan-Yu; Wu, Yi-Jhen; Chen, Hui-Fang – Journal of Educational and Behavioral Statistics, 2022
For surveys of complex issues that entail multiple steps, multiple reference points, and nongradient attributes (e.g., social inequality), this study proposes a new multiprocess model that integrates ideal-point and dominance approaches into a treelike structure (IDtree). In the IDtree, an ideal-point approach describes an individual's attitude…
Descriptors: Likert Scales, Item Response Theory, Surveys, Responses
Kim, Hyun-Kyung; Kim, Haesun A. – International Journal of Science and Mathematics Education, 2022
The study aims to analyze student responses to chemistry constructed response items to obtain detailed information on science NAEA (National Assessment of Educational Achievement) in South Korea and to draw suggestions for enhancing curriculum, teaching, and learning. For this purpose, we analyzed 7444 answers that could be generalized as 1.29% of…
Descriptors: Foreign Countries, Science Achievement, Science Tests, Teaching Methods
McCorkle, William; Montezuma, Jessie – Journal of Social Studies Education Research, 2022
The ideas of free markets and less government regulation were associated at the turn of the 20th Century with a more internationalist approach and, at times, even more openness to immigration. Some of these dynamics have shifted particularly with the rise of a more populist economic message with leaders like Donald Trump. This study examines the…
Descriptors: Nationalism, Immigration, Social Studies, Teaching Methods
Fröber, Kerstin; Jurczyk, Vanessa; Dreisbach, Gesine – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2022
Frequent forced switching between tasks has been shown to reduce switch costs and increase voluntary switch rates. So far, however, the boundary conditions of the influence of forced task switching on voluntary task switching are unknown. Thus, the present study was aimed to test different aspects of generalizability (across items, tasks, and…
Descriptors: Cognitive Ability, Attention Control, Task Analysis, Generalization
Beaty, Roger E.; Johnson, Dan R.; Zeitlen, Daniel C.; Forthmann, Boris – Creativity Research Journal, 2022
Semantic distance is increasingly used for automated scoring of originality on divergent thinking tasks, such as the Alternate Uses Task (AUT). Despite some psychometric support for semantic distance -- including positive correlations with human creativity ratings -- additional work is needed to optimize its reliability and validity, including…
Descriptors: Semantics, Scoring, Creative Thinking, Creativity