NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 166 to 180 of 5,127 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Valentina Albano; Donatella Firmani; Luigi Laura; Jerin George Mathew; Anna Lucia Paoletti; Irene Torrente – Journal of Learning Analytics, 2023
Multiple-choice questions (MCQs) are widely used in educational assessments and professional certification exams. Managing large repositories of MCQs, however, poses several challenges due to the high volume of questions and the need to maintain their quality and relevance over time. One of these challenges is the presence of questions that…
Descriptors: Natural Language Processing, Multiple Choice Tests, Test Items, Item Analysis
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Serap Buyukkidik – International Journal of Assessment Tools in Education, 2023
In the current study, differential item functioning (DIF) detection using real data was conducted with the application of "Mantel-Haenszel (MH)", "Simultaneous item bias test (SIBTEST)", "Lord's chi-square", and "Raju's area" methods, both when item purification was carried out and when item purification was…
Descriptors: Language Tests, Test Items, Item Analysis, Gender Differences
Peer reviewed Peer reviewed
Direct linkDirect link
Kaj Sparle Christensen; Ole Jakob Storebø; Bo Bach – Journal of Attention Disorders, 2025
Objective: This study examines the validity of the ASRS-5 as a new screening tool for ADHD and evaluates its proposed screening cut-off in a general population context. Method: A nationally representative sample of 2,002 individuals aged 18 to 80 years was surveyed using the ASRS-5, with complete data obtained from 714 participants. Psychometric…
Descriptors: Foreign Countries, Construct Validity, Psychometrics, Item Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Justin Harris – Language Teaching Research, 2025
This article outlines the development of a 16-item instrument for measuring language learner's foreign language self-efficacy (SE) concerning their speaking and listening skills through repeated administrations to groups of Japanese tertiary students. Responses were analysed through the Rasch model, which allows researchers to investigate…
Descriptors: Speech Communication, Questionnaires, Item Analysis, Second Language Learning
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Shasha Chen; Shaohui Chi; Zuhao Wang – Journal of Baltic Science Education, 2025
Interdisciplinary thinking is critical for equipping students to apply scientific knowledge and tackle societal challenges across various disciplines, which has been recognized as a key objective of twenty-first century science education. However, research on effective interdisciplinary assessment in secondary school science education is still…
Descriptors: Thinking Skills, Interdisciplinary Approach, Science Instruction, Grade 7
Peer reviewed Peer reviewed
Direct linkDirect link
Melanie Ann Weber; Mia Anzilotti; Reece Gormley; Christina Huber; Alyssa McGarvey; Grace McKee; Claire Ogden; Hannah Seinfeld; Julia Wank; Arnold Olszewski – Perspectives of the ASHA Special Interest Groups, 2024
Purpose: Technology, including educational applications (apps), is commonly used in schools by teachers and speech-language pathologists. Nonetheless, very little research has examined the efficacy of these apps for student learning or how to choose appropriate apps for instruction. Several previous rubrics to evaluate the instructional quality of…
Descriptors: Computer Software, Handheld Devices, Educational Technology, Technology Uses in Education
Peer reviewed Peer reviewed
Direct linkDirect link
Kunal Sareen – Innovations in Education and Teaching International, 2024
This study examines the proficiency of Chat GPT, an AI language model, in answering questions on the Situational Judgement Test (SJT), a widely used assessment tool for evaluating the fundamental competencies of medical graduates in the UK. A total of 252 SJT questions from the "Oxford Assess and Progress: Situational Judgement" Test…
Descriptors: Ethics, Decision Making, Artificial Intelligence, Computer Software
Peer reviewed Peer reviewed
Direct linkDirect link
Shadi Noroozi; Hossein Karami – Language Testing in Asia, 2024
Recently, psychometricians and researchers have voiced their concern over the exploration of language test items in light of Messick's validation framework. Validity has been central to test development and use; however, it has not received due attention in language tests having grave consequences for test takers. The present study sought to…
Descriptors: Foreign Countries, Doctoral Students, Graduate Students, Language Proficiency
Peer reviewed Peer reviewed
Direct linkDirect link
Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Educational and Psychological Measurement, 2021
This study presents a latent (item response theory--like) framework of a recently developed classical approach to test scoring, equating, and item analysis, referred to as "D"-scoring method. Specifically, (a) person and item parameters are estimated under an item response function model on the "D"-scale (from 0 to 1) using…
Descriptors: Scoring, Equated Scores, Item Analysis, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Kang, Hyeon-Ah; Han, Suhwa; Kim, Doyoung; Kao, Shu-Chuan – Educational and Psychological Measurement, 2022
The development of technology-enhanced innovative items calls for practical models that can describe polytomous testlet items. In this study, we evaluate four measurement models that can characterize polytomous items administered in testlets: (a) generalized partial credit model (GPCM), (b) testlet-as-a-polytomous-item model (TPIM), (c)…
Descriptors: Goodness of Fit, Item Response Theory, Test Items, Scoring
Peer reviewed Peer reviewed
Direct linkDirect link
Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022
Educational assessments often require uniform test forms, for which each test form has equivalent measurement accuracy but with a different set of items. For uniform test assembly, an important issue is the increase of the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…
Descriptors: Simulation, Efficiency, Test Items, Educational Assessment
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kolarec, Biserka; Nincevic, Marina – International Society for Technology, Education, and Science, 2022
The object of research is a statistics exam that contains problem tasks. One examiner performed two exam evaluation methods to repeatedly evaluate the exam. The goal was to compare the methods for objectivity. One of the two exam evaluation methods we call a serial evaluation method. The serial evaluation method assumes evaluation of all exam…
Descriptors: Statistics Education, Mathematics Tests, Evaluation Methods, Test Construction
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Castillo-Diaz, Marcio Alexander; Gomes, Cristiano Mauro Assis; Jelihovschi, Enio Galinkin – International Journal of Educational Methodology, 2022
The field of studies in metacognition points to some limitations in the way the construct has traditionally been measured and shows a near absence of performance-based tests. The Meta-Text is a performance-based test recently created to assess components of cognition regulation: planning, monitoring, and judgment. This study presents the first…
Descriptors: Schemata (Cognition), Decision Making, Undergraduate Students, Foreign Countries
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kusmaryono, Imam; Wijayanti, Dyana; Maharani, Hevy Risqi – International Journal of Educational Methodology, 2022
This study reviews 60 papers using a Likert scale and published between 2012-2021. Screening for literature review uses the PRISMA method. The data analysis technique was carried out through data extraction, then synthesized in a structured manner using the narrative method. To achieve credible research results at the stage of the data collection…
Descriptors: Likert Scales, Social Science Research, Rating Scales, Group Discussion
Peer reviewed Peer reviewed
Direct linkDirect link
Watkins, Marley W.; Canivez, Gary L. – School Psychology Review, 2022
IQ tests provide numerous scores, but valid interpretation of those scores is dependent on how precisely each score reflects its intended construct and whether it provides unique information independent of other constructs. Thus, IQ scores must be evaluated for their reliability and dimensionality to determine their psychometric utility. As a…
Descriptors: Children, Intelligence Tests, Scores, Psychometrics
Pages: 1  |  ...  |  8  |  9  |  10  |  11  |  12  |  13  |  14  |  15  |  16  |  ...  |  342