Publication Date
In 2025 | 39 |
Since 2024 | 162 |
Since 2021 (last 5 years) | 585 |
Since 2016 (last 10 years) | 1221 |
Since 2006 (last 20 years) | 2727 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 169 |
Practitioners | 49 |
Teachers | 32 |
Administrators | 8 |
Policymakers | 8 |
Counselors | 4 |
Students | 4 |
Media Staff | 1 |
Location
Turkey | 172 |
Australia | 81 |
Canada | 79 |
China | 69 |
United States | 55 |
Germany | 43 |
Taiwan | 43 |
Japan | 40 |
United Kingdom | 38 |
Iran | 36 |
Spain | 33 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Does not meet standards | 1 |
Tatiana di Lucia Faion Franchi; Felipe Valentini; Leonardo Botinhon de Campos; Letícia da Silva de Souza; Pedro Vanni; Leonardo de Barros Mose; Ricardo Primi – International Journal of Testing, 2024
This study investigated the application of item quadruplets to control social desirability in measuring socio-emotional competencies and their relationship with intelligence. Quadruplets involve four variations of an item, differing in the polarity of their descriptive content and evaluative content (social desirability): positive-desirable,…
Descriptors: Social Desirability, Social Emotional Learning, Interpersonal Competence, Emotional Intelligence
Metsämuuronen, Jari – International Journal of Educational Methodology, 2020
A new index of item discrimination power (IDP), dimension-corrected Somers' D (D2) is proposed. Somers' D is one of the superior alternatives for item-total- (Rit) and item-rest correlation (Rir) in reflecting the real IDP with items with scales 0/1 and 0/1/2, that is, up to three categories. D also reaches the extreme value +1 and -1 correctly…
Descriptors: Item Analysis, Correlation, Test Items, Simulation
Tsutsumi, Emiko; Kinoshita, Ryo; Ueno, Maomi – International Educational Data Mining Society, 2021
Knowledge tracing (KT), the task of tracking the knowledge state of each student over time, has been assessed actively by artificial intelligence researchers. Recent reports have described that Deep-IRT, which combines Item Response Theory (IRT) with a deep learning model, provides superior performance. It can express the abilities of each student…
Descriptors: Item Response Theory, Prediction, Accuracy, Artificial Intelligence
Mohammed Alqabbaa – ProQuest LLC, 2021
Psychometricians at an organization named the Education and Training Evaluation Commission (ETEC) developed a new test scoring method called the latent D-scoring method (DSM-L) where it is believed that the new method itself is much easier and more efficient to use compared to the Item Response Theory (IRT) method. However, there are no studies…
Descriptors: Item Response Theory, Scoring, Item Analysis, Equated Scores
Baghestani, Ahmad Reza; Ahmadi, Farzane; Tanha, Azadeh; Meshkat, Mojtaba – Measurement and Evaluation in Counseling and Development, 2019
The content validity ratio (CVR), which is suggested by Lawshe (1975), is a widely used index to quantify content validity. In this study, the Bayesian approach is used to determine the minimum number of experts required to agree an item is essential, and then the CVR is calculated.
Descriptors: Content Validity, Bayesian Statistics, Specialists, Measurement Techniques
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
Jorge López González; Paula Crespí; Belén Obispo-Díaz; Jesús Rodríguez Barroso – Journal of Beliefs & Values, 2025
The cardinal virtues (prudence, justice, fortitude, and temperance) have relevance within the areas of character education and integral or comprehensive formation. In recent years, there has been growing interest and a great deal of literature produced on character education and its measurement. In this paper, we propose a questionnaire (a…
Descriptors: Ethics, Self Concept, Values Education, Likert Scales
Mimi Ismail; Ahmed Al - Badri; Said Al - Senaidi – Journal of Education and e-Learning Research, 2025
This study aimed to reveal the differences in individuals' abilities, their standard errors, and the psychometric properties of the test according to the two methods of applying the test (electronic and paper). The descriptive approach was used to achieve the study's objectives. The study sample consisted of 74 male and female students at the…
Descriptors: Achievement Tests, Computer Assisted Testing, Psychometrics, Item Response Theory
Li, Jiayuan; Van den Noortgate, Wim – Sociological Methods & Research, 2022
This article presents an updated meta-analysis of survey experiments comparing the performance of the item count technique (ICT) and the direct questioning method. After synthesizing 246 effect sizes from 54 studies, we find that the probability that a sensitive item will be selected is 0.089 higher when using ICT compared to direct questioning.…
Descriptors: Meta Analysis, Surveys, Comparative Analysis, Social Science Research
Hryvko, Antonina V.; Zhuk, Yurii O. – Journal of Curriculum and Teaching, 2022
A feature of the presented study is a comprehensive approach to studying the reliability problem of linguistic testing results due to the several functional and variable factors impact. Contradictions and ambiguous views of scientists on the researched issues determine the relevance of this study. The article highlights the problem of equivalence…
Descriptors: Student Evaluation, Language Tests, Test Format, Test Items
Toma, Radu Bogdan; Lederman, Norman G. – Research in Science Education, 2022
The development of attitudes toward science instruments has recently emerged in science education research. However, a comprehensive review of their psychometric properties, using currently accepted assessment standards, has not yet been completed. Consequently, this review discusses the validity and reliability of 18 measures published between…
Descriptors: Attitude Measures, Measures (Individuals), Psychometrics, Standards
Shahat, Mohamed A.; Boone, William J.; Ambusaidi, Abdullah K.; Al Bahri, Khalsa; Ohle-Peters, Annika – Journal of Baltic Science Education, 2022
A range of pedagogical learning theories has been proposed to guide science teachers' classroom teaching. This study presents the results of the development and use of an 18-item Arabic language rating scale survey to assess Omani teachers' (N = 400) views towards the application of selected pedagogical learning theories of potential use in their…
Descriptors: Science Teachers, Teacher Surveys, Teacher Attitudes, Item Analysis
Dalka, Robert P.; Sachmpazidi, Diana; Henderson, Charles; Zwolak, Justyna P. – Physical Review Physics Education Research, 2022
Likert-style surveys are a widely used research instrument to assess respondents' preferences, beliefs, or experiences. In this paper, we propose and demonstrate how network analysis (NA) can be employed to model and evaluate the interconnectedness of items in Likert-style surveys. We explore the advantages of this approach by applying the…
Descriptors: Network Analysis, Likert Scales, Student Experience, Comparative Analysis
Hayat, Bahrul – Cogent Education, 2022
The purpose of this study comprises (1) calibrating the Basic Statistics Test for Indonesian undergraduate psychology students using the Rasch model, (2) testing the impact of adjustment for guessing on item parameters, person parameters, test reliability, and distribution of item difficulty and person ability, and (3) comparing person scores…
Descriptors: Guessing (Tests), Statistics Education, Undergraduate Students, Psychology
Martha L. Epstein; Hamza Malik; Kun Wang; Chandra Hawley Orrill – Grantee Submission, 2022
Response Process Validity (RPV) reflects the degree to which items are interpreted as intended by item developers. In this study, teacher responses to constructed response (CR) items to assess pedagogical content knowledge (PCK) of middle school mathematics teachers were evaluated to determine what types of teacher responses signaled weak RPV. We…
Descriptors: Teacher Response, Test Items, Pedagogical Content Knowledge, Mathematics Teachers