Publication Date
| In 2026 | 0 |
| Since 2025 | 74 |
| Since 2022 (last 5 years) | 509 |
| Since 2017 (last 10 years) | 1084 |
| Since 2007 (last 20 years) | 2603 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 169 |
| Practitioners | 49 |
| Teachers | 32 |
| Administrators | 8 |
| Policymakers | 8 |
| Counselors | 4 |
| Students | 4 |
| Media Staff | 1 |
Location
| Turkey | 173 |
| Australia | 81 |
| Canada | 79 |
| China | 72 |
| United States | 56 |
| Taiwan | 44 |
| Germany | 43 |
| Japan | 41 |
| United Kingdom | 39 |
| Iran | 37 |
| Indonesia | 35 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Tim Stoeckel; Liang Ye Tan; Hung Tan Ha; Nam Thi Phuong Ho; Tomoko Ishii; Young Ae Kim; Chunmei Huang; Stuart McLean – Vocabulary Learning and Instruction, 2024
Local item dependency (LID) occurs when test-takers' responses to one test item are affected by their responses to another. It can be problematic if it causes inflated reliability estimates or distorted person and item measures. The cued-recall reading comprehension test in Hu and Nation's (2000) well-known and influential coverage--comprehension…
Descriptors: Reading Comprehension, English (Second Language), Second Language Instruction, Second Language Learning
Li, Jiayuan; Van den Noortgate, Wim – Sociological Methods & Research, 2022
This article presents an updated meta-analysis of survey experiments comparing the performance of the item count technique (ICT) and the direct questioning method. After synthesizing 246 effect sizes from 54 studies, we find that the probability that a sensitive item will be selected is 0.089 higher when using ICT compared to direct questioning.…
Descriptors: Meta Analysis, Surveys, Comparative Analysis, Social Science Research
Hryvko, Antonina V.; Zhuk, Yurii O. – Journal of Curriculum and Teaching, 2022
A feature of the presented study is a comprehensive approach to studying the reliability problem of linguistic testing results due to the several functional and variable factors impact. Contradictions and ambiguous views of scientists on the researched issues determine the relevance of this study. The article highlights the problem of equivalence…
Descriptors: Student Evaluation, Language Tests, Test Format, Test Items
Toma, Radu Bogdan; Lederman, Norman G. – Research in Science Education, 2022
The development of attitudes toward science instruments has recently emerged in science education research. However, a comprehensive review of their psychometric properties, using currently accepted assessment standards, has not yet been completed. Consequently, this review discusses the validity and reliability of 18 measures published between…
Descriptors: Attitude Measures, Measures (Individuals), Psychometrics, Standards
Shahat, Mohamed A.; Boone, William J.; Ambusaidi, Abdullah K.; Al Bahri, Khalsa; Ohle-Peters, Annika – Journal of Baltic Science Education, 2022
A range of pedagogical learning theories has been proposed to guide science teachers' classroom teaching. This study presents the results of the development and use of an 18-item Arabic language rating scale survey to assess Omani teachers' (N = 400) views towards the application of selected pedagogical learning theories of potential use in their…
Descriptors: Science Teachers, Teacher Surveys, Teacher Attitudes, Item Analysis
Dalka, Robert P.; Sachmpazidi, Diana; Henderson, Charles; Zwolak, Justyna P. – Physical Review Physics Education Research, 2022
Likert-style surveys are a widely used research instrument to assess respondents' preferences, beliefs, or experiences. In this paper, we propose and demonstrate how network analysis (NA) can be employed to model and evaluate the interconnectedness of items in Likert-style surveys. We explore the advantages of this approach by applying the…
Descriptors: Network Analysis, Likert Scales, Student Experience, Comparative Analysis
Hayat, Bahrul – Cogent Education, 2022
The purpose of this study comprises (1) calibrating the Basic Statistics Test for Indonesian undergraduate psychology students using the Rasch model, (2) testing the impact of adjustment for guessing on item parameters, person parameters, test reliability, and distribution of item difficulty and person ability, and (3) comparing person scores…
Descriptors: Guessing (Tests), Statistics Education, Undergraduate Students, Psychology
Martha L. Epstein; Hamza Malik; Kun Wang; Chandra Hawley Orrill – Grantee Submission, 2022
Response Process Validity (RPV) reflects the degree to which items are interpreted as intended by item developers. In this study, teacher responses to constructed response (CR) items to assess pedagogical content knowledge (PCK) of middle school mathematics teachers were evaluated to determine what types of teacher responses signaled weak RPV. We…
Descriptors: Teacher Response, Test Items, Pedagogical Content Knowledge, Mathematics Teachers
Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023
Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…
Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests
Marcinek, Tibor; Jakobsen, Arne; Partová, Edita – Journal of Mathematics Teacher Education, 2023
The measures of mathematical knowledge for teaching developed at the University of Michigan in the U.S., have been adapted and used in studies measuring teacher knowledge in several countries around the world. In the adaptation, many of these studies relied on comparisons of item parameters and none of them considered a comparison of raw data. In…
Descriptors: Mathematics Skills, Cross Cultural Studies, Foreign Countries, Pedagogical Content Knowledge
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023
A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness
Daus, Stephan; Skjelbred, Siv-Elisabeth; Pedersen, Cathrine – Journal of Psychoeducational Assessment, 2023
To improve the understanding of the drivers of interest, and its impact on other outcomes, researchers and educators need valid and informative measures capturing the different domains of interest. Answering the lack of interest measures in marketing education, we develop and psychometrically assess three instruments reflecting the theoretical…
Descriptors: Marketing, Student Interests, Career Choice, Student Attitudes
Strong, John Z. – Reading & Writing Quarterly, 2023
Awareness of informational text structures is related to reading comprehension and varies according to characteristics of readers and texts. The purpose of this study was to develop and refine a measure of text structure awareness, the Text Structure Identification Test (TSIT), by investigating its internal consistency reliability and construct…
Descriptors: Text Structure, Reading Instruction, Construct Validity, Grade 4
Lendínez-Turón, Ana; Domínguez-Valerio, Cándida María; Orgaz-Agüera, Francisco; Moral-Cuadra, Salvador – International Journal of Sustainability in Higher Education, 2023
Purpose: The purpose of this research paper is to adapt and validate a useful instrument to diagnose the knowledge, attitudes, behaviours and intention to participate (KABIP) towards Sustainable Development Goals (SDGs) in higher education institutions (HEIs) from the public administration in developing countries. Design/methodology/approach: The…
Descriptors: Public Administration, Sustainable Development, Objectives, Higher Education
Peabody, Michael R. – Measurement: Interdisciplinary Research and Perspectives, 2023
Many organizations utilize some form of automation in the test assembly process; either fully algorithmic or heuristically constructed. However, one issue with heuristic models is that when the test assembly problem changes the entire model may need to be re-conceptualized and recoded. In contrast, mixed-integer programming (MIP) is a mathematical…
Descriptors: Programming Languages, Algorithms, Heuristics, Mathematical Models

Peer reviewed
Direct link
