NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 46 to 60 of 9,759 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Razavipour, Kioumars; Raji, Behnaz – Language Testing in Asia, 2022
The credibility of conclusions arrived at in quantitative research depends, to a large extent, on the quality of data collection instruments used to quantify language and non-language constructs. Despite this, research into data collection instruments used in Applied Linguistics and particularly in the thesis genre remains limited. This study…
Descriptors: Applied Linguistics, Test Reliability, Language Tests, Credibility
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Mirjam Sophia Glessmer; Rachel Forsyth – Teaching & Learning Inquiry, 2025
Generative AI tools (GenAI) are increasingly used for academic tasks, including qualitative data analysis for the Scholarship of Teaching and Learning (SoTL). In our practice as academic developers, we are frequently asked for advice on whether this use for GenAI is reliable, valid, and ethical. Since this is a new field, we have not been able to…
Descriptors: Artificial Intelligence, Research Methodology, Data Analysis, Scholarship
Peer reviewed Peer reviewed
Direct linkDirect link
Alberto Gandolfi – International Journal of Artificial Intelligence in Education, 2025
In this paper, we initially investigate the capabilities of GPT-3 5 and GPT-4 in solving college-level calculus problems, an essential segment of mathematics that remains under-explored so far. Although improving upon earlier versions, GPT-4 attains approximately 65% accuracy for standard problems and decreases to 20% for competition-like…
Descriptors: Artificial Intelligence, Reliability, Problem Solving, Mathematics Skills
Peer reviewed Peer reviewed
Direct linkDirect link
Takashi Mori; Nami Ogawa; Ichiro Fujishima; Hidetaka Wakabayashi; Keishi Okamoto; Yuto Kameyama; Ai Hirano; Fumiko Oshima; Masataka Itoda; Sumito Ogawa; Tomohisa Ohno; Minoru Yamada; Kenjiro Kunieda; Takashi Shigematsu; Shinta Nishioka; Kazuki Fukuma; Akio Shimizu; Yoichiro Sugiyama – International Journal of Language & Communication Disorders, 2025
Purpose: Measurement of swallowing muscle mass is important in determining sarcopenic dysphagia. Ultrasound equipment can measure the cross-sectional area of the swallowing muscles, but the inter-instrument reliability is unknown. In this study, the inter-instrument reliability was investigated. Methods: Three ultrasound devices were used to…
Descriptors: Human Body, Motor Reactions, Diagnostic Tests, Acoustics
Peer reviewed Peer reviewed
Direct linkDirect link
Guy B. deBrun – Journal of Outdoor Recreation, Education, and Leadership, 2025
Discussions of what it means to be an effective outdoor leader are common in outdoor education literature (Martin et al., 2025; Smith, 2021). Research has identified core competencies (Martin et al., 2025), conceptual frameworks (Pomfret et al., 2023), and course curricula/qualifications for effective leadership (Baker & O'Brien, 2019; Seaman…
Descriptors: Outdoor Leadership, Leadership Effectiveness, Evaluation Methods, Scoring Rubrics
Peer reviewed Peer reviewed
Direct linkDirect link
Brogan L. Barr; Virginia V. W. McIntosh; Eileen F. Britt; Jennifer Jordan; Janet D. Carter – Measurement: Interdisciplinary Research and Perspectives, 2024
Even when raters demonstrate agreement in the use of a measure, limited score variability or violation of often-ignored statistical assumptions can result in lower reliability estimates than intuitively expected. This article uses data drawn from two randomized controlled trials of schema therapy and cognitive behavioral therapy for the treatment…
Descriptors: Evaluators, Interrater Reliability, Reliability, Measurement Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Sas, Marlies; Snaphaan, Thom; Pauwels, Lieven J. R.; Ponnet, Koen; Hardyns, Wim – Field Methods, 2023
This study focuses on the use of systematic social observations (SSO) to measure crime prevention through environmental design (CPTED) and disorder. To improve knowledge about measurement issues in small area research, SSO is conducted by means of three different methods: in-situ, photographs, and Google Street View (GSV) imagery. By evaluating…
Descriptors: Crime Prevention, Measurement Techniques, Photography, Observation
Peer reviewed Peer reviewed
Direct linkDirect link
Haiko Bruno Zimmermann; Debora Knihs; Raphael Sakugawa; Chris Bishop; Juliano Dal Pupo – Measurement in Physical Education and Exercise Science, 2024
Background: Measures that assess muscle strength and its development, either voluntarily or involuntarily, are important in the clinical and research context. The main aim of this study was to verify the interday reliability and the minimum detectable change (MDC) of the knee extensors muscles torque using evoked contractions and explosive…
Descriptors: Human Body, Physiology, Motor Reactions, Muscular Strength
Peer reviewed Peer reviewed
Direct linkDirect link
William C. M. Belzak; Daniel J. Bauer – Journal of Educational and Behavioral Statistics, 2024
Testing for differential item functioning (DIF) has undergone rapid statistical developments recently. Moderated nonlinear factor analysis (MNLFA) allows for simultaneous testing of DIF among multiple categorical and continuous covariates (e.g., sex, age, ethnicity, etc.), and regularization has shown promising results for identifying DIF among…
Descriptors: Test Bias, Algorithms, Factor Analysis, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
L.J.G. Krijnen; K. Greaves-Lord; W. Mandy; K.J.S. Mataw; P. Hartog; S. Begeer – Journal of Autism and Developmental Disorders, 2024
The current study evaluated a brief, informant-based autism interview: the Developmental, Dimensional and Diagnostic Interview -- Adult Version (3Di-Adult). Feasibility, reliability and validity of the Dutch 3Di-Adult was tested amongst autistic participants (n = 62) and a non-autistic comparison group (n = 30) in the Netherlands. The 3Di-Adult…
Descriptors: Autism Spectrum Disorders, Identification, Foreign Countries, Adults
Peer reviewed Peer reviewed
Direct linkDirect link
Sidney Newton; Rui Wang – Educational Studies, 2024
Notwithstanding the neuromyth controversy, the malleability of learning style preferences impacts the validity of the measurement instrument and the effectiveness of the associated model of learning. This study investigates the test-retest reliability and underlying dynamics of Kolb's Learning Style Inventory (KLSI). It surveys 245 college-level…
Descriptors: Cognitive Style, Preferences, Reliability, Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Bronson Hui; Zhiyi Wu – Studies in Second Language Acquisition, 2024
A slowdown or a speedup in response times across experimental conditions can be taken as evidence of online deployment of knowledge. However, response-time difference measures are rarely evaluated on their reliability, and there is no standard practice to estimate it. In this article, we used three open data sets to explore an approach to…
Descriptors: Reliability, Reaction Time, Psychometrics, Criticism
Peer reviewed Peer reviewed
Direct linkDirect link
Richard S. Balkin; Quentin Hunter; Bradley T. Erford – Measurement and Evaluation in Counseling and Development, 2024
We describe best practices in reporting reliability estimates in counseling research with consideration to precision, generalization, and diverse populations. We provide a historical context to reporting reliability estimates, the limitations of past practices, and new methods to address reliability generalization. We highlight best practices…
Descriptors: Best Practices, Reliability, Counseling, Research
Peer reviewed Peer reviewed
Direct linkDirect link
Russell P. Houpt; Kevin J. Grimm; Aaron T. McLaughlin; Daryl R. Van Tongeren – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Numerous methods exist to determine the optimal number of classes when using latent profile analysis (LPA), but none are consistently correct. Recently, the likelihood incremental percentage per parameter (LI3P) was proposed as a model effect-size measure. To evaluate the LI3P more thoroughly, we simulated 50,000 datasets, manipulating factors…
Descriptors: Structural Equation Models, Profiles, Sample Size, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Ilona Kocvarová; Jan Kalenda; Jitka Vaculíková; Zuzana Neupauer; Ruženka Šimonji Cernak; Anna Wloch – Higher Education Quarterly, 2024
The article focuses on adaptation and validation of the Academic Motivation Scale questionnaire (AMS-28) in higher education in four Eastern European countries: Czechia, Slovakia, Serbia, and Poland. The research was conducted with a total of 1711 respondents. We examined the construct validity of AMS-28 including measurement invariance and…
Descriptors: Foreign Countries, Learning Motivation, Measures (Individuals), Validity
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  651