NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 436 to 450 of 9,530 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Y. Yokhebed; Rexy Maulana Dwi Karmadi; Luvia Ranggi Nastiti – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2025
Although self-assessment in critical thinking is thought to help students recognise their strengths and weaknesses, the reliability and validity of the assessment tool is still questionable, so a more objective evaluation is needed. Objective of this investigation is to assess the self-assessment tools in evaluating students' critical thinking…
Descriptors: Self Evaluation (Individuals), Critical Thinking, Science and Society, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Jieun Kim; Byungmin Lee – Reading in a Foreign Language, 2025
While it has been observed that second language learners selectively read information to answer comprehension questions, the amount of textual information required to correctly answer a question remains unclear. This study sought to identify the effects of this strategic reading on reading comprehension performance and explore how proficiency…
Descriptors: Reading Strategies, Reading Tests, English Learners, Second Language Learning
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Dwi Rismi Ocy; Iva Sarifah; Riyadi – Journal of Research and Advances in Mathematics Education, 2025
Mathematical abstraction skills are fundamental for advanced reasoning and problem-solving, yet assessing these skills in senior high school students poses challenges due to limited validated instruments. This study aims to develop and validate a test instrument for measuring mathematical abstraction skills in Indonesian high school students. The…
Descriptors: Abstract Reasoning, Mathematics Tests, Mathematics Instruction, Mathematics Teachers
Peer reviewed Peer reviewed
Direct linkDirect link
Dulce Romero-Ayuso; Garbiñe Guerra-Begoña; Laura Marco-Miralles; José Matías Triviño-Juárez; Sonia Pérez-Rodríguez; Carmen Vidal-Ramírez; Abel Toledano-González; Sara Rosenblum – Reading and Writing: An Interdisciplinary Journal, 2025
Handwriting is a perceptual-motor skill encompassing a series of psychomotor skills related to academic performance. The main aim of this study was to translate and study the psychometric properties of the Handwriting Proficiency Screening Questionnaire for Children (HPSQ-C) for the Spanish population. A study was conducted on a final sample of…
Descriptors: Handwriting, Writing Skills, Screening Tests, Questionnaires
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Arandha May Rachmawati; Agus Widyantoro – English Language Teaching Educational Journal, 2025
This study aims to evaluate the quality of English reading comprehension test instruments used in informal learning, especially as English literacy tests. With a quantitative approach, the analysis was carried out using the Rasch model through the Quest program on 30 multiple-choice questions given to 30 grade IX students from informal educational…
Descriptors: Item Response Theory, Reading Tests, Reading Comprehension, English (Second Language)
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Robitzsch, Alexander – Journal of Intelligence, 2020
The last series of Raven's standard progressive matrices (SPM-LS) test was studied with respect to its psychometric properties in a series of recent papers. In this paper, the SPM-LS dataset is analyzed with regularized latent class models (RLCMs). For dichotomous item response data, an alternative estimation approach based on fused regularization…
Descriptors: Statistical Analysis, Classification, Intelligence Tests, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Jinho; Wilson, Mark – Educational and Psychological Measurement, 2020
This study investigates polytomous item explanatory item response theory models under the multivariate generalized linear mixed modeling framework, using the linear logistic test model approach. Building on the original ideas of the many-facet Rasch model and the linear partial credit model, a polytomous Rasch model is extended to the item…
Descriptors: Item Response Theory, Test Items, Models, Responses
Peer reviewed Peer reviewed
Direct linkDirect link
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2020
This note raises caution that a finding of a marked pseudo-guessing parameter for an item within a three-parameter item response model could be spurious in a population with substantial unobserved heterogeneity. A numerical example is presented wherein each of two classes the two-parameter logistic model is used to generate the data on a…
Descriptors: Guessing (Tests), Item Response Theory, Test Items, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Falcão, Filipe; Costa, Patrício; Pêgo, José M. – Advances in Health Sciences Education, 2022
Background: Current demand for multiple-choice questions (MCQs) in medical assessment is greater than the supply. Consequently, an urgency for new item development methods arises. Automatic Item Generation (AIG) promises to overcome this burden, generating calibrated items based on the work of computer algorithms. Despite the promising scenario,…
Descriptors: Multiple Choice Tests, Computer Assisted Testing, Test Items, Medical Education
Peer reviewed Peer reviewed
Direct linkDirect link
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Hyung Jin; Lee, Won-Chan – Journal of Educational Measurement, 2022
Orlando and Thissen (2000) introduced the "S - X[superscript 2]" item-fit index for testing goodness-of-fit with dichotomous item response theory (IRT) models. This study considers and evaluates an alternative approach for computing "S - X[superscript 2]" values and other factors associated with collapsing tables of observed…
Descriptors: Goodness of Fit, Test Items, Item Response Theory, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Elkhatat, Ahmed M. – International Journal for Educational Integrity, 2022
Examinations form part of the assessment processes that constitute the basis for benchmarking individual educational progress, and must consequently fulfill credibility, reliability, and transparency standards in order to promote learning outcomes and ensure academic integrity. A randomly selected question examination (RSQE) is considered to be an…
Descriptors: Integrity, Monte Carlo Methods, Credibility, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Sen, Sedat – Creativity Research Journal, 2022
The purpose of this study was to estimate the overall reliability values for the scores produced by Runco Ideational Behavior Scale (RIBS) and explore the variability of RIBS score reliability across studies. To achieve this, a reliability generalization meta-analysis was carried out using the 86 Cronbach's alpha estimates obtained from 77 studies…
Descriptors: Generalization, Creativity, Meta Analysis, Higher Education
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kilic, Abdullah Faruk; Uysal, Ibrahim – International Journal of Assessment Tools in Education, 2022
Most researchers investigate the corrected item-total correlation of items when analyzing item discrimination in multi-dimensional structures under the Classical Test Theory, which might lead to underestimating item discrimination, thereby removing items from the test. Researchers might investigate the corrected item-total correlation with the…
Descriptors: Item Analysis, Correlation, Item Response Theory, Test Items
Yiqin Pan – ProQuest LLC, 2022
Item preknowledge refers to the phenomenon in which some examinees have access to live items before taking a test. It is one of the most common and significant concerns within the testing industry. Thus, various statistical methods have been proposed to detect item preknowledge in computerized linear or adaptive testing. However, the success of…
Descriptors: Artificial Intelligence, Prior Learning, Test Items, Algorithms
Pages: 1  |  ...  |  26  |  27  |  28  |  29  |  30  |  31  |  32  |  33  |  34  |  ...  |  636