Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 6 |
| Since 2017 (last 10 years) | 10 |
| Since 2007 (last 20 years) | 15 |
Descriptor
| Generalization | 19 |
| Scores | 19 |
| Test Reliability | 19 |
| Psychometrics | 8 |
| Measures (Individuals) | 5 |
| Meta Analysis | 5 |
| Test Validity | 5 |
| Measurement Techniques | 4 |
| Decision Making | 3 |
| Foreign Countries | 3 |
| Language Tests | 3 |
| More ▼ | |
Source
Author
| Abdulkadir Haktanir | 1 |
| Alejandro Sandoval-Lentisco | 1 |
| Alejandro Veas | 1 |
| Aloisi, Cesare | 1 |
| Aras, Yahyahan | 1 |
| Borsman, Denny | 1 |
| Byeolbee Um | 1 |
| Callaghan, A. | 1 |
| Caruso, John C. | 1 |
| Chin, Han Xin | 1 |
| Cotton, Sue M. | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 17 |
| Reports - Research | 11 |
| Reports - Evaluative | 6 |
| Information Analyses | 3 |
| Speeches/Meeting Papers | 3 |
Education Level
| Higher Education | 6 |
| Postsecondary Education | 5 |
Audience
Location
| Nigeria | 1 |
| United Kingdom (Reading) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| California Critical Thinking… | 1 |
| Medical College Admission Test | 1 |
| Test of English as a Foreign… | 1 |
| United States Medical… | 1 |
What Works Clearinghouse Rating
José Antonio López-López; Rubén López-Nicolás; Alejandro Sandoval-Lentisco; Julio Sánchez-Meca; Alejandro Veas – Journal of Psychoeducational Assessment, 2025
The School Attitude Assessment Survey-Revised (SAAS-R) is a popular scale for assessing attitudinal and motivational aspects of students' academic achievement. However, evidence on key psychometric properties of the SAAS-R such as reliability remains limited. We conducted a reliability generalization study of the SAAS-R using meta-analytic…
Descriptors: Attitude Measures, Student Attitudes, School Attitudes, Psychometrics
Orhan, Ali – Journal of Psychoeducational Assessment, 2022
The aims of this reliability generalization study were to provide the overall alpha values of the California critical thinking disposition inventory (CCTDI) total score and subscales scores and investigate the characteristics of the studies that may be associated with the variability in the reliability values of the CCTDI total score and subscales…
Descriptors: Critical Thinking, Measures (Individuals), Test Reliability, Generalization
Sojeong Nam; Byeolbee Um; Jeongwoon Jeong; Monique Rodriguez; David Lardier – Measurement and Evaluation in Counseling and Development, 2024
This study aimed to provide meta-analytic reliability information of the Columbia-Suicide Severity Rating Scale (C-SSRS). We implemented systematic search procedures to 35 eligible studies (N = 23,247; Mage = 26.74 years) that reported reliability estimates. The synthesized average values of Cronbach's alpha were 0.88 (95% CI [0.85, 0.92]) for the…
Descriptors: Scores, Test Reliability, Rating Scales, Suicide
Abdulkadir Haktanir; M. Furkan Kurnaz; Zeynep Simsir Gökalp – Measurement and Evaluation in Counseling and Development, 2024
Objective: Brief Self-Control Scale (BSCS) is the most widely used instrument to assess self-control. The purpose of this reliability generalization meta-analysis was to examine the degree to which consistency reliability coefficients for scores on the BSCS generalize across age groups and languages. Method: We included studies using the BSCS and…
Descriptors: Self Control, Measures (Individuals), Meta Analysis, Test Reliability
Lenz, A. Stephen; Ho, Chia-Min; Rocha, Lauren; Aras, Yahyahan – Measurement and Evaluation in Counseling and Development, 2021
This study examined the degree that reliability coefficients for scores on the PTGI generalize across participant and study characteristics. Meta-analytic procedures resulted in observed and predicted mean alpha coefficients ranging from acceptable to excellent and appeared to be largely unrelated to the participant characteristics included in our…
Descriptors: Generalization, Test Reliability, Scores, Measures (Individuals)
Sen, Sedat – Creativity Research Journal, 2022
The purpose of this study was to estimate the overall reliability values for the scores produced by Runco Ideational Behavior Scale (RIBS) and explore the variability of RIBS score reliability across studies. To achieve this, a reliability generalization meta-analysis was carried out using the 86 Cronbach's alpha estimates obtained from 77 studies…
Descriptors: Generalization, Creativity, Meta Analysis, Higher Education
Liao, Ray J. T. – Language Testing, 2023
Among the variety of selected response formats used in L2 reading assessment, multiple-choice (MC) is the most commonly adopted, primarily due to its efficiency and objectiveness. Given the impact of assessment results on teaching and learning, it is necessary to investigate the degree to which the MC format reliably measures learners' L2 reading…
Descriptors: Reading Tests, Language Tests, Second Language Learning, Second Language Instruction
Aloisi, Cesare; Callaghan, A. – Higher Education Pedagogies, 2018
The University of Reading Learning Gain project is a three-year longitudinal project to test and evaluate a range of available methodologies and to draw conclusions on what might be the right combination of instruments for the measurement of Learning Gain in higher education. This paper analyses the validity of a measure of critical thinking…
Descriptors: Foreign Countries, Cognitive Tests, Critical Thinking, Thinking Skills
Tan, Kevin; Chin, Han Xin; Yau, Christine W. L.; Lim, Erle C. H.; Samarasekera, Dujeepa; Ponnamperuma, Gominda; Tan, Nigel C. K. – Anatomical Sciences Education, 2018
Neuroanatomical localization (NL) is a key skill in neurology, but learners often have difficulty with it. This study aims to evaluate a concise NL tool (NLT) developed to help teach and learn NL. To evaluate the NLT, an extended-matching questions (EMQ) test to assess NL was designed and validated. The EMQ was validated with fourth-year medical…
Descriptors: Neurology, Anatomy, Teaching Methods, Test Construction
Yoder, Zachariah – Journal of Multilingual and Multicultural Development, 2017
The recorded text test (RTT) is commonly used to test dialect intelligibility, often to inform language development decisions. More than 25 papers using the RTT method were published on www.sil.org/silesr from January 2009 to March 2013. As introduced by Casad [1974. "Dialect Intelligibility Testing." Summer Institute of Linguistics…
Descriptors: Scores, Language Minorities, Language Variation, Mutual Intelligibility
Lee, Ming; Wimmers, Paul F. – Advances in Health Sciences Education, 2016
Although problem-based learning (PBL) has been widely used in medical schools, few studies have attended to the assessment of PBL processes using validated instruments. This study examined reliability and validity for an instrument assessing PBL performance in four domains: Problem Solving, Use of Information, Group Process, and Professionalism.…
Descriptors: Problem Based Learning, Teaching Methods, Medical Education, Physicians
Sawaki, Yasuyo; Sinharay, Sandip – ETS Research Report Series, 2013
This study investigates the value of reporting the reading, listening, speaking, and writing section scores for the "TOEFL iBT"® test, focusing on 4 related aspects of the psychometric quality of the TOEFL iBT section scores: reliability of the section scores, dimensionality of the test, presence of distinct score profiles, and the…
Descriptors: Scores, Computer Assisted Testing, Factor Analysis, Correlation
Warne, Russell – Online Submission, 2008
Literature shows that most researchers are unaware of some of the characteristics of reliability. This paper clarifies some misconceptions by describing the procedures, benefits, and limitations of reliability generalization while using it to illustrate the nature of score reliability. Reliability generalization (RG) is a meta-analytic method…
Descriptors: Test Reliability, Generalization, Meta Analysis, Scores
Borsman, Denny; Romeijn, Jan-Willem; Wicherts, Jelte M. – Psychological Methods, 2008
This article shows that measurement invariance (defined in terms of an invariant measurement model in different groups) is generally inconsistent with selection invariance (defined in terms of equal sensitivity and specificity across groups). In particular, when a unidimensional measurement instrument is used and group differences are present in…
Descriptors: Test Items, Minority Groups, Measurement, Scores
Rexrode, Kathryn R.; Petersen, Suni; O'Toole, Siobhan – Educational and Psychological Measurement, 2008
For more than 20 years, the Ways of Coping Scale (WOCS) has been used extensively to measure coping. Yet beyond the original psychometric data, few studies have reexamined its properties utilizing the enormous body of research generated on the WOCS. Reliability has been assumed to be consistent as an attribute of the test. This study used…
Descriptors: Evaluation Research, Test Reliability, Coping, Measures (Individuals)
Previous Page | Next Page »
Pages: 1 | 2
Peer reviewed
Direct link
