Publication Date
| In 2026 | 0 |
| Since 2025 | 6 |
| Since 2022 (last 5 years) | 26 |
| Since 2017 (last 10 years) | 108 |
| Since 2007 (last 20 years) | 302 |
Descriptor
| Comparative Analysis | 792 |
| Test Reliability | 792 |
| Test Validity | 425 |
| Foreign Countries | 174 |
| Test Construction | 132 |
| Correlation | 119 |
| Statistical Analysis | 117 |
| Scores | 106 |
| Higher Education | 98 |
| Psychometrics | 91 |
| Test Items | 89 |
| More ▼ | |
Source
Author
| Reckase, Mark D. | 5 |
| Bashaw, W. L. | 3 |
| Bennett, Randy Elliot | 3 |
| Benson, Jeri | 3 |
| Crehan, Kevin D. | 3 |
| Ebel, Robert L. | 3 |
| Frisbie, David A. | 3 |
| Hakstian, A. Ralph | 3 |
| Henk, William A. | 3 |
| Weiss, David J. | 3 |
| Winke, Paula | 3 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 18 |
| Practitioners | 17 |
| Teachers | 9 |
| Administrators | 4 |
| Counselors | 2 |
| Policymakers | 2 |
| Parents | 1 |
| Support Staff | 1 |
Location
| United States | 21 |
| Turkey | 20 |
| Australia | 16 |
| China | 11 |
| United Kingdom (England) | 11 |
| Germany | 9 |
| Hong Kong | 9 |
| Iran | 9 |
| Taiwan | 9 |
| United Kingdom | 9 |
| Canada | 8 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019
This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…
Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring
Maghfiroh, Anissa; Kuswanto, Heru – International Journal of Instruction, 2022
This research aims to reveal the effectiveness of the use of Kofie GeBoL media in improving (1) vector representation ability and (2) critical thinking ability in physics instruction. It is a descriptive quantitative study with the quasi-experiment design. It was conducted in two stages: empirical try out and implementation of Kofie GeboL to see…
Descriptors: Physics, Instructional Effectiveness, Critical Thinking, Thinking Skills
Icht, Michal; Bergerzon-Bitton, Orly; Ben-David, Boaz M. – International Journal of Language & Communication Disorders, 2022
'Dysarthria' is a group of motor speech disorders resulting from a disturbance in neuromuscular control. Most individuals with dysarthria cope with communicative restrictions due to speech impairments and reduced intelligibility. Thus, language-sensitive measurements of intelligibility are important in dysarthria neurological assessment. The…
Descriptors: Speech Impairments, Articulation (Education), Psychomotor Skills, Intelligibility
Olsen, Jacob; Preston, Angela I.; Algozzine, Bob; Algozzine, Kate; Cusumano, Dale – Clearing House: A Journal of Educational Strategies, Issues and Ideas, 2018
Although it is widely agreed that there is no universally accepted definition for school climate, most professionals ground it in shared beliefs, values, and attitudes reflecting the quality and character of life in schools. In this article, we review and analyze measures accessible to school personnel charged with documenting and monitoring…
Descriptors: Educational Environment, Measures (Individuals), School Personnel, Test Format
Sawczuk, Thomas; Jones, Ben; Scantlebury, Sean; Weakley, Jonathan; Read, Dale; Costello, Nessan; Darrall-Jones, Joshua David; Stokes, Keith; Till, Kevin – Measurement in Physical Education and Exercise Science, 2018
This study aimed to evaluate the between-day reliability and usefulness of a fitness testing battery in a group of youth sport athletes. Fifty-nine youth sport athletes (age = 17.3 ± 0.7 years) undertook a fitness testing battery including the isometric mid-thigh pull, counter-movement jump, 5-40 m sprint splits, and the 5-0-5 change of direction…
Descriptors: Test Reliability, Comparative Analysis, Athletes, Team Sports
Fu, Yuanshu; Wen, Zhonglin; Wang, Yang – Educational and Psychological Measurement, 2018
The maximal reliability of a congeneric measure is achieved by weighting item scores to form the optimal linear combination as the total score; it is never lower than the composite reliability of the measure when measurement errors are uncorrelated. The statistical method that renders maximal reliability would also lead to maximal criterion…
Descriptors: Test Reliability, Test Validity, Comparative Analysis, Attitude Measures
Silber, Henning; Roßmann, Joss; Gummer, Tobias – International Journal of Social Research Methodology, 2018
In this article, we present the results of three question design experiments on inter-item correlations, which tested a grid design against a single-item design. The first and second experiments examined the inter-item correlations of a set with five and seven items, respectively, and the third experiment examined the impact of the question design…
Descriptors: Foreign Countries, Online Surveys, Experiments, Correlation
Schnoor, Birger; Hartig, Johannes; Klinger, Thorsten; Naumann, Alexander; Usanova, Irina – Language Testing, 2023
Research on assessing English as a foreign language (EFL) development has been growing recently. However, empirical evidence from longitudinal analyses based on substantial samples is still needed. In such settings, tests for measuring language development must meet high standards of test quality such as validity, reliability, and objectivity, as…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Longitudinal Studies
Xu, Jing; Jones, Edmund; Laxton, Victoria; Galaczi, Evelina – Assessment in Education: Principles, Policy & Practice, 2021
Recent advances in machine learning have made automated scoring of learner speech widespread, and yet validation research that provides support for applying automated scoring technology to assessment is still in its infancy. Both the educational measurement and language assessment communities have called for greater transparency in describing…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computer Software
Ozdemir, Adem; Koc, Yasemin; Gundogdu, Kerim – International Journal of Psycho-Educational Sciences, 2018
The aim of this research is to develop a scale that prospective science teachers in the Education Faculties compare themselves to their peers according to the "Science field Teacher and Professional Skills" courses. For this reason, 25 items related to Physics, Chemistry, Biology, Science Experiments and Professional Skills courses were…
Descriptors: Preservice Teachers, Science Teachers, Likert Scales, Test Validity
Chongo, Samri; Osman, Kamisah; Nayan, Nazrul Anuar – EURASIA Journal of Mathematics, Science and Technology Education, 2021
Computational thinking (CT) is one of the systematic tools in problem solving and widely accepted as an important skill in the 21st century. This study aimed to identify the effectiveness of the Chemistry Computational Thinking (CT-CHEM) Module on achievement in chemistry. This study also employed a quasi-experimental design with the participation…
Descriptors: Chemistry, Science Instruction, Thinking Skills, Achievement Tests
Galeoto, Giovanni; D'Elpidio, Giuliana; Alvaro, Rosaria; Zicari, Anna Maria; Valente, Donatella; Riccio, Marianna – International Association for Development of the Information Society, 2021
The Italian Disciplinary section of Test of Competences (TECO-D) project is an important longitudinal study used to analyze learning outcomes of ungraded students and to measure quality of the educational process. The aim of the present study was to evaluate the psychometric properties of the TECO-D in students enrolled in the Bachelor's Degree in…
Descriptors: Case Studies, Nursing Education, Psychometrics, Longitudinal Studies
Mandy, William; Clarke, Kiri; McKenner, Michele; Strydom, Andre; Crabtree, Jason; Lai, Meng-Chuan; Allison, Carrie; Baron-Cohen, Simon; Skuse, David – Journal of Autism and Developmental Disorders, 2018
We developed a brief, informant-report interview for assessing autism spectrum conditions (ASC) in adults, called the Developmental, Dimensional and Diagnostic Interview-Adult Version (3Di-Adult); and completed a preliminary evaluation. Informant reports were collected for participants with ASC (n = 39), a non-clinical comparison group (n = 29)…
Descriptors: Autism, Pervasive Developmental Disorders, Adults, Diagnostic Tests
McKie, Greg L.; Islam, Hashim; Townsend, Logan K.; Howe, Greg J.; Hazell, Tom J. – Measurement in Physical Education and Exercise Science, 2018
This study examined the validity and reliability of a 30-second running sprint test using two non-motorized treadmills compared to the established Wingate Anaerobic Test. Twenty-four participants completed three sessions in a randomized order on a: (1) manual mode treadmill (Woodway); (2) specialized interval training treadmill (HiTrainer); and…
Descriptors: Exercise, Physical Activities, Correlation, Exercise Physiology
Kelleher, Leila K.; Beach, Tyson A. C.; Frost, David M.; Johnson, Andrew M.; Dickey, James P. – Measurement in Physical Education and Exercise Science, 2018
The scoring scheme for the functional movement screen implicitly assumes that the factor structure is consistent, stable, and congruent across different populations. To determine if this is the case, we compared principal components analyses of three samples: a healthy, general population (n = 100), a group of varsity athletes (n = 101), and a…
Descriptors: Factor Structure, Test Reliability, Screening Tests, Motion

Peer reviewed
Direct link
