Publication Date
| In 2026 | 2 |
| Since 2025 | 454 |
| Since 2022 (last 5 years) | 1933 |
| Since 2017 (last 10 years) | 4505 |
| Since 2007 (last 20 years) | 6990 |
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 837 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 161 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 111 |
| Taiwan | 108 |
| Netherlands | 102 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Ryan, Joseph J.; Gontkovsky, Samuel T. – Journal of Psychoeducational Assessment, 2021
We analyzed data from the WASI-II manual to determine discrepancy score reliabilities of the Verbal Comprehension (VCI) and Perceptual Reasoning (PRI) indexes and the four subtests in the child and adult standardization samples. Reliabilities of the VCI-PRI discrepancy scores range from 0.78 to 0.86 for children and 0.82 to 0.89 for adults and…
Descriptors: Intelligence Tests, Test Reliability, Scores, Children
Foster, Robert C. – Educational and Psychological Measurement, 2021
This article presents some equivalent forms of the common Kuder-Richardson Formula 21 and 20 estimators for nondichotomous data belonging to certain other exponential families, such as Poisson count data, exponential data, or geometric counts of trials until failure. Using the generalized framework of Foster (2020), an equation for the reliability…
Descriptors: Test Reliability, Data, Computation, Mathematical Formulas
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2021
The population discrepancy between unstandardized and standardized reliability of homogeneous multicomponent measuring instruments is examined. Within a latent variable modeling framework, it is shown that the standardized reliability coefficient for unidimensional scales can be markedly higher than the corresponding unstandardized reliability…
Descriptors: Test Reliability, Computation, Measures (Individuals), Research Problems
Barth, Philipp; Stadtmann, Georg – Journal of Creative Behavior, 2021
The "consensual assessment technique" (CAT) is a reliable and valid method to measure (product) creativity and often considered "the" gold standard of creativity assessment. The reliability measure traditionally applied in CAT studies, inter-rater reliability, cannot capture time-sampling error, which is a particular relevant…
Descriptors: Creativity, Creativity Tests, Test Reliability, Interrater Reliability
Sean N. Weeks; Tyler L. Renshaw; Allysia A. Rainey; Aubrey Hiatt – Journal of Emotional and Behavioral Disorders, 2024
Internalizing and externalizing problems are common targets for school mental health screening. Prior research supports the interpretation of scores from the Youth Internalizing Problems Screener (YIPS) and the Youth Externalizing Problems Screener (YEPS), which were developed separately yet intended as companion measures. We extended previous…
Descriptors: Adolescents, Screening Tests, Behavior Problems, Mental Health
Orhan Gazi Yildirim; Nezahat Hamiden Karaca; Fatma Betül Senol – International Electronic Journal of Elementary Education, 2024
Self concept is an experiential formation gained as a result of certain experiences. The concept of self-concept has an interesting intersection with the psychological field of humour. The aim of the study is to examine the relationship between the humor styles and self-perceptions of primary school 4th grade students and to conduct the…
Descriptors: Foreign Countries, Self Concept, Humor, Elementary School Students
Vinay Kumar Yadav; Shakti Prasad – Measurement: Interdisciplinary Research and Perspectives, 2024
In sample survey analysis, accurate population mean estimation is an important task, but traditional approaches frequently ignore the intricacies of real-world data, leading to biased results. In order to handle uncertainties, indeterminacies, and ambiguity, this work presents an innovative approach based on neutrosophic statistics. We proposed…
Descriptors: Sampling, Statistical Bias, Predictor Variables, Predictive Measurement
Ananda Aprilia; Wipsar Sunu Brams Dwandaru – Science Education International, 2024
This study focused on developing physics test instruments for senior high school students on the topics of temperature and heat. The study aimed to determine (i) the quality of the test instrument content, (ii) the feasibility of the test instrument, and (iii) students' graphic representation abilities on "Temperature and Heat" topics.…
Descriptors: Foreign Countries, High School Seniors, Secondary School Science, Physics
Sarah K. Anderson; Sevda Ozsezer-Kurnuc; Pinky Jain – British Journal of Educational Studies, 2024
This paper reports on a systematic literature review to understand better methodologies and data collection tools used to judge student teaching effectiveness, ways in which validity and reliability are considered, the processes involved in assessing new teaching effectiveness within teacher education programmes, and how evaluation and results are…
Descriptors: Literature Reviews, Content Analysis, Student Teachers, Teacher Effectiveness
Samuel J. Howarth; Erinn McCreath Frangakis; Steven Hirsch; Diana De Carvalho – Measurement in Physical Education and Exercise Science, 2024
The flexion relaxation ratio (FRR) of the lumbar extensor muscles is often assessed in experimental and clinical studies. This study evaluated within- and between-session test-retest reliability and measurement error for different FRR formulations. Participants completed two identical data collection sessions 1-week apart. Spine flexion and…
Descriptors: Exercise Physiology, Human Body, Pretests Posttests, Error of Measurement
Xijuan Zhang; Hao Wu – Structural Equation Modeling: A Multidisciplinary Journal, 2024
A full structural equation model (SEM) typically consists of both a measurement model (describing relationships between latent variables and observed scale items) and a structural model (describing relationships among latent variables). However, often researchers are primarily interested in testing hypotheses related to the structural model while…
Descriptors: Structural Equation Models, Goodness of Fit, Robustness (Statistics), Factor Structure
Mustafa Ilhan; Nese Güler; Gülsen Tasdelen Teker; Ömer Ergenekon – International Journal of Assessment Tools in Education, 2024
This study aimed to examine the effects of reverse items created with different strategies on psychometric properties and respondents' scale scores. To this end, three versions of a 10-item scale in the research were developed: 10 positive items were integrated in the first form (Form-P) and five positive and five reverse items in the other two…
Descriptors: Test Items, Psychometrics, Scores, Measures (Individuals)
Lientje Maas; Matthew J. Madison; Matthieu J. S. Brinkhuis – Grantee Submission, 2024
Diagnostic classification models (DCMs) are psychometric models that yield probabilistic classifications of respondents according to a set of discrete latent variables. The current study examines the recently introduced one-parameter log-linear cognitive diagnosis model (1-PLCDM), which has increased interpretability compared with general DCMs due…
Descriptors: Clinical Diagnosis, Classification, Models, Psychometrics
Madeline A. Schellman; Matthew J. Madison – Grantee Submission, 2024
Diagnostic classification models (DCMs) have grown in popularity as stakeholders increasingly desire actionable information related to students' skill competencies. Longitudinal DCMs offer a psychometric framework for providing estimates of students' proficiency status transitions over time. For both cross-sectional and longitudinal DCMs, it is…
Descriptors: Diagnostic Tests, Classification, Models, Psychometrics
Caspar J. Van Lissa; Eli-Boaz Clapper; Rebecca Kuiper – Research Synthesis Methods, 2024
The product Bayes factor (PBF) synthesizes evidence for an informative hypothesis across heterogeneous replication studies. It can be used when fixed- or random effects meta-analysis fall short. For example, when effect sizes are incomparable and cannot be pooled, or when studies diverge significantly in the populations, study designs, and…
Descriptors: Hypothesis Testing, Evaluation Methods, Replication (Evaluation), Sample Size

