Nastasi, Bonnie K.; Hitchcock, John H.; Gutierrez, Raquel; Oshrin, Stephanie – Journal of Educational and Psychological Consultation, 2022
The purpose of this article is to provide researchers with a framework for promoting high quality mixed methods school-based consultation research. Furthermore, this article is intended for researchers and practitioners interested in evaluating their practice. To meet these objectives, we address quality criteria for quantitative (validity) and…
Descriptors: Mixed Methods Research, Consultation Programs, Evaluation Criteria, Statistical Analysis
Bárbara Mariana Gutiérrez-Pérez; Antonio Víctor Martín-García – Journal of Educators Online, 2023
Quality assessment systems have recently expanded, serving as indicators for the assessment and ranking of higher education institutions worldwide. The growing development of new educational methodologies, like Blended Learning, requires the design and validation of tools that allow for their assessment. The objective of this article is to…
Descriptors: Psychometrics, Validity, Educational Quality, Blended Learning
Marc Brysbaert – Cognitive Research: Principles and Implications, 2024
Experimental psychology is witnessing an increase in research on individual differences, which requires the development of new tasks that can reliably assess variations among participants. To do this, cognitive researchers need statistical methods that many researchers have not learned during their training. The lack of expertise can pose…
Descriptors: Experimental Psychology, Individual Differences, Statistical Analysis, Task Analysis
Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025
Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…
Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques
Haberman, Shelby J. – ETS Research Report Series, 2019
Cross-validation is a common statistical procedure applied to problems that are otherwise computationally intractable. It is often employed to assess the effectiveness of prediction procedures. In this report, cross-validation is discussed in terms of "U"-statistics. This approach permits consideration of the statistical properties of…
Descriptors: Statistical Analysis, Generalization, Prediction, Computation
Tawil, Muh.; Said, Muhammad Amin; Suryansari, Kemala – International Journal of Education and Practice, 2023
This study aimed to examine authentic models of science assessment in assessing the competence of senior high school students who met the criteria of validity, practicality, and effectiveness. The methodology applied was an evaluation and developmental research. Research was conducted in senior high school in accordance with the needs of the…
Descriptors: Foreign Countries, Performance Based Assessment, Student Evaluation, Science Achievement
Westbrook, Charles J.; Davis, Don E.; McElroy, Stacey E.; Brubaker, Kacy; Choe, Elise; Karaga, Sara; Dooley, Matt; O'Bryant, Brittany L.; Van Tongeren, Daryl R.; Hook, Joshua – Measurement and Evaluation in Counseling and Development, 2018
We develop the Trait Sources of Spirituality Scale (TSSS), which assesses experiences of closeness to the sacred, within and outside a religious tradition. After using factor analysis to finalize the scale, we examine evidence of construct validity, including latent profile analysis that reveals 5 patterns of how spirituality is experienced.
Descriptors: Measures (Individuals), Religious Factors, Factor Analysis, Test Construction
Saito, Daisuke; Yajima, Risei; Washizaki, Hironori; Fukazawa, Yoshiaki – Education Sciences, 2021
In evaluating the learning achievement of programming-thinking skills, the method of using a rubric that describes evaluation items and evaluation stages is widely employed. However, few studies have evaluated the reliability, validity, and consistency of the rubrics themselves. In this study, we introduced a statistical method for evaluating the…
Descriptors: Scoring Rubrics, Computer Science Education, Programming, Reliability
de Jong, Peter F. – Journal of Psychoeducational Assessment, 2023
The Wechsler Intelligence Scale for Children--Fifth Edition (WISC-V; Wechsler, 2014) provides a general intelligence score, representing "g," and five index scores, reflecting underlying broad factors. Within person differences between the overall performance across subtests and index scores, denoted as index difference scores, are often…
Descriptors: Test Validity, Children, Intelligence Tests, Indo European Languages
Benton, Tom – Research Matters, 2020
This article reviews the evidence on the extent to which experts' perceptions of item difficulties, captured using comparative judgement, can predict empirical item difficulties. This evidence is drawn from existing published studies on this topic and also from statistical analysis of data held by Cambridge Assessment. Having reviewed the…
Descriptors: Test Items, Difficulty Level, Expertise, Comparative Analysis
Nájera, Pablo; Sorrel, Miguel A.; Abad, Francisco José – Educational and Psychological Measurement, 2019
Cognitive diagnosis models (CDMs) are latent class multidimensional statistical models that help classify people accurately by using a set of discrete latent variables, commonly referred to as attributes. These models require a Q-matrix that indicates the attributes involved in each item. A potential problem is that the Q-matrix construction…
Descriptors: Matrices, Statistical Analysis, Models, Classification
Artvinli, Eyup; Demir, Zulfiye Melis – Journal of Education in Science, Environment and Health, 2018
The aim of this research is to develop an instrument that measures environmental attitudes of third grade students. The study was completed in six stages: creating scale items, content validity study, item total and remaining item correlation study, determining item discrimination, determining construct validity study and examining the internal…
Descriptors: Elementary School Students, Grade 3, Content Validity, Correlation
Sengül Avsar, Asiye – Measurement: Interdisciplinary Research and Perspectives, 2020
In order to reach valid and reliable test scores, various test theories have been developed, and one of them is nonparametric item response theory (NIRT). Mokken Models are the most widely known NIRT models which are useful for small samples and short tests. Mokken Package is useful for Mokken Scale Analysis. An important issue about validity is…
Descriptors: Response Style (Tests), Nonparametric Statistics, Item Response Theory, Test Validity
Krenzke, Tom; Mohadjer, Leyla; Li, Jianzhu; Erciulescu, Andreea; Fay, Robert; Ren, Weijia; Van de Kerckhove, Wendy; Li, Lin; Rao, J. N. K. – National Center for Education Statistics, 2020
The Program for the International Assessment of Adult Competencies (PIAAC) is a multicycle survey of adult skills and competencies sponsored by the Organization for Economic Cooperation and Development (OECD). The survey examines a range of basic skills in the information age and assesses these adult skills consistently across participating…
Descriptors: Adults, Surveys, Statistical Analysis, Computation
Karadavut, Tugba – Applied Measurement in Education, 2021
Mixture IRT models address the heterogeneity in a population by extracting latent classes and allowing item parameters to vary between latent classes. Once the latent classes are extracted, they need to be further examined to be characterized. Some approaches have been adopted in the literature for this purpose. These approaches examine either the…
Descriptors: Item Response Theory, Models, Test Items, Maximum Likelihood Statistics

