Publication Date
| In 2026 | 1 |
| Since 2025 | 166 |
| Since 2022 (last 5 years) | 1019 |
| Since 2017 (last 10 years) | 2334 |
| Since 2007 (last 20 years) | 6520 |
Descriptor
| Reliability | 9759 |
| Validity | 3866 |
| Foreign Countries | 2823 |
| Measures (Individuals) | 1892 |
| Correlation | 1522 |
| Factor Analysis | 1460 |
| Statistical Analysis | 1278 |
| Questionnaires | 1084 |
| Scores | 1064 |
| Student Attitudes | 1034 |
| Psychometrics | 979 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 181 |
| Practitioners | 101 |
| Teachers | 61 |
| Administrators | 42 |
| Policymakers | 33 |
| Students | 21 |
| Counselors | 10 |
| Media Staff | 5 |
| Community | 1 |
| Parents | 1 |
| Support Staff | 1 |
| More ▼ | |
Location
| Turkey | 454 |
| Australia | 155 |
| Canada | 144 |
| China | 127 |
| United States | 127 |
| Taiwan | 107 |
| United Kingdom | 100 |
| Nigeria | 98 |
| California | 95 |
| Netherlands | 91 |
| Indonesia | 86 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 2 |
Laura Scholes; Sarah McDonald; Garth Stahl; Barbara Comber – British Educational Research Journal, 2024
Sourcing information related to socio-scientific issues requires sophisticated literacies to read and evaluate conflicting accounts often signified by disagreement among experts, multiple solutions or misinformation. Much of the previous work exploring how young people approach conflicting information has tended to focus on students in the…
Descriptors: Middle School Students, Information Sources, Internet, Search Strategies
Terra Blevins – ProQuest LLC, 2024
While large language models (LLMs) continue to grow in scale and gain new zero-shot capabilities, their performance for languages beyond English increasingly lags behind. This gap is due to the "curse of multilinguality," where multilingual language models perform worse on individual languages than a monolingual model trained on that…
Descriptors: Multilingualism, Computational Linguistics, Second Languages, Reliability
Miriam C. Boesch; M. Alexandra Da Fonte; Melissa J. Cavagnini; Kaitlyn R. Shaw; Keren E. Deneny; Margaret F. Davis – Journal of Special Education Technology, 2024
Students with complex communication needs have increasingly been using non-dedicated communication systems, such as mobile devices, to support their communication needs. This in turn, has led to an increased used of augmentative and alternative communication apps. The main challenge currently faced is the lack of empirically validated apps and…
Descriptors: Computer Oriented Programs, Evaluation Methods, Augmentative and Alternative Communication, Communication Disorders
Zirou Lin; Hanbing Yan; Li Zhao – Journal of Computer Assisted Learning, 2024
Background: Peer assessment has played an important role in large-scale online learning, as it helps promote the effectiveness of learners' online learning. However, with the emergence of numerical grades and textual feedback generated by peers, it is necessary to detect the reliability of the large amount of peer assessment data, and then develop…
Descriptors: Peer Evaluation, Automation, Grading, Models
Amal Abdullah Alibrahim – South African Journal of Education, 2024
After ChatGPT was released late in 2022, many arguments about its accuracy and use in education arose. In this article, I seek to provide evidence of the accuracy and validity of ChatGPT's responses to users' queries in education by applying a systematic review methodology to analyse publications in specific databases following PRISMA guidelines…
Descriptors: Artificial Intelligence, Technology Uses in Education, Reliability, Natural Language Processing
Ann C. Jolly; Kristen D. Beach; Heather H. Aiken; Steven J. Amendum – Grantee Submission, 2024
The field of education relies heavily on instructional coaches to build teacher capacity in the implementation of evidence-based practices (EBPs). Although observation tools are commonly used to measure the fidelity of implementation by teachers, fewer tools are available to identify specific coaching behaviors used during in situ coaching…
Descriptors: Coaching (Performance), Observation, Research Tools, Reliability
Ann C. Jolly; Kristen D. Beach; Heather H. Aiken; Steven J. Amendum – Journal of Educational and Psychological Consultation, 2024
The field of education relies heavily on instructional coaches to build teacher capacity in the implementation of evidence-based practices (EBPs). Although observation tools are commonly used to measure the fidelity of implementation by teachers, fewer tools are available to identify specific coaching behaviors used during in situ coaching…
Descriptors: Coaching (Performance), Observation, Research Tools, Reliability
Siqi Huang – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023
The goal of this paper is twofold. First, the paper clarifies and elaborates on an important theoretical construct called orientation with respect to understanding in mathematics, which denotes the degree to which students exhibit an inclination towards and demonstrate an earnest concern for understanding in mathematical learning. Second, the…
Descriptors: Mathematics Instruction, Teaching Methods, Problem Solving, Reliability
Raykov, Tenko; Anthony, James C.; Menold, Natalja – Educational and Psychological Measurement, 2023
The population relationship between coefficient alpha and scale reliability is studied in the widely used setting of unidimensional multicomponent measuring instruments. It is demonstrated that for any set of component loadings on the common factor, regardless of the extent of their inequality, the discrepancy between alpha and reliability can be…
Descriptors: Correlation, Evaluation Research, Reliability, Measurement Techniques
Tsangaridou, Niki; Charalambous, Charalambos Y. – Quest, 2023
Focusing on systematic observation, one of the most potent methods of studying teaching quality, represents one of the numerous contributions of Daryl Siedentop to the profession. While he had a clear focus on issues of validity and reliability concerning systematic observation, over the past decades, attention to such issues appears to have…
Descriptors: Physical Education Teachers, Observation, Validity, Reliability
Poulsen, Mads; Juul, Holger; Elbro, Carsten – Annals of Dyslexia, 2023
Different definitions and tests of dyslexia can cause unfairness and make life difficult for people with dyslexia as well as for the professionals. In 2012, the Danish government decided to support the fight against dyslexia. The government issued a public tender for the development of "a standardized, electronically administered test of…
Descriptors: Dyslexia, National Competency Tests, Foreign Countries, Test Construction
Moore, C. Missy; Crawford, Carey C.; Tertichny, Alissa – Measurement and Evaluation in Counseling and Development, 2023
We examined dimensionality and temporal stability of the Interpersonal Stress Scale-Counselor (ISS-C) scores in a sample of professional counselors (n = 518). Confirmatory factor analyses provided support for a four-factor model previously identified through exploratory factor analysis and a bifactor model. Using a randomized test-retest, temporal…
Descriptors: Counselors, Interpersonal Relationship, Stress Variables, Measures (Individuals)
Bouwer, Renske; Koster, Monica; van den Bergh, Huub – Assessment in Education: Principles, Policy & Practice, 2023
Assessing students' writing performance is essential to adequately monitor and promote individual writing development, but it is also a challenge. The present research investigates a benchmark rating procedure for assessing texts written by upper-elementary students. In two studies we examined whether a benchmark rating procedure (1) leads to…
Descriptors: Benchmarking, Writing Evaluation, Evaluation Methods, Elementary School Students
Tine S. Prøitz – Teaching in Higher Education, 2023
Drawing on the concepts of consistency, this study contributes to the discussion of study programme plans and the links between curriculum elements. The main argument is that a universal requirement of consistency is taken for granted in study programme planning, even though critics have noted a need for closer scrutiny and debate. The literature…
Descriptors: Curriculum Development, Reliability, College Curriculum, Alignment (Education)
Matthew J. Madison; Seungwon Chung; Junok Kim; Laine P. Bradshaw – Grantee Submission, 2023
Recent developments have enabled the modeling of longitudinal assessment data in a diagnostic classification model (DCM) framework. These longitudinal DCMs were developed to provide measures of student growth on a discrete scale in the form of attribute mastery transitions, thereby supporting categorical and criterion-referenced interpretations of…
Descriptors: Models, Cognitive Measurement, Diagnostic Tests, Classification

Peer reviewed
Direct link
