Hogan, Thomas; DeStefano, Marissa; Gilby, Caitlin; Kosman, Dana; Peri, Joshua – Applied Measurement in Education, 2021
Buros' "Mental Measurements Yearbook (MMY)" has provided professional reviews of commercially published psychological and educational tests for over 80 years. It serves as a kind of conscience for the testing industry. For a random sample of 50 entries in the "19th MMY" (a total of 100 separate reviews) this study determined…
Descriptors: Test Reviews, Interrater Reliability, Psychological Testing, Educational Testing
Kazimi, Parviz Firudin Oqlu – Journal of Practical Studies in Education, 2021
The reliability of information in the global information space is one of the most important problems of globalization. The credibility of various information resources is currently being studied and considered in different ways. In some cases, the problem of the reliability of information can be assessed as harmful and dangerous. This article,…
Descriptors: Information Sources, Reliability, Credibility, Classification
Lambert, Richard G.; Holcomb, T. Scott; Bottoms, Bryndle L. – Center for Educational Measurement and Evaluation, 2021
The validity of the Kappa coefficient of chance-corrected agreement has been questioned when the prevalence of specific rating scale categories is low and agreement between raters is high. The researchers proposed the Lambda Coefficient of Rater-Mediated Agreement as an alternative to Kappa to address these concerns. Lambda corrects for chance…
Descriptors: Interrater Reliability, Teacher Evaluation, Test Validity, Evaluation Methods
McLeod, Bryce D.; Sutherland, Kevin S.; Broda, Michael; Granger, Kristen L.; Martinez, Ruben G.; Conroy, Maureen A.; Snyder, Patricia A.; Southam-Gerow, Michael A. – Prevention Science, 2022
Though treatment integrity measurement is important for research intended to promote social and behavioral outcomes of children at risk for emotional and behavioral disorders (EBDs) in early childhood settings, measurement gaps exist in the field. This paper reports on the development and preliminary psychometric assessment of the treatment…
Descriptors: Psychometrics, Measures (Individuals), Fidelity, Integrity
Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Grantee Submission, 2022
Purpose: Our aim was to evaluate the psychometric properties of the online administered format of the Test of Narrative Language--Second Edition (TNL-2; Gillam & Pearson, 2017), given the importance of assessing children's narrative ability and considerable absence of psychometric studies of spoken language assessments administered online.…
Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments
Lee, Morgan P.; Croteau, Ethan; Gurung, Ashish; Botelho, Anthony F.; Heffernan, Neil T. – International Educational Data Mining Society, 2023
The use of Bayesian Knowledge Tracing (BKT) models in predicting student learning and mastery, especially in mathematics, is a well-established and proven approach in learning analytics. In this work, we report on our analysis examining the generalizability of BKT models across academic years attributed to "detector rot." We compare the…
Descriptors: Bayesian Statistics, Models, Generalizability Theory, Longitudinal Studies
Halvorsen, Marianne Berg; Helverschou, Sissel Berge; Axelsdottir, Brynhildur; Brøndbo, Per Håkan; Martinussen, Monica – Journal of Autism and Developmental Disorders, 2023
There is a need for more knowledge of valid and standardized measures of mental health problems among children and adolescents with intellectual disability (ID). In this study, we systematically reviewed and evaluated the psychometric properties of instruments used to assess general mental health problems in this population. Following PRISMA…
Descriptors: Measures (Individuals), Clinical Diagnosis, Mental Health, Mental Disorders
Stark, Kristabel; Bettini, Elizabeth; Cumming, Michelle; O'Brien, Kristen Merrill; Brunsting, Nelson; Huggins-Manley, Corinne; Binkert, Gino; Shaheen, Tashnuva – Remedial and Special Education, 2023
Special education teachers' (SETs) working conditions play a crucial role in shaping the size, quality, and effectiveness of the U.S. SET workforce and thereby shape the quality of instruction provided to students with disabilities. Valid measures of SETs' working conditions are essential for conducting robust research on how to improve working…
Descriptors: Special Education, Teaching Conditions, Special Education Teachers, Students with Disabilities
Budak, Zeynep; Isikhan, Selen Yilmaz; Batuk, Merve Ozbal – Language, Speech, and Hearing Services in Schools, 2023
Purpose: The aim of this study was to translate the versions of the Hearing Environments and Reflection on Quality of Life (HEAR-QL) into Turkish and investigate the validity and reliability of the Turkish 26-item HEAR-QL (HEAR-QL-26) for children and Turkish 28-item HEAR-QL (HEAR-QL-28) for adolescents. Method: The protocol included translation…
Descriptors: Children, Adolescents, Hearing Impairments, Control Groups
Nguyen-Duc, Thinh; Phuong, Tam T.; Le, Thuy T. B.; Nguyen, Lam T. T. – Learning Organization, 2023
Purpose: The main purpose of this study was to validate the Dimensions of Learning Organization Questionnaire (DLOQ) in a Vietnamese context. Using the DLOQ as a research tool, this study also investigated the impact of demographic features on participants' perceptions of learning organizations (LOs). Design/methodology/approach: Data were…
Descriptors: Foreign Countries, Organizational Culture, Organizational Learning, Questionnaires
Johnson, David A.; Stone, Ashlyn; Marsh, Sarah – Measurement and Evaluation in Counseling and Development, 2023
We evaluated structural, construct, and concurrent validity evidence for State-Interpersonal Reactivity Index scores among 208 telemental health counselors. Confirmatory factor analysis results supported a three-factor model. Partial correlation analyses yielded evidence for construct and concurrent validity evidence for scores. We discuss…
Descriptors: Validity, Scores, Counselors, Health Services
Spudic, Darjan; Vodicar, Janez; Hadžic, Vedran – Measurement in Physical Education and Exercise Science, 2023
The purpose was to evaluate the intra- and inter-session reliability of hip isometric strength assessment using a frame-stabilized dynamometer (FSD) and to compare the results in the hip adduction (ADD) and abduction (ABD) strength assessment using handheld dynamometer (HHD). Twenty participants (24.2 [2.6] years, 69.3 [10.0] kg) underwent testing…
Descriptors: Muscular Strength, Human Body, Measurement Equipment, Young Adults
Practical Considerations in Choosing an Anchor Test Form for Equating under the Random Groups Design
Cui, Zhongmin; He, Yong – Measurement: Interdisciplinary Research and Perspectives, 2023
Careful considerations are necessary when there is a need to choose an anchor test form from a list of old test forms for equating under the random groups design. The choice of the anchor form potentially affects the accuracy of equated scores on new test forms. Few guidelines, however, can be found in the literature on choosing the anchor form.…
Descriptors: Test Format, Equated Scores, Best Practices, Test Construction
Vignoli, Matteo; Dosi, Clio; Balboni, Bernardo – Studies in Higher Education, 2023
Expectations from Higher Education institutions are increasing towards the education of professionals able to face complex societal issues. In this context, traditional thinking is losing ground, and scholars agree on the importance of promoting a Design Thinking (DT) Mindset in educational settings to address wicked problems. However, an…
Descriptors: Design, Thinking Skills, Test Construction, Test Validity
Siddiqui, Nadia; Gorard, Stephen – International Journal of Research & Method in Education, 2023
Robust indicators are important for identifying disadvantaged pupils in education, and for ensuring that they are rightly receiving relevant state-funded assistance. This paper compares the quality and completeness of data from England on student eligibility for free school meals (FSM) based on an administrative census, with more all-encompassing…
Descriptors: Foreign Countries, Family Income, Outcomes of Education, Reliability

