Publication Date
| In 2026 | 18 |
| Since 2025 | 2375 |
| Since 2022 (last 5 years) | 12890 |
| Since 2017 (last 10 years) | 34015 |
| Since 2007 (last 20 years) | 68506 |
Descriptor
| Foreign Countries | 30599 |
| Test Validity | 21771 |
| Scores | 18272 |
| Academic Achievement | 16940 |
| Test Construction | 16772 |
| Test Reliability | 15043 |
| Achievement Tests | 14867 |
| Standardized Tests | 14727 |
| Comparative Analysis | 14431 |
| Elementary Secondary Education | 13048 |
| Language Tests | 12555 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5034 |
| Teachers | 3394 |
| Researchers | 2630 |
| Policymakers | 1232 |
| Administrators | 979 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2827 |
| Australia | 2432 |
| Canada | 2271 |
| California | 1857 |
| United States | 1728 |
| Texas | 1616 |
| China | 1580 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1205 |
| Germany | 1123 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Kaya, Fatih – Education Quarterly Reviews, 2021
The aim of this study was to develop a valid and reliable measurement tool in order to determine the democracy levels of teacher candidates. During the scale development process in the research, the validity and reliability studies were conducted through three independent study groups. The first study group consisted of 627 students studying at…
Descriptors: Democracy, Measures (Individuals), Preservice Teachers, Test Validity
Oh, Seungbin; Shillingford-Butler, Ann – Measurement and Evaluation in Counseling and Development, 2021
The authors present the development and examination of the "Client Assessment of Multicultural Competent Behavior" (CAMCB) scores. The CAMCB was designed to measure therapists' multicultural competent behaviors within the context of therapeutic process, from clients' perspective. In this article, three-phases of the study are presented…
Descriptors: Counselor Evaluation, Test Construction, Cultural Awareness, Test Validity
Tunc, Emine Burcu; Parlak, Simel; Uluman, Muge; Eryigit, Derya – International Journal of Assessment Tools in Education, 2021
The aim of this research is to develop Hostility in Pandemic Scale (HPS) for Turkey Population to determine the hostility levels of individuals, which is a factor affecting the mental well-being of the society during the pandemic. The study group consists of 855 individuals between the ages of 18-65 from different genders, and have experienced the…
Descriptors: Psychological Patterns, Pandemics, COVID-19, Test Construction
Schulte, Niklas; Holling, Heinz; Bürkner, Paul-Christian – Educational and Psychological Measurement, 2021
Forced-choice questionnaires can prevent faking and other response biases typically associated with rating scales. However, the derived trait scores are often unreliable and ipsative, making interindividual comparisons in high-stakes situations impossible. Several studies suggest that these problems vanish if the number of measured traits is high.…
Descriptors: Questionnaires, Measurement Techniques, Test Format, Scoring
Markelz, Andrew M.; Riden, Benjamin S.; Zoder-Martell, Kimberly A.; Miller, Joseph E.; Bolinger, Sarah J. – Journal of Positive Behavior Interventions, 2021
Supported by decades of research on praise and its effect on student behaviors, we developed the Behavior-Specific Praise--Observation Tool (BSP-OT) to measure characteristics of effective praise. We evaluated interrater reliability of the BSP-OT to measure praise specificity, contingency, and variety using intraclass correlation (ICC) and Cohen's…
Descriptors: Test Reliability, Classroom Observation Techniques, Positive Reinforcement, Interrater Reliability
Tabin, Mireille; Diacquenod, Cindy; De Palma, Nicola; Gerber, Fabienne; Straccia, Claudio; Wilson, Carlene; Kosel, Markus; Petitpierre, Geneviève – Journal of Intellectual & Developmental Disability, 2021
Background: Social vulnerability refers to the ways in which an individual is at risk of being victimised. The Test of Interpersonal Competences and Personal Vulnerability [TICPV] is an Australian assessment tool designed for adults with intellectual disabilities (ID) [Wilson et al. (1996). Vulnerability to criminal exploitation: Influence of…
Descriptors: Test Validity, At Risk Persons, Intellectual Disability, Test Reliability
Kats, Daniel J.; Skotko, Brian G.; de Graaf, Gert; Skladzien, Ellen; Hooper, Brian Takashi; Mordi, Rose; Mykhailenko, Tetiana; Buckley, Frank; Patsiogiannis, Vasiliki; Krell, Kavita; Haugen, Kelsey; Donelan, Karen – Journal of Applied Research in Intellectual Disabilities, 2023
Background: Down syndrome is the most common liveborn genetic condition. However, there are no surveys measuring societal services and supports for people with Down syndrome. We developed a questionnaire so that initiatives could be targeted towards countries most in need of assistance. Method: We formed a geographically diverse group of…
Descriptors: Down Syndrome, Social Services, Questionnaires, Test Construction
Atar, Burcu; Atalay Kabasakal, Kubra; Kibrislioglu Uysal, Nermin – Journal of Experimental Education, 2023
The purpose of this study was to evaluate the population invariance of equating functions across country subgroups in TIMSS 2015 mathematics tests in relation to the raw-score distribution, DIF, and DTF. We used equipercentile and IRT observed-score equating methods. The results of the study indicate that there is a relationship between the…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Mathematics Tests
Weeks, Sean N.; Renshaw, Tyler L.; Serang, Sarfaraz – Journal of Psychoeducational Assessment, 2023
Minority stress theory is a model for understanding health disparities among sexual minorities, defined as those who experience a level of same-sex attraction, identity, or behavior. Methods for assessing minority stress among youth included only adult measures until the development of the Sexual Minority Adolescent Stress Inventory (SMASI). The…
Descriptors: Adolescents, LGBTQ People, Test Validity, Stress Variables
Rutkowski, David; Rutkowski, Leslie; Valdivia, Dubravka Svetina; Canbolat, Yusuf; Underhill, Stephanie – Applied Measurement in Education, 2023
Several states in the US have removed time limits on their state assessments. In Indiana, where this study takes place, the state assessment is both untimed during the testing window and allows unlimited breaks during the testing session. Using grade 3 and 8 math and English state assessment data, in this paper we focus on time used for testing…
Descriptors: Testing, Time, Intervals, Academic Achievement
Mi, Shuaishuai; Ye, Jianqiang; Li, Yan; Bi, Hualin – Physical Review Physics Education Research, 2023
This study developed and validated an instrument to investigate senior school students' understanding of electrostatics and provide a cognitive diagnostic assessment of their strengths and weaknesses on the related concepts (e.g., electric charge). The instrument included 20 four-tier multiple-choice items and the development process is organized…
Descriptors: Test Construction, Test Validity, High School Students, Student Evaluation
Adams, Curt; Adigun, Olajumoke Beulah; Fiegener, Ashlyn; Olsen, Jentre J. – Journal of Educational Administration, 2023
Purpose: The study begins by defining and conceptualizing Transformative Leadership Conversation (TLC). The conceptualization addresses the meaning of transformation, sensemaking and learning dialogue, and the conversation structures of framing, questioning and listening, and affirming. Next, the authors build a theoretical argument from…
Descriptors: Transformational Leadership, Questioning Techniques, Listening, Language Usage
Henninger, Mirka; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023
To detect differential item functioning (DIF), Rasch trees search for optimal split-points in covariates and identify subgroups of respondents in a data-driven way. To determine whether and in which covariate a split should be performed, Rasch trees use statistical significance tests. Consequently, Rasch trees are more likely to label small DIF…
Descriptors: Item Response Theory, Test Items, Effect Size, Statistical Significance
Salimpour, Saeed; Tytler, Russell; Doig, Brian; Fitzgerald, Michael T.; Eriksson, Urban – International Journal of Science and Mathematics Education, 2023
Cosmology concepts encompass complex spatial and temporal relations that are counterintuitive. Cosmology findings, because of their intrinsic interest, are often reported in the public domain with enthusiasm, and students come to cosmology with a range of conceptions some aligned and some at variance with the current science. This makes cosmology…
Descriptors: Astronomy, Concept Formation, Test Construction, Test Validity
Harris, Dan; Coleman, Kathryn; Cook, Peter J. – Australian Educational Researcher, 2023
This article details how and why we have developed a flexible and responsive process-based rubric exemplar for teaching, learning, and assessing critical and creative thinking. We hope to contribute to global discussions of and efforts toward instrumentalising the challenge of assessing, but not standardising, creativity in compulsory education.…
Descriptors: Critical Thinking, Creative Thinking, Ecology, Compulsory Education

Peer reviewed
Direct link
