Janvier, Denisse; Choi, Yeo Bi; Klein, Claire; Lord, Catherine; Kim, So Hyun – Journal of Autism and Developmental Disorders, 2022
Describing the relative severity and change in autism symptoms is crucial for the appropriate characterization of clinical and research populations. The calibrated severity score (CSS) of the Autism Diagnostic Observation Schedule-2 (ADOS-2; Lord et al., 2012) was created to better describe autism symptom severity consistently across different…
Descriptors: Test Reliability, Autism, Pervasive Developmental Disorders, Observation
Kascak, Ondrej – Discourse: Studies in the Cultural Politics of Education, 2022
This study draws on qualitative data to analyse the effect of the national Year 5 test culture on test actors in selected Slovak schools. The test culture is presented in the context of disciplinary power, while the method of analysis follows Foucault's later recommendation to focus more on the 'new economy of power relations'. Such an analysis is…
Descriptors: Foreign Countries, Power Structure, Standardized Tests, Testing
Subarkah, Edi; Kartowagiran, Badrun; Sumarno; Hamdi, Syukrul; Rahim, Abdul – International Journal of Educational Methodology, 2022
This research aims to develop the product of the life skill education program (LSEP) which is accurate, credible, and effective. This research used the Plomp model. The model covers the input, process, output, outcome and consists of instrument, scoring guidance, and good or bad criteria. The instruments used in the model are the questionnaire,…
Descriptors: Daily Living Skills, Questionnaires, Observation, Test Validity
Yaneva, Victoria; Clauser, Brian E.; Morales, Amy; Paniagua, Miguel – Advances in Health Sciences Education, 2022
Understanding the response process used by test takers when responding to multiple-choice questions (MCQs) is particularly important in evaluating the validity of score interpretations. Previous authors have recommended eye-tracking technology as a useful approach for collecting data on the processes test takers use to respond to test questions.…
Descriptors: Eye Movements, Artificial Intelligence, Scores, Test Interpretation
Baydas Onlu, Ozlem; Abdusselam, Mustafa Serkan; Yilmaz, Rabia Meryem – Contemporary Educational Technology, 2022
This study aimed to develop the "Students' Perception of Instructional Feedback Scale" (SPIFS) determining a framework related to the perception of instructional feedback by students. The sequential exploratory mixed method was used in the study. The study was conducted during the instructional design course offered to sophomores in the…
Descriptors: Student Attitudes, Feedback (Response), Test Validity, Test Reliability
Du, Xiaoli; Glass, Jennifer Elaine; Balow, Stephanie; Dyer, Lisa M.; Rathbun, Pamela A.; Guan, Qiaoning; Liu, Jie; Wu, Yaning; Dawson, D. Brian; Walters-Sen, Lauren; Smolarek, Teresa A.; Zhang, Wenying – Journal of Autism and Developmental Disorders, 2022
Our institution developed and continuously improved a Neurodevelopmental Reflex (NDR) algorithm to help physicians with genetic test ordering for neurodevelopmental disorders (NDDs). To assess its performance, we performed a retrospective study of 511 patients tested through NDR from 2018 to 2019. SNP Microarray identified pathogenic/likely…
Descriptors: Test Construction, Genetic Disorders, Patients, Diagnostic Tests
Botes, Elouise; van der Westhuizen, Lindie; Dewaele, Jean-Marc; MacIntyre, Peter; Greiff, Samuel – Applied Linguistics, 2022
Foreign language classroom anxiety (FLCA) is a popular construct in applied linguistics research, traditionally measured with the 33-item Foreign Language Classroom Anxiety Scale (FLCAS). However, recent studies have started utilizing the eight-item Short-Form FLCAS (S-FLCAS). There is therefore a need, which this study addressed in five…
Descriptors: Test Validity, Second Language Learning, Anxiety, Measures (Individuals)
Saltos-Rivas, Rafael; Novoa-Hernández, Pavel; Serrano Rodríguez, Rocío – SAGE Open, 2022
Evaluating digital competencies has become a topic of growing interest in recent years. Although several reviews and studies have summarized the main elements of progress and shortcomings in this area, some issues are yet to be explored. Very little information is available about the ways of ensuring the validity and reliability of the instrument…
Descriptors: Test Reliability, Test Validity, Evaluation Methods, Technological Literacy
Parkin, Jason R.; Robins Deville, Lily – Journal of Psychoeducational Assessment, 2022
Like all psychoeducational batteries, the Wechsler Individual Achievement Test, Fourth Edition (WIAT-4) requires independent investigation and analysis. The publisher provides multiple theories to support interpretation of its reading measures. At the word reading level, the battery includes a new Phonemic Proficiency subtest that the publisher…
Descriptors: Achievement Tests, Reading Tests, Reading Skills, Reading Comprehension
Wilhelmsen, Gunvor Birkeland; Felder, Marion – Improving Schools, 2022
Intact visual functions are necessary for children to reach their academic potential. In the absence of vision screening, children may have unnoticed vision disturbances and academic challenges may be attributed to other problems, such as learning or cognitive disabilities. Visual problems are detrimental to educational achievement if they are not…
Descriptors: Vision Tests, Screening Tests, Foreign Countries, Faculty Development
Baysal, Emine Akkas; Ocak, Gürbüz – International Journal of Progressive Education, 2022
This study aimed to develop a reliable and valid scale to reveal the cognitive biases of university students in the context of analytical thinking skills. During the scale development process, a 5-point Likert-type pre-trial form consisting of 60 items was first created. The pre-trial form was administered to 450 students at Afyon Kocatepe University.…
Descriptors: College Students, Thinking Skills, Test Reliability, Test Validity
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2022
The reliability of a test score is usually underestimated and the deflation may be profound, 0.40 - 0.60 units of reliability or 46 - 71%. Eight root sources of the deflation are discussed and quantified by a simulation with 1,440 real-world datasets: (1) errors in the measurement modelling, (2) inefficiency in the estimator of reliability within…
Descriptors: Test Reliability, Scores, Test Items, Correlation
Kogar, Hakan – International Journal of Assessment Tools in Education, 2022
The purpose of this study is to identify which scale short-form development method produces better findings in different factor structures. A simulation study was designed based on this purpose. Three different factor structures and three simulation conditions were selected. As the findings of this simulation study, the model-data fit and…
Descriptors: Test Construction, Measures (Individuals), Factor Structure, Test Reliability
Ryan, Joseph J.; Gontkovsky, Samuel T. – Journal of Psychoeducational Assessment, 2021
We analyzed data from the WASI-II manual to determine discrepancy score reliabilities of the Verbal Comprehension (VCI) and Perceptual Reasoning (PRI) indexes and the four subtests in the child and adult standardization samples. Reliabilities of the VCI-PRI discrepancy scores range from 0.78 to 0.86 for children and 0.82 to 0.89 for adults and…
Descriptors: Intelligence Tests, Test Reliability, Scores, Children
Barth, Philipp; Stadtmann, Georg – Journal of Creative Behavior, 2021
The "consensual assessment technique" (CAT) is a reliable and valid method to measure (product) creativity and often considered "the" gold standard of creativity assessment. The reliability measure traditionally applied in CAT studies--inter-rater reliability--cannot capture time-sampling error, which is a particularly relevant…
Descriptors: Creativity, Creativity Tests, Test Reliability, Interrater Reliability

