Publication Date
| In 2026 | 0 |
| Since 2025 | 3 |
| Since 2022 (last 5 years) | 9 |
| Since 2017 (last 10 years) | 25 |
| Since 2007 (last 20 years) | 48 |
Descriptor
| Foreign Countries | 68 |
| Scoring | 68 |
| Test Reliability | 68 |
| Test Validity | 43 |
| Test Construction | 21 |
| Test Items | 15 |
| Correlation | 11 |
| Multiple Choice Tests | 11 |
| Testing | 11 |
| Comparative Analysis | 10 |
| Language Tests | 10 |
Author
| Adams, Raymond J. | 1 |
| Ahmed, Ayesha | 1 |
| Akeju, S. A. | 1 |
| Akgul, Savas | 1 |
| Aktas, Mehtap | 1 |
| Alatli, Betül | 1 |
| Alici, Devrim | 1 |
| Allison, Carrie | 1 |
| Almehrizi, Rashid S. | 1 |
| Aloisi, Cesare | 1 |
| Andrews, Jac | 1 |
Audience
| Practitioners | 2 |
| Policymakers | 1 |
| Researchers | 1 |
| Teachers | 1 |
Location
| Canada | 8 |
| Turkey | 8 |
| Australia | 4 |
| Netherlands | 4 |
| Germany | 3 |
| United Kingdom | 3 |
| United States | 3 |
| Europe | 2 |
| Japan | 2 |
| Switzerland | 2 |
| Taiwan | 2 |
Janika Saretzki; Rosalie Andrae; Boris Forthmann; Mathias Benedek – Journal of Creative Behavior, 2025
Divergent thinking (DT) ability is widely regarded as a central cognitive capacity underlying creativity, but its assessment is challenged by the fact that DT tasks yield a variable number of responses. Various approaches for the scoring of DT tasks have been proposed, which differ in how responses are evaluated and aggregated within a task. The…
Descriptors: Creative Thinking, Creativity Tests, Scoring, Metacognition
M. Arda Atakaya; Ugur Sak; M. Bahadir Ayas – Creativity Research Journal, 2024
Scoring in creativity research has been a central problem since creativity became an important issue in psychology and education in the 1950s. The current study examined the psychometric properties of 27 creativity indices derived from summed and averaged scores using 15 scoring methods. Participants included 2802 middle-school students. Data…
Descriptors: Psychometrics, Creativity, Creativity Tests, Scoring
Alatli, Betül – International Journal of Curriculum and Instruction, 2022
This study was conducted to review the use of tests. For this purpose, 45 articles in which the Turkish form of the "Test Anxiety Inventory (TAI)," which is one of the tests frequently used in the field of education, was employed and that were published between 2000 and 2020 were examined in terms of factors that should be considered in…
Descriptors: Anxiety, Likert Scales, Test Anxiety, Test Reliability
Laila El-Hamamsy; María Zapata-Cáceres; Estefanía Martín-Barroso; Francesco Mondada; Jessica Dehler Zufferey; Barbara Bruno; Marcos Román-González – Technology, Knowledge and Learning, 2025
The introduction of computing education into curricula worldwide requires multi-year assessments to evaluate the long-term impact on learning. However, no single Computational Thinking (CT) assessment spans primary school, and no group of CT assessments provides a means of transitioning between instruments. This study therefore investigated…
Descriptors: Cognitive Tests, Computation, Thinking Skills, Test Validity
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
Almehrizi, Rashid S. – Applied Measurement in Education, 2021
KR-20 reliability and its extension (coefficient alpha) give the reliability estimate of test scores under the assumption of tau-equivalent forms. KR-21 reliability gives the reliability estimate for summed scores for dichotomous items when items are randomly sampled from an infinite pool of similar items (randomly parallel forms). The article…
Descriptors: Test Reliability, Scores, Scoring, Computation
Nelson, Brenna Scadden; Petersen, Douglas B.; Rai, Anuradha – Language and Education, 2022
Background: Children in India are among the lowest performing students on international reading comprehension and writing assessments. Oral language instruction may improve these literacy outcomes. Aim: This pilot study investigated whether a multi-tiered system of language support (MTSLS) improved oral narrative language comprehension, reading…
Descriptors: Foreign Countries, Oral Language, Language Processing, Reading Comprehension
Er, Zübeyde; Dinç Artut, Perihan; Bal, Ayten Pinar – Pegem Journal of Education and Instruction, 2023
The aim of this research is to develop a reliable and valid scale to determine the mathematical thinking skills of gifted students. In addition, the developed scale was used to examine the thinking skills of gifted students in terms of various variables. In this context, the research was carried out on two different study groups. The first stage of…
Descriptors: Measures (Individuals), Rating Scales, Test Construction, Construct Validity
Heng Lu – PASAA: Journal of Language Teaching and Learning in Thailand, 2023
This test review examines the Duolingo English Test (DET), an alternative, machine-driven online English proficiency test. The review covers essential information about the DET, such as test purpose, usage, score mapping to the CEFR scale, price, and publisher. The test's usefulness is then discussed with a focus on reliability,…
Descriptors: Computer Software, Computer Assisted Instruction, Second Language Learning, Second Language Instruction
Safak, Pinar; Cakmak, Salih; Karakoc, Tamer; Aydin O'Dwyer, Pinar – European Journal of Educational Research, 2021
This study aimed to develop a valid and reliable instrument that measures the functional vision of students with low vision. Thus, an assessment tool and performance activities were developed for three vision skill groups (near vision skills, distance vision skills, and visual field) that include functional vision skills. The universe was 1485…
Descriptors: Foreign Countries, Vision Tests, Diagnostic Tests, Vision
Uzun, N. Bilge; Alici, Devrim; Aktas, Mehtap – European Journal of Educational Research, 2019
The purpose of the study is to use generalizability theory to examine the reliability of analytical rubrics and checklists developed for the assessment of story writing skills. The study group consisted of 52 students attending the 5th grade at primary school and 20 raters at Mersin University. The G study was carried out with the fully crossed…
Descriptors: Foreign Countries, Scoring Rubrics, Check Lists, Writing Tests
Haug, Tobias; Batty, Aaron Olaf; Venetz, Martin; Notter, Christa; Girard-Groeber, Simone; Knoch, Ute; Audeoud, Mireille – Language Testing, 2020
In this study we seek evidence of validity according to the socio-cognitive framework (Weir, 2005) for a new sentence repetition test (SRT) for young Deaf L1 Swiss German Sign Language (DSGS) users. SRTs have been developed for various purposes for both spoken and sign languages to assess language development in children. In order to address the…
Descriptors: Foreign Countries, Language Tests, Sentences, Repetition
Soeharto, Soeharto – Journal of Turkish Science Education, 2021
This study aims to evaluate the psychometric properties of the developed diagnostic assessment test and to identify student misconceptions in science in terms of school grades. A random sample of 153 students was drawn from the 10th to 12th grades of senior high schools. The 32 items of the two-tier multiple-choice diagnostic test were…
Descriptors: Grade 12, High School Students, Scientific Attitudes, Misconceptions
Kleijn, Suzanne; Pander Maat, Henk; Sanders, Ted – Language Testing, 2019
Although there are many methods available for assessing text comprehension, the cloze test is not widely acknowledged as one of them. Critiques on cloze testing center on its supposedly limited ability to measure comprehension beyond the sentence. However, these critiques do not hold for all types of cloze tests; the particular configuration of a…
Descriptors: Cloze Procedure, Language Tests, Semantics, Scoring
Xu, Jing; Jones, Edmund; Laxton, Victoria; Galaczi, Evelina – Assessment in Education: Principles, Policy & Practice, 2021
Recent advances in machine learning have made automated scoring of learner speech widespread, and yet validation research that provides support for applying automated scoring technology to assessment is still in its infancy. Both the educational measurement and language assessment communities have called for greater transparency in describing…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computer Software

