Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 11 |
| Since 2017 (last 10 years) | 30 |
| Since 2007 (last 20 years) | 53 |
Descriptor
| Test Validity | 53 |
| Test Reliability | 32 |
| Test Selection | 20 |
| Foreign Countries | 17 |
| Evaluation Methods | 11 |
| Scores | 9 |
| Test Construction | 9 |
| Psychometrics | 8 |
| Selection Criteria | 8 |
| English (Second Language) | 7 |
| Selection | 7 |
| More ▼ | |
Source
Author
| Abrea Greene | 1 |
| Al-Bulushi, Ali | 1 |
| Al-Issa, Ali | 1 |
| Al-Zadjali, Rima | 1 |
| Alcock, Katherine J. | 1 |
| Allen, Jeff M. | 1 |
| Amery D. Wu | 1 |
| Andrew P. Jaciw | 1 |
| Andrich, David | 1 |
| Apriatni, Mandri S. | 1 |
| Archwamety, Teara | 1 |
| More ▼ | |
Publication Type
Education Level
Audience
| Teachers | 3 |
| Policymakers | 2 |
| Administrators | 1 |
Location
| Australia | 3 |
| California | 3 |
| Turkey | 3 |
| Arizona | 2 |
| Canada | 2 |
| Chile | 2 |
| Greece | 2 |
| Indonesia | 2 |
| New York | 2 |
| United Kingdom (England) | 2 |
| Afghanistan | 1 |
| More ▼ | |
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
| Elementary and Secondary… | 1 |
| Every Student Succeeds Act… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Davis, Mark A.; Philip, Jestine; Walker, Laura – Management Teaching Review, 2022
This article outlines an active learning project that gives students hands-on experience in developing an undergraduate situational judgment test. The five-part activity models the process for constructing a situational judgment test--a tool commonly used for employee selection in organizations. The project is designed to help students assimilate…
Descriptors: Undergraduate Students, Situational Tests, Active Learning, Selection Tools
Scott, Kristin C. – International Journal of Information and Communication Technology Education, 2021
Since Mishra and Koehler released their framework of technological pedagogical content knowledge (TPACK), researchers have been attempting to measure it with a variety of self-assessment instruments. Early TPACK instruments struggled with construct validity. More recently, several instruments have been tested for validity and reliability…
Descriptors: Teacher Effectiveness, Teacher Evaluation, Self Evaluation (Individuals), Pedagogical Content Knowledge
Susan K. Johnsen – Gifted Child Today, 2024
The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…
Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity
Andrew P. Jaciw – American Journal of Evaluation, 2025
By design, randomized experiments (XPs) rule out bias from confounded selection of participants into conditions. Quasi-experiments (QEs) are often considered second-best because they do not share this benefit. However, when results from XPs are used to generalize causal impacts, the benefit from unconfounded selection into conditions may be offset…
Descriptors: Elementary School Students, Elementary School Teachers, Generalization, Test Bias
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Ting Zhang; Paul Bailey; Yuqi Liao; Emmanuel Sikali – Large-scale Assessments in Education, 2024
The EdSurvey package helps users download, explore variables in, extract data from, and run analyses on large-scale assessment data. The analysis functions in EdSurvey account for the use of plausible values for test scores, survey sampling weights, and their associated variance estimator. We describe the capabilities of the package in the context…
Descriptors: National Competency Tests, Information Retrieval, Data Collection, Test Validity
Jesús Honorato-Errázuriz; Valentina Bastidas-Schade; Maria-Soledad Ramírez-Montoya – Journal of Social Studies Education Research, 2024
During times of crisis, such as the COVID-19 pandemic, the evaluation of educational programs becomes crucial for making evidence-based decisions. This study aims to validate and pilot an assessment instrument tailored to evaluate an innovative national reading program in Chile, particularly during the critical phase of post-pandemic educational…
Descriptors: Foreign Countries, Reading, Reading Ability, Reading Tests
Eirini M. Mitropoulou; Leonidas A. Zampetakis; Ioannis Tsaousis – Evaluation Review, 2024
Unfolding item response theory (IRT) models are important alternatives to dominance IRT models in describing the response processes on self-report tests. Their usage is common in personality measures, since they indicate potential differentiations in test score interpretation. This paper aims to gain a better insight into the structure of trait…
Descriptors: Foreign Countries, Adults, Item Response Theory, Personality Traits
Stylos, Georgios; Siarka, Olga; Kotsis, Konstantinos T. – European Journal of Science and Mathematics Education, 2023
In a modern yet demanding society, scientific literacy (SL) is an essential skill that enables the individual to explain, understand and discuss issues related to science, health, and the environment. The purpose of this research study is to validate the Scientific Literacy Assessment (SLA) tool in the Greek language and investigate the level of…
Descriptors: Scientific Literacy, Preservice Teachers, Elementary School Teachers, Foreign Countries
Polatcan, Faruk – African Educational Research Journal, 2020
The tendency to read is due to what individuals pay attention to while reading and choosing literary works. The aim of this study is to develop a scale of pre-service teachers' reading tendencies. The study group of the research consisted of 143 pre-service Turkish language teachers studying at Sinop University. Factor analysis was performed to…
Descriptors: Reading Habits, Test Validity, Test Reliability, Reading Material Selection
Predicting Student Success in a Magnet School Setting through Intelligence and Non-Cognitive Factors
John Jeffrey McCann Jr. – ProQuest LLC, 2024
Magnet schools have been a main tool or innovation in urban education settings in the United States, originating in the early 1970's and expanding into most large urban districts today (Blank, 1989). While some magnet schools do not rely on a specific criterion to determine entry, many do. This study focuses on such a setting where students must…
Descriptors: Intelligence Tests, Magnet Schools, Urban Schools, Screening Tests
Mandracchia, Nina R.; Sims, Wesley A. – Computers in the Schools, 2020
As technology use continues to rapidly increase, so too does consumer use of web-based resources. While important, accessibility is often overemphasized by users when consuming and evaluating web resources. This prioritization may have particularly negative consequences for the selection of supports or interventions in educational settings. This…
Descriptors: Internet, Resources, Selection, Rating Scales
Omid Wali; Mohammad Rizwan Khan – Journal of Research Initiatives, 2022
English language proficiency has been considered as an important prerequisite for hiring new faculty members for various disciplines by the Afghan Ministry of Higher Education (MoHE). For this, the Departments of English across the major universities of Afghanistan such as Kabul, Nangarhar, Shaheed Prof. Rabbani Education; Heart and Balkh…
Descriptors: Foreign Countries, English (Second Language), Language Proficiency, College Faculty
Brown, Anna; Fong, Sarah – British Educational Research Journal, 2019
Despite profound influence of selection-by-ability on children's educational opportunities, empirical evidence for the validity of 11-plus tests is scarce. This study focused on secondary selection in Kent, the largest grammar school area in England. We analysed scores from the 'Kent Test' (the 11-plus test used in Kent), Cognitive Assessment…
Descriptors: Foreign Countries, Secondary Education, Elementary Schools, Scores
Isbell, Daniel R.; Kremmel, Benjamin – Language Testing, 2020
Administration of high-stakes language proficiency tests has been disrupted in many parts of the world as a result of the 2019 novel coronavirus pandemic. Institutions that rely on test scores have been forced to adapt, and in many cases this means using scores from a different test, or a new online version of an existing test, that can be taken…
Descriptors: Language Tests, High Stakes Tests, Language Proficiency, Second Language Learning

Peer reviewed
Direct link
