Publication Date
| In 2026 | 10 |
| Since 2025 | 2328 |
| Since 2022 (last 5 years) | 12843 |
| Since 2017 (last 10 years) | 33968 |
| Since 2007 (last 20 years) | 68459 |
Descriptor
| Foreign Countries | 30579 |
| Test Validity | 21757 |
| Scores | 18263 |
| Academic Achievement | 16934 |
| Test Construction | 16763 |
| Test Reliability | 15036 |
| Achievement Tests | 14864 |
| Standardized Tests | 14724 |
| Comparative Analysis | 14431 |
| Elementary Secondary Education | 13046 |
| Language Tests | 12551 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5034 |
| Teachers | 3394 |
| Researchers | 2630 |
| Policymakers | 1232 |
| Administrators | 979 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2823 |
| Australia | 2430 |
| Canada | 2270 |
| California | 1854 |
| United States | 1727 |
| Texas | 1615 |
| China | 1579 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1203 |
| Germany | 1123 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Darmawan Muttaqin – Journal of Psychoeducational Assessment, 2024
The Vocational Identity Status Assessment (VISA) is one of the instruments that can be used to assess vocational identity. Conceptually, VISA consists of six sub-dimensions and has been validated using factor analysis. This study provides a factor structure test of the Indonesian version of VISA using the exploratory structural equation modeling…
Descriptors: Foreign Countries, Structural Equation Models, Vocational Interests, Occupational Tests
Ehri Ryu – Society for Research on Educational Effectiveness, 2024
Background/Context: Confirmatory factor analysis (CFA) model is a commonly adopted framework to estimate and test a measurement model. Once a well-fitting final CFA model is selected, the selected model may be used to test structural relationships of the latent constructs with other variables, to construct a test with desired reliability and…
Descriptors: Research Problems, Factor Analysis, Scores, Computation
Reeta Neittaanmäki; Iasonas Lamprianou – Language Testing, 2024
This article focuses on rater severity and consistency and their relation to major changes in the rating system in a high-stakes testing context. The study is based on longitudinal data collected from 2009 to 2019 from the second language (L2) Finnish speaking subtest in the National Certificates of Language Proficiency in Finland. We investigated…
Descriptors: Foreign Countries, Interrater Reliability, Evaluators, Item Response Theory
Gorney, Kylie – ProQuest LLC, 2023
Aberrant behavior refers to any type of unusual behavior that would not be expected under normal circumstances. In educational and psychological testing, such behaviors have the potential to severely bias the aberrant examinee's test score while also jeopardizing the test scores of countless others. It is therefore crucial that aberrant examinees…
Descriptors: Behavior Problems, Educational Testing, Psychological Testing, Test Bias
Yalinkilic, Funda; Gul, Seyda – Science Insights Education Frontiers, 2023
The aim of this study is to develop a valid and reliable achievement test on the subject of 'Basic Compounds in the Structure of Living Things'. During the preparation of the draft form of the test, a 32 item-question pool was created by the researchers in the light of the relevant literature. Then, these questions were presented to expert opinion…
Descriptors: Test Construction, Science Achievement, Science Tests, Test Validity
Thompson, Kathryn N. – ProQuest LLC, 2023
It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…
Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores
Jieun Kim – Reading in a Foreign Language, 2024
High-stakes reading tests significantly influence one's future success, leading many second language learners to engage in intensive test preparation. This study examines nine TOEFL reading preparation lectures from two popular cram schools, or [foreign characters omitted] "hagwons," in Korea, with a total duration of five hours and…
Descriptors: High Stakes Tests, Reading Tests, Teaching Methods, Test Preparation
Matthew D. Coss – Language Learning & Technology, 2025
The extent to which writing modality (i.e., hand-writing vs. keyboarding) impacts second-language (L2) writing assessment scores remains unclear. For alphabetic languages like English, research shows mixed results, documenting both equivalent and divergent scores between typed and handwritten tests (e.g., Barkaoui & Knouzi, 2018). However, for…
Descriptors: Computer Assisted Testing, Paper and Pencil Tests, Second Language Learning, Chinese
Vy Le; Jayson M. Nissen; Xiuxiu Tang; Yuxiao Zhang; Amirreza Mehrabi; Jason W. Morphew; Hua Hua Chang; Ben Van Dusen – Physical Review Physics Education Research, 2025
In physics education research, instructors and researchers often use research-based assessments (RBAs) to assess students' skills and knowledge. In this paper, we support the development of a mechanics cognitive diagnostic to test and implement effective and equitable pedagogies for physics instruction. Adaptive assessments using cognitive…
Descriptors: Physics, Science Education, Scientific Concepts, Diagnostic Tests
Kai-Lin Yang; Su-Chi Fang; Szu-Chun Fan – International Journal of STEM Education, 2025
Background: Given that limited well-developed instruments are available to assess students' transdisciplinary STEM practices (T-STEMP), this study aims to develop and validate an instrument for examining secondary students' T-STEMP competence. According to the T-STEMP assessment framework proposed in our previous study, we developed items and…
Descriptors: STEM Education, Interdisciplinary Approach, Test Construction, Test Reliability
Ersoy Öz; Okan Bulut; Zuhal Fatma Cellat; Hülya Yürekli – Education and Information Technologies, 2025
Predicting student performance in international large-scale assessments (ILSAs) is crucial for understanding educational outcomes on a global scale. ILSAs, such as the Program for International Student Assessment and the Trends in International Mathematics and Science Study, serve as vital tools for policymakers, educators, and researchers to…
Descriptors: Foreign Countries, Achievement Tests, Secondary School Students, International Assessment
Tugba Konakli; Mustafa Kemer; Seda Çatak – Psychology in the Schools, 2025
Teachers who demonstrate academic optimism actively design suitable pedagogical approaches, exhibit proficient classroom management skills, and effectively engage students in the learning process. Nevertheless, the current measurement tools used to assess teachers' academic optimism are constructed solely from the teachers' perspective and fail to…
Descriptors: Teacher Attitudes, Positive Attitudes, High School Students, Teacher Student Relationship
Deniz Görgülü; Fatma Coskun; Mustafa Demi?r; Mete Si?pahi?oglu – Education and Information Technologies, 2025
This study aims to examine the psychometric properties of a scale developed to measure teachers' ability to use artificial intelligence tools in education. The scale was originally proposed as having 33 items across 5 sub-dimensions by Chat GPT-4, an AI system. To establish content validity, the scale form was submitted to expert review. An…
Descriptors: Measures (Individuals), Artificial Intelligence, Technology Uses in Education, Psychometrics
Dini Nurani Rahmawati; R. Riandi; Rini Solihat – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2025
Student participation in Indonesia in activities that develop research skills, such as OPSI ("Olimpiade Penelitian Siswa Indonesia" or Indonesian Student Research Olympiad), remains low. A preliminary study to gather teachers' perceptions regarding research skills was also conducted by the author using a questionnaire; the results…
Descriptors: Foreign Countries, Test Construction, Research Skills, High School Students
Sun-Joo Cho; Goodwin Amanda; Jorge Salas; Sophia Mueller – Grantee Submission, 2025
This study incorporates a random forest (RF) approach to probe complex interactions and nonlinearity among predictors into an item response model with the goal of using a hybrid approach to outperform either an RF or explanatory item response model (EIRM) only in explaining item responses. In the specified model, called EIRM-RF, predicted values…
Descriptors: Item Response Theory, Artificial Intelligence, Statistical Analysis, Predictor Variables

Peer reviewed
Direct link
