Publication Date
| In 2026 | 10 |
| Since 2025 | 2328 |
| Since 2022 (last 5 years) | 12843 |
| Since 2017 (last 10 years) | 33968 |
| Since 2007 (last 20 years) | 68459 |
Descriptor
| Foreign Countries | 30579 |
| Test Validity | 21757 |
| Scores | 18263 |
| Academic Achievement | 16934 |
| Test Construction | 16763 |
| Test Reliability | 15036 |
| Achievement Tests | 14864 |
| Standardized Tests | 14724 |
| Comparative Analysis | 14431 |
| Elementary Secondary Education | 13046 |
| Language Tests | 12551 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5034 |
| Teachers | 3394 |
| Researchers | 2630 |
| Policymakers | 1232 |
| Administrators | 979 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2823 |
| Australia | 2430 |
| Canada | 2270 |
| California | 1854 |
| United States | 1727 |
| Texas | 1615 |
| China | 1579 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1203 |
| Germany | 1123 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Jewsbury, Paul A.; van Rijn, Peter W. – Journal of Educational and Behavioral Statistics, 2020
In large-scale educational assessment data consistent with a simple-structure multidimensional item response theory (MIRT) model, where every item measures only one latent variable, separate unidimensional item response theory (UIRT) models for each latent variable are often calibrated for practical reasons. While this approach can be valid for…
Descriptors: Item Response Theory, Computation, Test Items, Adaptive Testing
Witmer, Sara E.; Roschmann, Sarina – Measurement and Evaluation in Counseling and Development, 2020
It is critical to examine whether test accommodations function as intended in removing construct-irrelevant variance. The measurement comparability of a math test for students with emotional impairments and those without disabilities was examined. Results indicated the presence of limited differential item functioning (DIF) regardless of…
Descriptors: Testing Accommodations, Mathematics Tests, Emotional Disturbances, Students with Disabilities
Calimeris, Lauren; Kosack, Edward – Journal of Economic Education, 2020
In this study, the authors investigate the impact of the Immediate Feedback Assessment Technique (IF-AT) on student learning outcomes in principles of microeconomics classes. The IF-AT enables students to receive immediate feedback and to retry questions for partial credit. The authors use a randomized experiment to evaluate the effect of the…
Descriptors: Feedback (Response), Tests, Microeconomics, Introductory Courses
Mangum, A. – PRIMUS, 2020
This paper gives a blueprint for reacting to the various issues that an instructor will face in their day-to-day life as the manager of a Mastery-Based Testing (MBT) classroom. For example, I created a rubric to compare traditional testing with MBT, adapted my office hours to accommodate more retake opportunities, and required effort between…
Descriptors: Mastery Tests, Testing, Calculus, College Mathematics
Chu, Man-Wai; Fung, Karen – Research in Science Education, 2020
Canadian students experience many different assessments throughout their schooling (O'Connor 2011). There are many benefits to using a variety of assessment types, item formats, and science-based performance tasks in the classroom to measure the many dimensions of science education. Although using a variety of assessments is beneficial, it is…
Descriptors: Student Evaluation, Science Achievement, Foreign Countries, Test Format
Chen, Li-Ming; Jin, Kuan-Yu – British Journal of Educational Psychology, 2020
Background: Most bullying incidents occur in the presence of bystanders, with few choosing to intervene. Therefore, the development of a valid instrument to measure individuals' willingness to intervene in bullying is warranted. Aims: This study aimed to develop as well as validate a self-reported willingness to intervene in bullying scale (WIBS)…
Descriptors: Test Construction, Bullying, Intervention, Junior High School Students
Miller, Matthew B.; Jimenez-Garcia, John Alexander; Hong, Chang Ki; DeMont, Richard – Measurement in Physical Education and Exercise Science, 2020
The Child-Focused Injury Risk Screening Tool (ChildFIRST) is a process-based assessment including 10 movement skills with 4 associated evaluation criteria. The ChildFIRST has been validated by a group of experts to evaluate movement competence and injury risk in 8-12-year-olds. The purpose of this study is to evaluate the reliability of the…
Descriptors: Screening Tests, Risk Assessment, Injuries, Psychomotor Skills
Raczka, Roman; Theodore, Kate; Williams, Janice – Journal of Intellectual Disabilities, 2020
There is an appropriate increasing focus on the need to ensure the voices of people with intellectual disability are captured as part of assessing individuals' quality of life; however, there remains a lack of a consensus on ways to achieve this. This article describes the development of a self-report measure of quality of life for people with…
Descriptors: Quality of Life, Intellectual Disability, Psychometrics, Test Validity
Eroglu, Sultan Yavuz; Eroglu, Erdem – International Journal of Progressive Education, 2020
The purpose of this study was to develop a scale towards career planning of sports sciences students. Study group consisted of 543 students who were attending in physical education and sports teaching, sports management, and coaching departments in Siirt University. Construct validity of scale was tested through factor analysis and confirmatory…
Descriptors: Foreign Countries, Measures (Individuals), Career Planning, College Students
Witmer, Sara E.; Roschmann, Sarina – Education and Training in Autism and Developmental Disabilities, 2020
Although it is critical for students with autism to be included in large-scale assessment and accountability systems, it is not clear how to best measure their underlying academic skills and knowledge. Additional empirically-supported guidance is necessary to assist school teams that need to make decisions about how to best include students with…
Descriptors: Testing Accommodations, Autism, Pervasive Developmental Disorders, Students with Disabilities
Strait, Julia Englund; Dawson, Peg; Walther, Christine A. P.; Strait, Gerald Gill; Barton, Amy K.; Brunson McClain, Maryellen – Contemporary School Psychology, 2020
Executive functioning (EF) skills are vital for academic success. Along with the recent explosion of interventions targeting these skills comes the need for affordable, efficient, and ecologically valid measures for planning and tailoring interventions and monitoring outcomes. The current study describes the refinement and initial psychometric…
Descriptors: Executive Function, Questionnaires, Rating Scales, Test Items
Thomas, Christopher L.; Cassady, Jerrell C.; Heath, Joshua A. – International Journal of School & Educational Psychology, 2020
Test anxiety has been identified as a substantial barrier to student success at all educational levels. Given the ubiquitous presence of test anxiety, there have been many attempts to provide readily available measures of test anxiety to help identify learners at-risk for adverse academic outcomes. The purpose of the current study was to test the…
Descriptors: Psychometrics, Test Anxiety, Structural Equation Models, At Risk Students
Yigit, Sihmehmet; Acar, Eyüp – International Education Studies, 2020
Purpose of the research: it is aimed to examine whether the levels of altruism of Physical Education and sports teachers differ according to some variables. This research consists of a total of 126 teachers, 35 women and 91 men, who work as physical education and sports teachers at primary education secondary grade and secondary schools in Kütahya…
Descriptors: Physical Education Teachers, Altruism, Individual Differences, Elementary Secondary Education
Zhang, Haiwei; Jiang, Yuhao; Yang, Jing – SAGE Open, 2020
Researchers have used different measures to examine participants' second-language (L2) proficiency, yet it remains unclear how these different measures influence research results on the effect of L2 proficiency on the target variable. The present research explored the modulation effect of four different measures of L2 Chinese proficiency on…
Descriptors: Language Tests, Language Proficiency, Second Language Learning, Chinese
Wang, Rui; Krosnick, Jon A. – International Journal of Social Research Methodology, 2020
Questionnaires routinely measure unipolar and bipolar constructs using rating scales. Such rating scales can offer odd numbers of points, meaning that they have explicit middle alternatives, or they can offer even numbers of points, omitting the middle alternative. By examining four types of questions in six national or regional telephone surveys,…
Descriptors: Validity, Rating Scales, Questionnaires, Telephone Surveys

Peer reviewed
Direct link
