Publication Date
| In 2026 | 18 |
| Since 2025 | 2375 |
| Since 2022 (last 5 years) | 12890 |
| Since 2017 (last 10 years) | 34015 |
| Since 2007 (last 20 years) | 68506 |
Descriptor
| Foreign Countries | 30599 |
| Test Validity | 21771 |
| Scores | 18272 |
| Academic Achievement | 16940 |
| Test Construction | 16772 |
| Test Reliability | 15043 |
| Achievement Tests | 14867 |
| Standardized Tests | 14727 |
| Comparative Analysis | 14431 |
| Elementary Secondary Education | 13048 |
| Language Tests | 12555 |
Audience
| Practitioners | 5034 |
| Teachers | 3394 |
| Researchers | 2630 |
| Policymakers | 1232 |
| Administrators | 979 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
Location
| Turkey | 2827 |
| Australia | 2432 |
| Canada | 2271 |
| California | 1857 |
| United States | 1728 |
| Texas | 1616 |
| China | 1580 |
| United Kingdom | 1315 |
| Florida | 1312 |
| United Kingdom (England) | 1205 |
| Germany | 1123 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Pieterse, Marcel E.; VanDerNagel, Joanneke E. L.; Ten Klooster, Peter M.; Turhan, Abdullah; Didden, Robert – Journal of Mental Health Research in Intellectual Disabilities, 2020
Introduction: Personality traits may predict the use of substances in individuals with mild intellectual disabilities (MID) or borderline intellectual functioning (BIF). The Dutch version of the Substance Use Risk Profile Scale (SURPS), adapted for this population, was tested on its psychometric properties. Method: Individuals with MID or BIF (IQ…
Descriptors: Foreign Countries, Mild Intellectual Disability, Psychometrics, Substance Abuse
Jimenez-Garcia, John Alexander; Hong, Chang Ki; Miller, Matthew B.; DeMont, Richard – Measurement in Physical Education and Exercise Science, 2020
The purpose of this Delphi-study was to establish the face and content validity of 10 movement skills, each with four evaluation criteria, to create the Children Focused Injury Risk Screening Tool (ChildFIRST) for 8-12-year-old children. We asked an international expert panel (n = 22) to validate a series of movement skills and evaluation…
Descriptors: Screening Tests, Risk Assessment, Injuries, Children
Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020
This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…
Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests
Dong, Zehua; Li, Min; Minstrell, Jim; Cui, Yunhuo – Psychology in the Schools, 2020
Science motivation is an important factor that directly influences students' science learning. Numerous studies have been undertaken to develop and validate questionnaire items for measuring students' motivation in science learning. This study is the first longitudinal examination of the Chinese version of Science Motivation Questionnaire II (SMQ…
Descriptors: Psychometrics, Learning Motivation, Chinese, Science Education
Wiggins, Holly C.; Roscoe, Eileen M. – Journal of Applied Behavior Analysis, 2020
Although a demand analysis is helpful for identifying potential establishing operations for the functional analysis (FA) demand condition, it may not always be practical due to time constraints. A potential alternative is the Negative Reinforcement Rating Scale (NRRS), an indirect assessment tool that may serve as a time efficient alternative to a…
Descriptors: Functional Behavioral Assessment, Evaluation Methods, Negative Reinforcement, Autism
Latorre-Román, Pedro Ángel; Consuegra González, Pedro José; Martínez-Redondo, Melchor; Cardona Linares, Antonio José; Salas-Sánchez, J.; Lucena Zurita, Manuel; Manjón Pozas, D.; Pérez Jiménez, Inmaculada; Aragón-Vela, J.; García-Pinillos, Felipe; Robles-Fuentes, Alejandro; Párraga-Montilla, J. A. – Mind, Brain, and Education, 2020
The purpose of this study was to design and validate a complex gait test (CGT) in preschool children and to examine the relationship between CGT performance and age, sex, and cognitive functioning. A total of 1,040 preschool children, aged 3 to 6 years, participated in this study. In all children, standardized dynamic balance test, and several…
Descriptors: Psychomotor Skills, Preschool Children, Age Differences, Gender Differences
Lenz, Katja; Dreher, Anika; Holzäpfel, Lars; Wittmann, Gerald – British Journal of Educational Psychology, 2020
Background: Concerning students' difficulties with fractions, many explanatory approaches are based on the distinction between conceptual knowledge and procedural knowledge. For further research in this field, it is thus crucial to make these constructs accessible to valid measurement. Aims: In this study, we aim at developing a test instrument…
Descriptors: Fractions, Test Construction, Mathematical Concepts, Concept Formation
Hodge, Kari J.; Morgan, Grant B. – Journal of Applied Testing Technology, 2020
The purpose of this study was to examine the use of a misspecified calibration model and its impact on proficiency classification. Monte Carlo simulation methods were employed to compare competing models when the true structure of the data is known (i.e., testlet conditions). The conditions used in the design (e.g., number of items, testlet to…
Descriptors: Item Response Theory, Accuracy, Decision Making, Classification
Eastridge, June A.; Benson, Wendi L. – Teaching of Psychology, 2020
Research on collaborative testing has shown that it decreases test anxiety, increases learning and critical thinking skills, and allows students to practice collaboration and teamwork. However, it has most often been used as a second test following traditional individual testing. This quasi-experimental study compared two models of collaborative…
Descriptors: Group Testing, Models, Statistics, Test Anxiety
Menold, Natalja – Sociological Methods & Research, 2020
Unlike other data collection modes, the effect of labeling rating scales on reliability and validity, as relevant aspects of measurement quality, has seldom been addressed in online surveys. In this study, verbal and numeric rating scales were compared in split-ballot online survey experiments. In the first experiment, respondents' cognitive…
Descriptors: Rating Scales, Online Surveys, Cognitive Processes, Eye Movements
Kottmeyer, Alexa M.; Van Meter, Peggy N.; Cameron, Chelsea E. – Journal of Educational Psychology, 2020
Relational reasoning, or the ability to identify meaningful patterns within streams of information, has emerged as an important factor in a variety of complex tasks. One factor that has received relatively little research attention, however, is how relational reasoning may be influenced by the representational systems (i.e., verbal or nonverbal)…
Descriptors: Undergraduate Students, Logical Thinking, Thinking Skills, Concept Formation
Kay, Alison E.; Hardy, Judy; Galloway, Ross K. – British Journal of Educational Technology, 2020
This study explores the relationship between engagement with an online, free-to-use question-generation application (PeerWise) and student achievement. Using PeerWise, students can create and answer multiple-choice questions and can provide feedback to the question authors on question quality. This provides further scope for students to engage in…
Descriptors: Computer Assisted Testing, Multiple Choice Tests, Academic Achievement, Feedback (Response)
Erus, Seher Merve; Tekel, Esra – European Journal of Educational Research, 2020
The purpose of this study is to develop an Interpersonal Mindfulness Scale-TR (IMS-TR) for Turkish culture. For the data collection process, four different sample groups participated in the study. To test the construct validity of the scale an exploratory factor analysis was performed. Results suggested a 13-item, two-factor solution as (1)…
Descriptors: Foreign Countries, Test Construction, Construct Validity, Test Reliability
Cabral, Luana L.; Nakamura, Fábio Y.; Stefanello, Joice M. F.; Pessoa, Luiz C. V.; Smirmaul, Bruno P. C.; Pereira, Gleber – Measurement in Physical Education and Exercise Science, 2020
The aims of this study were to perform the cross-cultural adaptation of the Borg Rating of Perceived Exertion (RPE) 6-20 Scale to Brazilian Portuguese language and to start testing its validity and reliability. After performing the cross-cultural adaptation of the Scale, concurrent and discriminative validity, and reliability were determined on a…
Descriptors: Test Validity, Test Reliability, Portuguese, Rating Scales
Gershenson, Seth – Education Next, 2020
Grade inflation is pervasive in American high schools. Is rampant grade inflation cause for concern? How teachers' grading standards affect student success is an empirical question--one that the author addresses in this article in a new study of roughly 350,000 North Carolina students taking Algebra I between 2006 and 2016. The author first…
Descriptors: Grading, Grade Inflation, Academic Standards, High School Students

