Publication Date
| In 2026 | 1 |
| Since 2025 | 899 |
| Since 2022 (last 5 years) | 4508 |
| Since 2017 (last 10 years) | 10441 |
| Since 2007 (last 20 years) | 21904 |
Descriptor
| Test Validity | 21743 |
| Validity | 13779 |
| Test Reliability | 10839 |
| Foreign Countries | 9859 |
| Test Construction | 6878 |
| Factor Analysis | 5756 |
| Measures (Individuals) | 5619 |
| Predictive Validity | 5019 |
| Psychometrics | 4806 |
| Reliability | 4634 |
| Correlation | 4373 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
| More ▼ | |
Location
| Turkey | 1393 |
| Australia | 704 |
| Canada | 626 |
| China | 527 |
| United States | 439 |
| Indonesia | 387 |
| United Kingdom | 363 |
| Germany | 338 |
| California | 337 |
| Netherlands | 334 |
| Spain | 309 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Ariamanesh, Ali A.; Barati, Hossein; Youhanaee, Manijeh – International TESOL Journal, 2022
The present study investigated the speaking module of TOEFL iBT with an emphasis on the dichotomy of independent and integrated tasks. The potential differences between the two speaking conditions were intended to be explored based on the oral performance elicited from a group of Iranian test takers. To collect the required data, a simulated…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Computer Assisted Testing
Bailey, Alison L.; Wolf, Mikyung Kim; Ballard, Laura – Educational Testing Service, 2022
The research note focuses on the alignment aspect of English language proficiency (ELP) assessments, one of the required types of validity evidence for the federal peer review process of states' assessment systems. A basic tenant of current U.S. education policy is the alignment between what a test assesses and what content has been determined as…
Descriptors: English (Second Language), Second Language Learning, Language Proficiency, Alignment (Education)
New York State Education Department, 2022
The instructions in this manual explain the responsibilities of school administrators for the New York State Testing Program (NYSTP) Grades 3-8 English Language Arts and Mathematics Paper-Based Field Tests. School administrators must be thoroughly familiar with the contents of the manual, and the policies and procedures must be followed as written…
Descriptors: Testing Programs, Mathematics Tests, Test Format, Computer Assisted Testing
Ginther, April; Yan, Xun – Language Testing, 2018
This study examines the predictive validity of the TOEFL iBT with respect to academic achievement as measured by the first-year grade point average (GPA) of Chinese students at Purdue University, a large, public, Research I institution in Indiana, USA. Correlations between GPA, TOEFL iBT total and subsection scores were examined on 1990 mainland…
Descriptors: Correlation, Computer Assisted Testing, Profiles, English (Second Language)
Gargani, John; Strong, Michael – Journal of Teacher Education, 2015
In Gargani and Strong (2014), we describe The Rapid Assessment of Teacher Effectiveness (RATE), a new teacher evaluation instrument. Our account of the validation research associated with RATE inspired a review by Good and Lavigne (2015). Here, we reply to the main points of their review. We elaborate on the validity, reliability, theoretical…
Descriptors: Evidence, Teacher Effectiveness, Teacher Evaluation, Evaluation Methods
Shek, Daniel Tan Lei; Yu, Lu – Journal of Intellectual & Developmental Disability, 2015
Background: As a comprehensive measure for children with autism spectrum disorder (ASD), the Psychoeducational Profile -- Third Edition (PEP-3) has been validated and widely used in the United States. This study attempted to investigate the psychometric properties of the Chinese version of the PEP-3 (CPEP-3) Caregiver Report. Method: A total of…
Descriptors: Autism, Pervasive Developmental Disorders, Children, Parents
Walton, Katherine M.; Ingersoll, Brooke R. – Autism: The International Journal of Research and Practice, 2016
Literature on "Thin Slice" ratings indicates that a number of personality characteristics and behaviors can be accurately predicted by ratings of very short segments (<5?min) of behavior. This study examined the utility of Thin Slice ratings of young children with autism spectrum disorder for predicting developmental skills and…
Descriptors: Autism, Pervasive Developmental Disorders, Language Acquisition, Preschool Children
Han, Jiying; Yin, Hongbiao; Wang, Wenlan – Educational Psychology, 2016
This study explored the effect of tertiary teachers' goal orientations for teaching on their commitment, with a particular focus on the mediating role of teacher engagement. The results of a survey of 597 Chinese tertiary teachers indicated that teacher commitment was positively predicted by ability approach, mastery and relational goals, but was…
Descriptors: Goal Orientation, College Faculty, Teacher Participation, Role
Teng, Lin Sophie; Zhang, Lawrence Jun – Asia-Pacific Education Researcher, 2016
This article describes the development and validation of a survey instrument, the "Writing Strategies for Motivational Regulation Questionnaire" ("WSMRQ"), designed to measure Chinese university students' reported use of motivational regulation strategies in writing in English as a second/foreign language (L2). Conceptualized…
Descriptors: Questionnaires, Writing Strategies, Factor Analysis, Correlation
Lamb, Lindsay M. – Online Submission, 2016
This executive summary highlights findings from the full report (published separately) which analyzes construct and predictive validity of a Social and Emotional Learning (SEL) Competency Survey. Students' SEL skill ratings were correlated with other measures of SEL skills and other outcomes of interest. A separate research brief also was…
Descriptors: School Districts, Social Development, Emotional Development, Social Emotional Learning
Norfolk, Philip A.; Floyd, Randy G. – Psychology in the Schools, 2016
It is often assumed that parents completing behavior rating scales during the assessment of attention-deficit/hyperactivity disorder (ADHD) can deliberately manipulate the outcomes of the assessment. To detect these actions, items designed to detect over-reporting or under-reporting of results are sometimes embedded in such rating scales. This…
Descriptors: Attention Deficit Hyperactivity Disorder, Parents, Deception, Behavior Rating Scales
Zazkis, Dov; Weber, Keith; Mejía-Ramos, Juan Pablo – Educational Studies in Mathematics, 2016
We examine a commonly suggested proof construction strategy from the mathematics education literature--that students first produce a graphical argument and then work to construct a verbal-symbolic proof based on that graphical argument. The work of students who produce such graphical arguments when solving proof construction tasks was analyzed to…
Descriptors: Mathematics Instruction, Mathematical Logic, Validity, Persuasive Discourse
Kane, Michael T. – Assessment in Education: Principles, Policy & Practice, 2016
How we choose to use a term depends on what we want to do with it. If "validity" is to be used to support a score interpretation, validation would require an analysis of the plausibility of that interpretation. If validity is to be used to support score uses, validation would require an analysis of the appropriateness of the proposed…
Descriptors: Test Validity, Test Interpretation, Test Use, Scores
Gafni, Naomi – Assessment in Education: Principles, Policy & Practice, 2016
Naomi Gafni, director of Research and Development, National Institute for Testing and Evaluation, Jerusalem, Israel, has devoted a substantial part of her career to the development of admissions tests and other educational tests and to the investigation of their validity. As such she is keenly aware of the complexities involved in this process.…
Descriptors: Test Validity, Test Interpretation, Test Use, Test Construction
Sternod, Latisha; French, Brian – Journal of Psychoeducational Assessment, 2016
The Watson-Glaser™ II Critical Thinking Appraisal (Watson-Glaser II; Watson & Glaser, 2010) is a revised version of the "Watson-Glaser Critical Thinking Appraisal®" (Watson & Glaser, 1994). The Watson-Glaser II introduces a simplified model of critical thinking, consisting of three subdimensions: recognize assumptions, evaluate…
Descriptors: Cognitive Tests, Critical Thinking, Test Construction, Test Reliability

Peer reviewed
Direct link
