Publication Date
| In 2026 | 1 |
| Since 2025 | 899 |
| Since 2022 (last 5 years) | 4508 |
| Since 2017 (last 10 years) | 10441 |
| Since 2007 (last 20 years) | 21904 |
Descriptor
| Test Validity | 21743 |
| Validity | 13779 |
| Test Reliability | 10839 |
| Foreign Countries | 9859 |
| Test Construction | 6878 |
| Factor Analysis | 5756 |
| Measures (Individuals) | 5619 |
| Predictive Validity | 5019 |
| Psychometrics | 4806 |
| Reliability | 4634 |
| Correlation | 4373 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
| More ▼ | |
Location
| Turkey | 1393 |
| Australia | 704 |
| Canada | 626 |
| China | 527 |
| United States | 439 |
| Indonesia | 387 |
| United Kingdom | 363 |
| Germany | 338 |
| California | 337 |
| Netherlands | 334 |
| Spain | 309 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Peer reviewedStansfield, Charles – System, 1984
Describes the development of the Secondary Level English Proficiency (SLEP) Test specifications and the performance of each item type during administration of the test in other countries. Innovative formats such as multiple-choice cloze and multiple-choice dictation are discussed and described. In addition, the findings of a validity study…
Descriptors: Cloze Procedure, Comparative Analysis, Dictation, English (Second Language)
Drenth, Pieter J. D.; And Others – Evaluation in Education: An International Review Series, 1983
The purpose of this research was to construct tests that could be used to support various decisions in the educational system in Tanzania. Emphasis was placed on decisions made at the end of primary education, and at the end of four years of secondary education. (Author/BW)
Descriptors: Admission (School), Aptitude Tests, Attitude Measures, College Entrance Examinations
Peer reviewedSchibeci, R. A. – Australian Journal of Education, 1984
The validation of a set of attitude scales developed for use with Australian school students, especially using teacher's ratings, is reported. The four scales measure: motivation to achieve in school learning; respect and confidence in self; attitudes toward school and school learning; and attentiveness in science lessons. (MSE)
Descriptors: Attention Control, Attitude Measures, Educational Attitudes, Elementary Secondary Education
George-Ezzelle, Carol E.; Skaggs, Gary – GED Testing Service, 2004
Current testing standards call for test developers to provide evidence that testing procedures and test scores, and the inferences made based on the test scores, show evidence of validity and are comparable across subpopulations (American Educational Research Association [AERA], American Psychological Association [APA], & National Council on…
Descriptors: Scheduling, Testing Accommodations, Academic Achievement, Test Validity
Glazerman, Steven; Tuttle, Christina – Mathematica Policy Research, Inc., 2006
Education policymakers have long sought to establish teaching standards that will measure new or continuing teachers against these standards. The problem is, existing methods for certifying teachers have been criticized for being either so onerous as to deter good candidates or so lax as to keep weak teachers in the profession. To provide another…
Descriptors: Teacher Certification, Teacher Effectiveness, Academic Achievement, National Competency Tests
Research and Training Center on Family Support and Children's Mental Health, 2006
"Data Trends" reports present summaries of research on mental health services for children and adolescents and their families. The article summarized in this "Data Trends" examines the equivalence of the BPI [Behavior Problem Index] across a sample of African American, Hispanic, and non-Hispanic White children drawn from the…
Descriptors: Measurement Techniques, Child Health, Ethnic Groups, Error of Measurement
Buck, Beverly; O'Brien, Tracey – Education Commission of the States (NJ3), 2005
This document is a summary of the findings of an extensive review by the Education Commission of the States (ECS) of empirical research on the effectiveness of current approaches to licensing and certifying teachers. The research review focused on eight questions (and several subquestions) that are of particular interest and concern to policy and…
Descriptors: Teacher Certification, Teacher Effectiveness, Teaching Methods, Verbal Ability
Cronin, John – Northwest Evaluation Association, 2004
Recently Northwest Evaluation Association (NWEA) completed a project to connect the scale of the MEA with NWEA's RIT scale. Six Maine school systems participated in the study, using test information from a group of over 800 students enrolled in fourth and eighth grade who took both the MEA and NWEA reading and mathematics tests in the spring of…
Descriptors: Grade 8, Grade 4, Achievement Tests, Standardized Tests
Tippeconnic, John W., III – 2003
This digest focuses on academic testing and American Indian and Alaska Native (AI/AN) students. Ideally, test results should be used to improve student learning. Proponents of high-stakes testing say it is needed to measure student achievement and school quality and to hold students and teachers accountable. High-stakes testing is also used to…
Descriptors: Academic Achievement, Accountability, American Indian Education, American Indian Students
Loseth, Rick; Carlson, Sarah; Picard, Anine; King, Carly; Schmid, Chris Oldakowski; VanEngen, Rose – 2000
This report discusses the outcomes of a study that investigated the effectiveness of the Preschool and Early Childhood Functional Assessment Scale (PECFAS) Screener, an assessment designed to assist in the early identification of mental health concerns in preschool-aged children. Formed in 1993, the Putting All Communities Together 4 Families…
Descriptors: Behavior Disorders, Behavior Rating Scales, Disability Identification, Early Identification
Smith, Thomas J. – 2002
A study examined whether career academy programs, with their tailored curricula, dilute content of required curricula, and whether academy students who follow this alternative course of study are less prepared to score well on standardized tests. Data were gathered through site visits to seven schools, review of reports and documents, and…
Descriptors: Career Academies, Career Education, Course Content, Field Interviews
Kenyon, Dorry; Van Duzer, Carol – Center for Adult English Language Acquisition, 2003
Ensuring that language tests for adult English language learners are appropriate, valid, and reliable is a challenge. Performance-based assessments are complex to develop and implement. Yet, because the focus of assessment, both in the National Reporting System for Adult Education (NRS) descriptors and in the Department of Education's definition…
Descriptors: Student Evaluation, Language Tests, Second Language Learning, English (Second Language)
Haertel, Edward H. – National Assessment Governing Board, 2003
The paper initially describes the sources of uncertainty in National Assessment of Educational Progress (NAEP) data and standard errors. As NAEP sample sizes have increased, greater precision has been attained by the program. For this reason, exclusion effects are increasingly important. Two scenarios of revised NAEP results are presented (for New…
Descriptors: Error of Measurement, Computation, Disabilities, Limited English Speaking
Sireci, Stephen G.; Foster, David F.; Robin, Frederic; Olsen, James – 1997
Evaluating the comparability of a test administered in different languages is a difficult, if not impossible, task. Comparisons are problematic because observed differences in test performance between groups who take different language versions of a test could be due to a difference in difficulty between the tests, to cultural differences in test…
Descriptors: Adaptive Testing, Adults, Certification, Comparative Analysis
Kim, JinGyu; And Others – 1994
The reliability and factorial validity of the Computer Attitudes Scale (CAS) was assessed with college students in South Korea. The CAS was developed for use with high school students, but has been used in higher education in the United States. It is a Likert-type scale of 30 positive and negative statements about the use of computers, and is one…
Descriptors: Attitude Measures, College Students, Computer Attitudes, Cross Cultural Studies

Direct link
