Publication Date
| In 2026 | 2 |
| Since 2025 | 469 |
| Since 2022 (last 5 years) | 1948 |
| Since 2017 (last 10 years) | 4520 |
| Since 2007 (last 20 years) | 7005 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10011 |
| Test Construction | 4371 |
| Foreign Countries | 3834 |
| Psychometrics | 2429 |
| Factor Analysis | 2301 |
| Measures (Individuals) | 1785 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1261 |
| Factor Structure | 1248 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 839 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 163 |
| Spain | 130 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 102 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Robinson, Carrie H.; Betz, Nancy E. – Journal of Career Assessment, 2004
This study examined the test-retest reliability and the concurrent validity of the 17-scale Expanded Skills Confidence Inventory in samples of 321 and 175 college students. Retest values over a 3-week interval ranged from .77 to .89, with a median of .85. Using Brown and Gore's C-index, evidence for the concurrent validity of confidence score…
Descriptors: College Students, Test Validity, Vocational Interests, Test Reliability
Adams, Raymond J. – Studies in Educational Evaluation, 2005
Test reliability is a concept central to classical test theory and it is commonly stated as a requirement that a test attain a certain level of reliability before it be considered of sufficient quality for practical use. This article discusses the role of reliability in item response theory, and in particular the role of reliability in contexts…
Descriptors: Test Reliability, Error of Measurement, Item Sampling, Item Response Theory
Bush, Martin E. – Quality Assurance in Education: An International Perspective, 2006
Purpose: To provide educationalists with an understanding of the key quality issues relating to multiple-choice tests, and a set of guidelines for the quality assurance of such tests. Design/methodology/approach: The discussion of quality issues is structured to reflect the order in which those issues naturally arise. It covers the design of…
Descriptors: Multiple Choice Tests, Test Reliability, Educational Quality, Quality Control
Kaufman, Alan S.; Flanagan, Dawn P.; Alfonso, Vincent C.; Mascolo, Jennifer T. – Journal of Psychoeducational Assessment, 2006
Within the field of psychological assessment, the Wechsler scales continue to be the most widely used intelligence batteries. The concepts, methods, and procedures inherent in the design of the Wechsler scales have been so influential that they have guided most of the test development and research in the field for more than a half century. This…
Descriptors: Intelligence Tests, Test Reviews, Testing, Scoring
Livingston, Ronald B.; Jennings, Earl; Colotla, Victor A.; Reynolds, Cecil R.; Shercliffe, Regan J. – Psychological Assessment, 2006
In this study, the authors examined the stability of Minnesota Multiphasic Personality Inventory--2 (J. N. Butcher, W. G. Dahlstrom, J. R. Graham, A. Tellegen, & B. Kaemmer, 1989) code types in a sample of 94 injured workers with a mean test-retest interval of 21.3 months (SD = 14.1). Congruence rates for undefined code types were 34% for…
Descriptors: Congruence (Psychology), Injuries, Personality Measures, Test Reliability
Kellett, Stephen; Beail, Nigel; Newman, David W. – American Journal on Mental Retardation, 2005
Despite interpersonal problems being commonplace in the clinical presentations of people with mental retardation, previous efforts to index interpersonal difficulties have tended to unsatisfactorily rely on external ratings. The Inventory of Interpersonal Problems-32 is a psychometrically robust self-report measure of interpersonal problems in…
Descriptors: Psychometrics, Mild Mental Retardation, Interpersonal Relationship, Interpersonal Competence
Powell, Thomas W. – Clinical Linguistics & Phonetics, 2006
The third edition of the "Boston Diagnostic Aphasia Examination" (Goodglass, Kaplan, and Barresi) introduced standardized procedures for coding discourse samples elicited using the well known Cookie Theft illustration. To evaluate the reliability of this discourse coding procedure, a transcribed sample was coded by 14 novice examiners…
Descriptors: Examiners, Interrater Reliability, Test Reliability, Aphasia
Zhuang, Xiaohua; MacCann, Carolyn; Wang, Lijuan; Liu, Lydia; Roberts, Richard D. – ETS Research Report Series, 2008
Various policy papers and research studies assert that teamwork is one of the most important skills for students to learn if they are to become meaningful contributors to the 21st century workforce. However, outside of organizational psychology and adult populations, few reliable assessments of this construct exist, with suitable validity evidence…
Descriptors: Teamwork, Cooperative Learning, Evaluation Methods, Student Evaluation
Nadeau, Luc; Richard, Jean-Francois; Godbout, Paul – Physical Education and Sport Pedagogy, 2008
Background: Coaches and physical educators must obtain valid data relating to the contribution of each of their players in order to assess their level of performance in team sport competition. This information must also be collected and used in real game situations to be more valid. Developed initially for a physical education class context, the…
Descriptors: Physical Education, Team Sports, Observation, Performance Based Assessment
Vlachopoulos, Symeon P.; Kaperoni, Maria; Moustaka, Frederiki C.; Anderson, Dean F. – Research Quarterly for Exercise and Sport, 2008
The present study reported on translating the Exercise Identity Scale (EIS: Anderson & Cychosz, 1994) into Greek and examining its psychometric properties and cross-cultural validity based on U.S. individuals' EIS responses. Using four samples comprising 33, 103, and 647 Greek individuals, including exercisers and nonexercisers, and a similar…
Descriptors: Test Reliability, Test Validity, Factor Structure, Measures (Individuals)
Moyer-Packenham, Patricia S.; Bolyard, Johnna J.; Kitsantas, Anastasia; Oh, Hana – Peabody Journal of Education, 2008
The purpose of this study was to examine the types of instruments being used to document mathematics and science teacher quality characteristics in 48 nationally funded mathematics and science education awards. Each of the 48 projects operationalized teacher quality and determined how to assess it. The main research questions examined the…
Descriptors: Teacher Effectiveness, Teacher Characteristics, Awards, Psychometrics
Sales, Jessica McDermott; Milhausen, Robin R.; Wingood, Gina M.; DiClemente, Ralph J.; Salazar, Laura F.; Crosby, Richard A. – Health Education & Behavior, 2008
This study reports on the validation of a scale to assess adolescent girls' frequency of sexual communication with their parents. The Parent-Adolescent Communication Scale (PACS) was administered to 522 African American female adolescents ranging in age from 14 to 18. The PACS demonstrated satisfactory internal consistency (across multiple…
Descriptors: Self Efficacy, Adolescents, Measures (Individuals), Sexuality
Feinberg, Mark E.; Gomez, Brendan J.; Puddy, Richard W.; Greenberg, Mark T. – Health Education & Behavior, 2008
Community coalitions (CCs) have labored with some difficulty to demonstrate empirical evidence of effectiveness in preventing a wide range of adolescent problem behaviors. Training and technical assistance (TA) have been identified as important elements in promoting improved functioning of CCs. A reliable, valid, and inexpensive method to assess…
Descriptors: Prevention, Construct Validity, Risk, Questionnaires
Lembke, Erica S.; Stecker, Pamela M. – Center on Instruction, 2007
One of the best methods of formative assessment in academic areas and a method that exemplifies the characteristics of good measures is Curriculum-Based Measurement (CBM; Deno, 1985). Developed at the University of Minnesota in the early 1970's, CBM has been researched in academic areas including mathematics computation, concepts, and…
Descriptors: Curriculum Based Assessment, Formative Evaluation, Mathematics Education, Educational Research
DeSpain, Donna – ProQuest LLC, 2007
This dissertation focused on master's education cohorts at a small midwestern university. The purpose of this quantitative study was to examine the relationship between various demographic factors and other student characteristics as they were reported on a survey instrument. Of particular interest were those responses to attitudinal questions…
Descriptors: Masters Degrees, Graduate Study, Cohort Analysis, Cooperative Learning

Peer reviewed
Direct link
