Publication Date
| In 2026 | 0 |
| Since 2025 | 5 |
| Since 2022 (last 5 years) | 23 |
| Since 2017 (last 10 years) | 563 |
| Since 2007 (last 20 years) | 1786 |
Descriptor
| Statistical Analysis | 2533 |
| Reliability | 1278 |
| Test Reliability | 1074 |
| Foreign Countries | 940 |
| Correlation | 633 |
| Test Validity | 630 |
| Factor Analysis | 559 |
| Validity | 508 |
| Questionnaires | 479 |
| Measures (Individuals) | 411 |
| Test Construction | 338 |
Author
| Alonzo, Julie | 12 |
| Price, Gary G. | 12 |
| Tindal, Gerald | 10 |
| Lai, Cheng-Fei | 9 |
| Brennan, Robert L. | 8 |
| Raykov, Tenko | 8 |
| Feldt, Leonard S. | 7 |
| Livingston, Samuel A. | 7 |
| Park, Bitnara Jasmine | 7 |
| Irvin, P. Shawn | 6 |
| Anderson, Daniel | 5 |
Audience
| Researchers | 34 |
| Practitioners | 21 |
| Teachers | 10 |
| Students | 8 |
| Administrators | 5 |
| Counselors | 2 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 204 |
| Nigeria | 57 |
| Jordan | 38 |
| Australia | 35 |
| Iran | 35 |
| Taiwan | 35 |
| Canada | 31 |
| China | 30 |
| Germany | 29 |
| California | 28 |
| United Kingdom | 25 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Secondary School Examinations Council, London (England). – 1964
This bulletin addresses all concerned with examinations for the Certificate of Secondary Education, particularly teachers, with suggestions to improve the efficiency and fairness of examining. The bulletin is divided into two parts: Part I - Examining Techniques: General Principles, covers such major subjects as objectives of the course, analysing…
Descriptors: Academic Achievement, Achievement Tests, Bulletins, Educational Objectives
Pecorella, Patricia A.; Bowers, David G. – 1976
Analyses preparatory to construction of a suitable file for generating a system of future performance trend indicators are described. Such a system falls into the category of a current value approach to human resources accounting. It requires that there be a substantial body of data which: (1) uses the work group or unit, not the individual, as…
Descriptors: Accounting, Administration, Cost Effectiveness, Efficiency
Iramaneerat, Cherdsak; Myford, Carol M. – Online Submission, 2006
A multi-faceted Rasch measurement (MFRM) approach was used to analyze clinical performance ratings of 24 first-year residents in one surgery residency program in Thailand to investigate three types of rater effects: leniency, rater inconsistency, and restriction of range. Faculty from 14 surgical services rated the clinical performance of…
Descriptors: Foreign Countries, Measures (Individuals), Job Performance, Interrater Reliability
Iramaneerat, Cherdsak; Yudkowsky, Rachel – Online Submission, 2006
A multi-faceted Rasch measurement (MFRM) model was used to analyze a clinical skills assessment of 173 fourth-year medical students in a Midwestern medical school to investigate four types of rater errors: leniency, inconsistency, halo, and restriction of range. Each student performed six clinical tasks with six standardized patients (SPs), who…
Descriptors: Patients, Physical Examinations, Medical Students, Clinical Experience
Peer reviewed: Mittenberg, Wiley; And Others – Psychological Assessment, 1992
Normative data for the Wechsler Memory Scale-Revised were derived empirically using a sample of 50 volunteers between 25 and 34 years of age, who matched U.S. Census data on demographic characteristics. Differences between these empirical norms and published norms that were estimated statistically appear clinically significant. (SLD)
Descriptors: Adults, Census Figures, Demography, Diagnostic Tests
Peer reviewed: Asmus, Edward P.; Harrison, Carole S. – Journal of Research in Music Education, 1990
Administers an experimental college version of the Musical Aptitude Profile (CMAP) and 2 motivation measures to 187 nonmusic majors in a music appreciation course. Eliminates CMAP results from the analysis because of low reliability. Includes data table that were significantly related. (NL)
Descriptors: Aptitude Tests, Higher Education, Learning Motivation, Motivation Techniques
Peer reviewed: Sameroff, Arnold J.; Fiese, Barbara H. – Monographs of the Society for Research in Child Development, 1999
Investigated reliability of the Family Narrative Consortium (FNC) scales for measuring the effect of social context on construction of family narratives. Analyzed data from four FNC studies to determine reliability of scale dimensions of narrative coherence, narrative interaction, and relationship beliefs. Found that the scale was a set of…
Descriptors: Depression (Psychology), Family Environment, Family History, Family Influence
Feldt, Leonard S.; Kim, Seonghoon – Educational and Psychological Measurement, 2006
Researchers sometimes need a statistical test of the hypothesis that two values of Cronbach's alpha reliability coefficient are equal. The situation may involve scores from two different measures administered to independent random samples or from the same measure administered to random samples from two different populations. Feldt derived a test…
Descriptors: Individual Testing, Test Items, Sample Size, Scores
Whittaker, Tiffany A.; Stapleton, Laura M. – Multivariate Behavioral Research, 2006
Cudeck and Browne (1983) proposed using cross-validation as a model selection technique in structural equation modeling. The purpose of this study is to examine the performance of eight cross-validation indices under conditions not yet examined in the relevant literature, such as nonnormality and cross-validation design. The performance of each…
Descriptors: Multivariate Analysis, Selection, Structural Equation Models, Evaluation Methods
Romero, Fernando; Paris, Scott G.; Brem, Sarah K. – Current Issues in Education, 2005
We examined underlying mechanisms for comprehension differences across expository and narrative text while controlling for factors confounded in the extant literature. Fourth grade students (n=32) read both an expository and a narrative text, and completed both a local comprehension assessment, and a global retelling assessment for each text.…
Descriptors: Reading Comprehension, Grade 4, Psycholinguistics, Models
Moon, Tonya R.; Callahan, Carolyn M.; Brighton, Catherine M.; Hertberg, Holly; Esperat, Andrea M. – Journal for the Education of the Gifted, 2003
The purpose of this study was to collect reliability and validity data on the School Characteristics Inventory (SCI), a quantitative measure based on Sternberg's (2000) theory of contextual modifiability. Data were collected from a national sample of middle school teachers and from teachers participating in a 3-year study investigating teachers'…
Descriptors: Reputation, Validity, Educational Innovation, Factor Analysis
Varrella, Gary F.; Veronesi, Peter D. – Journal of Elementary Science Education, 2004
This paper represents Part I of a two-part study examining preservice teachers' development of a personalized, research-based Science Teaching Rationale (STR). Researchers have historically documented the application of the "rationale paper" (Clough, 1992; Veronesi, 1998) using qualitative methodologies. Since the rationale paper continues to…
Descriptors: Preservice Teachers, Elementary School Science, Statistical Analysis, Evaluation Methods
Cope, Ronald T. – 1995
This paper deals with the problems that arise in performance assessment from the granularity that results from having a small number of tasks or prompts and raters of responses to these tasks or prompts. Two problems are discussed in detail: (1) achieving a satisfactory degree of reliability; and (2) equating or adjusting for differences of…
Descriptors: Difficulty Level, Educational Assessment, Equated Scores, High Stakes Tests
de Jong, John H. A. L. – 1984
The Netherlands' secondary education system is highly differentiated, with four different school types for four scholastic ability levels. Final examinations must accommodate these four levels, and require a test-independent definition of the intended final ability levels as well as a sample-free evaluation of the range of ability levels at which…
Descriptors: Difficulty Level, Efficiency, Equated Scores, Foreign Countries
Perlman, Carole L.; And Others – 1988
The reliability of item bias estimates was studied for four methods: (1) the transformed delta method; (2) Shepard's modified delta method; (3) Rasch's one-parameter residual analysis; and (4) the Mantel-Haenszel procedure. Bias statistics were computed for each sample using all methods. Data were from administration of multiple-choice items from…
Descriptors: Elementary Education, Elementary School Students, Estimation (Mathematics), Item Banks

