Publication Date
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 114 |
| Since 2017 (last 10 years) | 375 |
| Since 2007 (last 20 years) | 1130 |
Descriptor
| Comparative Analysis | 1943 |
| Reliability | 880 |
| Test Reliability | 792 |
| Foreign Countries | 554 |
| Test Validity | 443 |
| Correlation | 350 |
| Validity | 332 |
| Interrater Reliability | 327 |
| Statistical Analysis | 321 |
| Scores | 280 |
| Measures (Individuals) | 236 |
Author
| Reckase, Mark D. | 6 |
| Attali, Yigal | 5 |
| Coniam, David | 5 |
| Brennan, Robert L. | 4 |
| Crehan, Kevin D. | 4 |
| Feldt, Leonard S. | 4 |
| Hakstian, A. Ralph | 4 |
| Jones, Ian | 4 |
| Kolen, Michael J. | 4 |
| Lunz, Mary E. | 4 |
| August, Diane | 3 |
Audience
| Researchers | 35 |
| Practitioners | 29 |
| Teachers | 15 |
| Administrators | 9 |
| Policymakers | 6 |
| Counselors | 2 |
| Media Staff | 2 |
| Parents | 1 |
| Support Staff | 1 |
Location
| Turkey | 59 |
| United States | 47 |
| Australia | 36 |
| China | 33 |
| Canada | 32 |
| United Kingdom (England) | 32 |
| United Kingdom | 28 |
| Germany | 25 |
| Netherlands | 24 |
| Taiwan | 22 |
| Hong Kong | 20 |
What Works Clearinghouse Rating
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Peer reviewed: Madsen, Clifford K.; Geringer, John M. – Bulletin of the Council for Research in Music Education, 1999
Investigates patterns of music listening among music majors regarding accompanied and unaccompanied excerpts varying in tone quality and intonation. Tests the reliability of the Continuous Response Digital Interface, contrasting the measurements obtained to those from a paper and pencil measure. Discusses the results. (CMK)
Descriptors: College Students, Comparative Analysis, Evaluation Methods, Higher Education
Peer reviewed: Anderson, Lance E.; And Others – Multivariate Behavioral Research, 1996
Simulations were used to compare the moderator variable detection capabilities of moderated multiple regression (MMR) and errors-in-variables regression (EIVR). Findings show that EIVR estimates are superior for large samples, but that MMR is better when reliabilities or sample sizes are low. (SLD)
Descriptors: Comparative Analysis, Error of Measurement, Estimation (Mathematics), Interaction
Peer reviewed: Hart, Craig H.; Draper, Thomas W.; Olsen, Joseph A. – Merrill-Palmer Quarterly, 2001
Examined cross-informant concordance, temporal stability, and reliability of sociometrics in 84 preschoolers. Found that parallel forms of teacher and peer sociometrics measured overlapping and unique aspects of popularity. Teacher-measured popularity was highly stable over 8 weeks; peer-measured popularity showed lower stability. Both teacher and…
Descriptors: Comparative Analysis, Peer Acceptance, Peer Evaluation, Peer Relationship
Peer reviewed: O'Brian, Sue; Packman, Ann; Onslow, Mark; O'Brian, Nigel – Journal of Speech, Language, and Hearing Research, 2004
This study investigated the comparative reliability of 2 stuttering measurement tools when used by experienced judges: percentage of syllables stuttered (%SS) and a 9-point severity scale (SEV). The study also investigated the degree to which scores on 1 tool predict scores on the other and the distributions of stuttering when measured by these…
Descriptors: Severity (of Disability), Rating Scales, Interrater Reliability, Stuttering
DeMars, Christine E. – Journal of Educational Measurement, 2006
Four item response theory (IRT) models were compared using data from tests where multiple items were grouped into testlets focused on a common stimulus. In the bi-factor model each item was treated as a function of a primary trait plus a nuisance trait due to the testlet; in the testlet-effects model the slopes in the direction of the testlet…
Descriptors: Item Response Theory, Reliability, Item Analysis, Factor Analysis
Epstein, Monica K.; Poythress, Norman G.; Brandon, Karen O. – Assessment, 2006
The reliability and validity of the Self-Report Psychopathy Scale (SRPS) were examined in a noninstitutionalized offender sample of mixed gender and race. Adequate alpha coefficients were obtained for the total sample and across gender and race. The SRPS was compared to measures of trait anxiety and passive avoidance errors. SRPS total, primary,…
Descriptors: Self Evaluation (Individuals), Race, Sex, Psychopathology
Newman, Michelle G.; Holmes, Marilyn; Zuellig, Andrea R.; Kachin, Kevin E.; Behar, Evelyn – Psychological Assessment, 2006
This study examined the Panic Disorder Self-Report (PDSR), a new self-report diagnostic measure of panic disorder based on the 4th edition of the Diagnostic and Statistical Manual of Mental Disorders (American Psychiatric Association, 1994). PDSR diagnoses were compared with structured interview diagnoses of individuals with generalized anxiety…
Descriptors: Test Reliability, Validity, Diagnostic Tests, Clinical Diagnosis
Evertson, Carolyn M.; And Others – 1977
The stability of classroom behavior is examined from several perspectives: (1) the relative consistency of teacher behavior in two different sections of the same course taught concurrently; (2) the relative consistency of student behavior in math and English classes attended concurrently; and (3) differences in student and teacher behavior in math…
Descriptors: Classroom Observation Techniques, Comparative Analysis, Correlation, Data Analysis
Kash, Bita A.; Hawes, Catherine; Phillips, Charles D. – Gerontologist, 2007
Purpose: This study had two goals: (a) to assess the validity of the Online Survey Certification and Reporting (OSCAR) staffing data by comparing them to staffing measures from audited Medicaid Cost Reports and (b) to identify systematic differences between facilities that over-report or underreport staffing in the OSCAR. Design and Methods: We…
Descriptors: Health Services, Allied Health Personnel, Validity, Comparative Analysis
Crehan, Kevin D.; And Others – 1993
Among the measurement techniques receiving greater attention is the context-dependent item set or testlet. The context-dependent item set consists of a scenario and related test questions. This item format is generally believed to be able to tap higher level thinking. Unfortunately, this item form leads to inter-item dependence within item sets…
Descriptors: Comparative Analysis, Item Response Theory, Measurement Techniques, Reading Tests
Skinner, Robert E. – 1994
The merits and disadvantages of standardized and informal reading tests for limited English proficient readers are discussed. A growing reliance on standardized ("formal") tests due to their ease of administration and scoring is criticized because the tests are seen as: inadequate for describing students at high and low ends of the scale; not…
Descriptors: Comparative Analysis, English (Second Language), Limited English Speaking, Reading Tests
Schael, Jocelyne; Dionne, Jean-Paul – 1991
The basis of agreement or disagreement among judges/evaluators when applying a coding scheme to concurrent verbal protocols was studied. The sample included 20 university graduates, from varied backgrounds; 10 subjects had and 10 subjects did not have experience in protocol analysis. The total sample was divided into four balanced groups according…
Descriptors: Adults, College Graduates, Comparative Analysis, Encoding (Psychology)
Peer reviewed: Stauffer, A. J. – Educational and Psychological Measurement, 1974
Descriptors: Attitude Change, Attitude Measures, Comparative Analysis, Educational Research
The Between Teacher Reliability of the Ekwall Reading Inventory and the Classroom Reading Inventory.
Christine, Charles T.; And Others – 1982
Using a test-retest research design, a study examined the reliability of the Classroom Reading Inventory (CRI) and the Ekwall Reading Inventory (ERI). Independent variables of test administrator to subject, test administrator to test, subject to test, and test order were randomized. Subjects included 31 children aged 7 through 12 years. The four…
Descriptors: Comparative Analysis, Elementary Secondary Education, Informal Reading Inventories, Reading Instruction
Tollefson, Nona; Chung, Jing-Mei – 1986
Procedures for correcting for guessing and for assessing partial knowledge (correction-for-guessing, three-decision scoring, elimination/inclusion scoring, and confidence or probabilistic scoring) are discussed. Mean scores and internal consistency reliability estimates were compared across three administration and scoring procedures for…
Descriptors: Achievement Tests, Comparative Analysis, Evaluation Methods, Graduate Students

