Uebersax, John; Grove, Will – 1989
Methods of probability modeling to analyze rater agreement are described, emphasizing their basic similarities and viewing them as variants of a common methodology. Statistical techniques for analyzing agreement data are described to address questions such as how many opinions are required to make a medical diagnosis with necessary accuracy. Kappa…
Descriptors: Clinical Diagnosis, Correlation, Estimation (Mathematics), Evaluation Methods
Deutsch, Stuart Jay; Malmborg, Charles J. – Evaluation and Program Planning, 1986 (peer reviewed)
A questionnaire is designed to allow assessment of a simple additive value function for testing respondent preferences for different types of information used in evaluating police services. Responses are analyzed to determine what types of information different stakeholder groups consider useful. (Author/LMO)
Descriptors: Adults, Analysis of Variance, Data Collection, Evaluation Methods
Cason, Gerald J.; Cason, Carolyn L. – 1989
The use of three remedies for errors in the measurement of ability that arise from differences in rater stringency is discussed. Models contrasted are: (1) Conventional; (2) Handicap; and (3) deterministic Rater Response Theory (RRT). General model requirements, power, bias of measures, computing cost, and complexity are contrasted. Contrasts are…
Descriptors: Ability, Achievement Rating, Error of Measurement, Evaluation Methods
McCrae, Robert R. – Multivariate Behavioral Research, 1993 (peer reviewed)
To assess cross-observer agreement on personality profiles, an Index of Profile Agreement and an associated coefficient are proposed that take into account both the difference between the ratings and the extremes of their mean. Data from the Revised NEO Personality Inventory for 250 peer ratings/self-reports and 68 spouse ratings/self-reports…
Descriptors: Adults, Comparative Analysis, Equations (Mathematics), Evaluation Methods
Goffin, Richard D.; Jackson, Douglas N. – Multivariate Behavioral Research, 1992 (peer reviewed)
The way in which trait and rater variance combine in multitrait-multirater (MTMR) performance appraisal data is explored. Implications of the confirmatory factor analytic model and the composite direct product (CDP) model for MTMR data are examined. Superior fit of the CDP model for four data sets is discussed. (SLD)
Descriptors: Equations (Mathematics), Evaluation Methods, Goodness of Fit, Interrater Reliability
Jaeger, Richard M. – Educational Measurement: Issues and Practice, 1991 (peer reviewed)
Issues concerning the selection of judges for standard setting are discussed. Determining the consistency of judges' recommendations, or their congruity with other expert recommendations, would help in selection. Enough judges must be chosen to allow estimation of recommendations by an entire population of judges. (SLD)
Descriptors: Cutting Scores, Evaluation Methods, Evaluators, Examiners
Raymond, Mark R.; Viswesvaran, Chockalingam – Journal of Educational Measurement, 1993 (peer reviewed)
Three variations of a least squares regression model are presented that are suitable for determining and correcting for rating error in designs in which examinees are evaluated by a subset of possible raters. Models are applied to ratings from 4 administrations of a medical certification examination in which 40 raters and approximately 115…
Descriptors: Error of Measurement, Evaluation Methods, Higher Education, Interrater Reliability
Houston, Walter M.; And Others – Applied Psychological Measurement, 1991 (peer reviewed)
The effectiveness of alternative procedures to correct for rater leniency/stringency effects was studied when true scores were known. Ordinary least squares, weighted least squares, and imputation of the missing data consistently outperformed averaging the observed ratings; and the imputation technique was superior to the least squares methods.…
Descriptors: Comparative Analysis, Computer Simulation, Educational Assessment, Equations (Mathematics)
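The least squares corrections for rater leniency/stringency mentioned in the two records above can be illustrated with a minimal sketch. It assumes a simple additive model (rating = examinee ability + rater leniency + error) with rater effects constrained to sum to zero; the abstracts do not give the models' exact form, so this is not the authors' procedure. The function name `ols_rater_adjustment` and the toy data are hypothetical.

```python
import numpy as np


def ols_rater_adjustment(examinee, rater, rating, n_examinees, n_raters):
    """Fit rating = ability[examinee] + leniency[rater] by ordinary least squares.

    Assumption for this sketch: rater effects sum to zero, which makes the
    additive model identifiable when the rating design is connected.
    """
    examinee = np.asarray(examinee)
    rater = np.asarray(rater)
    rating = np.asarray(rating, dtype=float)
    n_obs = rating.size

    # One indicator column per examinee, followed by one per rater.
    X = np.zeros((n_obs + 1, n_examinees + n_raters))
    rows = np.arange(n_obs)
    X[rows, examinee] = 1.0
    X[rows, n_examinees + rater] = 1.0

    # Extra row encodes the constraint sum(leniency) = 0.
    X[n_obs, n_examinees:] = 1.0
    y = np.append(rating, 0.0)

    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef[:n_examinees], coef[n_examinees:]


# Hypothetical noise-free data: 4 examinees, 3 raters, 2 ratings per examinee.
true_ability = np.array([50.0, 60.0, 70.0, 80.0])
true_leniency = np.array([-5.0, 0.0, 5.0])
examinee = np.array([0, 0, 1, 1, 2, 2, 3, 3])
rater = np.array([0, 1, 1, 2, 0, 2, 0, 1])
rating = true_ability[examinee] + true_leniency[rater]

ability, leniency = ols_rater_adjustment(examinee, rater, rating, 4, 3)

# Baseline for comparison: the plain average of each examinee's observed ratings.
raw_mean = np.bincount(examinee, weights=rating) / np.bincount(examinee)
```

In this toy design the raw averages depend on which raters an examinee happened to draw, while the adjusted estimates do not. With real, noisy ratings the recovery would only be approximate.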
International Association for Development of the Information Society, 2012
The IADIS CELDA 2012 conference aimed to address the main issues concerning evolving learning processes and supporting pedagogies and applications in the digital age. Advances in both cognitive psychology and computing have affected the educational arena. The convergence of these two disciplines is increasing at a…
Descriptors: Academic Achievement, Academic Persistence, Academic Support Services, Access to Computers


