Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 4 |
Descriptor
| Reliability | 11 |
| Scoring Formulas | 11 |
| Comparative Analysis | 4 |
| Validity | 4 |
| Correlation | 3 |
| Evaluation Methods | 3 |
| Multiple Choice Tests | 3 |
| Statistical Analysis | 3 |
| True Scores | 3 |
| Achievement Tests | 2 |
| Error of Measurement | 2 |
| More ▼ | |
Source
Author
| Bakker, J. | 1 |
| Beek, F. J. A. | 1 |
| Cetin, Bayram | 1 |
| Chung, Jing-Mei | 1 |
| Cross, Lawrence H. | 1 |
| Daniel, Cathy | 1 |
| Dellinger, Amy | 1 |
| Denny, R. Kenton | 1 |
| Guler, Nese | 1 |
| Haaring, C. | 1 |
| Haberman, Shelby J. | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 11 |
| Journal Articles | 9 |
Education Level
| Secondary Education | 3 |
| Elementary Education | 2 |
| Grade 7 | 2 |
| Junior High Schools | 2 |
| Middle Schools | 2 |
| High Schools | 1 |
| Higher Education | 1 |
| Postsecondary Education | 1 |
Audience
Location
| Turkey | 1 |
| United Kingdom (Scotland) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
| Goodenough Harris Drawing Test | 1 |
What Works Clearinghouse Rating
Cetin, Bayram; Guler, Nese; Sarica, Rabia – Eurasian Journal of Educational Research, 2016
Problem Statement: In addition to being teaching tools, concept maps can be used as effective assessment tools. The use of concept maps for assessment has raised the issue of scoring them. Concept maps generated and used in different ways can be scored via various methods. Holistic and relational scoring methods are two of them. Purpose of the…
Descriptors: Generalizability Theory, Concept Mapping, Scoring, Scoring Formulas
Ravesloot, C. J.; Van der Schaaf, M. F.; Muijtjens, A. M. M.; Haaring, C.; Kruitwagen, C. L. J. J.; Beek, F. J. A.; Bakker, J.; Van Schaik, J.P.J.; Ten Cate, Th. J. – Advances in Health Sciences Education, 2015
Formula scoring (FS) is the use of a don't know option (DKO) with subtraction of points for wrong answers. Its effect on construct validity and reliability of progress test scores, is subject of discussion. Choosing a DKO may not only be affected by knowledge level, but also by risk taking tendency, and may thus introduce construct-irrelevant…
Descriptors: Scoring Formulas, Tests, Scores, Construct Validity
Plucker, Jonathan A.; Qian, Meihua; Schmalensee, Stephanie L. – Creativity Research Journal, 2014
In recent years, the social sciences have seen a resurgence in the study of divergent thinking (DT) measures. However, many of these recent advances have focused on abstract, decontextualized DT tasks (e.g., list as many things as you can think of that have wheels). This study provides a new perspective by exploring the reliability and validity…
Descriptors: Creative Thinking, Creativity Tests, Scoring Formulas, Evaluation Methods
Haberman, Shelby J. – ETS Research Report Series, 2008
In educational testing, subscores may be provided based on a portion of the items from a larger test. One consideration in evaluation of such subscores is their ability to predict a criterion score. Two limitations on prediction exist. The first, which is well known, is that the coefficient of determination for linear prediction of the criterion…
Descriptors: Scores, Validity, Educational Testing, Correlation
Peer reviewedNaglieri, Jack A.; Maxwell, Susanna – Perceptual and Motor Skills, 1981
Inter-rater reliability of the Goodenough-Harris and McCarthy Draw-A-Child scoring systems was examined for a sample of 60 children, including 20 school-labeled learning disabled, 20 mentally retarded, and 20 normal children between the ages of six and eight-and-one-half years. (Author)
Descriptors: Correlation, Intelligence Tests, Learning Disabilities, Mental Retardation
Peer reviewedCross, Lawrence H.; And Others – Journal of Experimental Education, 1980
Use of choice-weighted scores as a basis for assigning grades in college courses was investigated. Reliability and validity indices offer little to recommend either type of choice-weighted scoring over number-right scoring. The potential for choice-weighted scoring to enhance the teaching/testing process is discussed. (Author/GK)
Descriptors: Credit Courses, Grading, Higher Education, Multiple Choice Tests
Peer reviewedSpencer, Ernest – Scottish Educational Review, 1981
Using data from the SCRE Criterion Test composition papers, the author tests the hypothesis that the bulk of inter-marker unreliability is caused by inter-marker inconsistency--which is not correctable statistically. He suggests that a shift to "consensus" standards will realize greater improvements than statistical standardizing alone.…
Descriptors: Achievement Tests, English Instruction, Essay Tests, Reliability
Peer reviewedKleven, Thor Arnfinn – Scandinavian Journal of Educational Research, 1979
Supposing different values of the standard measurement error, the relation of scale coarseness to the total amount of error is studied on the basis of probability distribution of error. The analyses are performed within two models of error and with two criteria of amount of error. (Editor/SJL)
Descriptors: Cutting Scores, Error of Measurement, Goodness of Fit, Grading
Tollefson, Nona; Chung, Jing-Mei – 1986
Procedures for correcting for guessing and for assessing partial knowledge (correction-for-guessing, three-decision scoring, elimination/inclusion scoring, and confidence or probabilistic scoring) are discussed. Mean scores and internal consistency reliability estimates were compared across three administration and scoring procedures for…
Descriptors: Achievement Tests, Comparative Analysis, Evaluation Methods, Graduate Students
Peer reviewedStuhlmann, Janice; Daniel, Cathy; Dellinger, Amy; Denny, R. Kenton; Powers, Taylor – Reading Psychology, 1999
Investigates whether training raters to interpret the scoring dimensions on a rubric would increase reliability. Compares two groups of kindergarten and first-grade teachers: one group with training, one without. Finds that training increases raters' abilities to reliably interpret scoring items. (SC)
Descriptors: Childrens Writing, Comparative Analysis, Generalizability Theory, Grade 1
PDF pending restorationKane, Michael T.; Moloney, James M. – 1976
The Answer-Until-Correct (AUC) procedure has been proposed in order to increase the reliability of multiple-choice items. A model for examinees' behavior when they must respond to each item until they answer it correctly is presented. An expression for the reliability of AUC items, as a function of the characteristics of the item and the scoring…
Descriptors: Guessing (Tests), Item Analysis, Mathematical Models, Multiple Choice Tests

Direct link
