Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 2 |
| Since 2007 (last 20 years) | 6 |
Descriptor
| Correlation | 7 |
| Test Items | 7 |
| Standard Setting (Scoring) | 5 |
| Cutting Scores | 4 |
| Difficulty Level | 4 |
| Accuracy | 2 |
| College Students | 2 |
| Error of Measurement | 2 |
| Foreign Countries | 2 |
| Interrater Reliability | 2 |
| Item Response Theory | 2 |
| More ▼ | |
Source
| Assessment & Evaluation in… | 1 |
| Educational Sciences: Theory… | 1 |
| International Journal of… | 1 |
| Online Submission | 1 |
| ProQuest LLC | 1 |
| Research Matters | 1 |
Author
| Bayram Çetin | 1 |
| Bramley, Tom | 1 |
| Cope, Ronald T. | 1 |
| Darling, Jonathan | 1 |
| Gelbal, Selahattin | 1 |
| Hagen, Michael D. | 1 |
| Homer, Matt | 1 |
| Kroopnick, Marc Howard | 1 |
| O'Neill, Thomas R. | 1 |
| Peabody, Michael R. | 1 |
| Pell, Godfrey | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 6 |
| Journal Articles | 4 |
| Dissertations/Theses -… | 1 |
| Speeches/Meeting Papers | 1 |
Education Level
| Higher Education | 3 |
| Postsecondary Education | 3 |
Audience
| Researchers | 1 |
Location
| Turkey | 1 |
| United Kingdom | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Rümeysa Kaya; Bayram Çetin – International Journal of Assessment Tools in Education, 2025
In this study, the cut-off scores obtained from the Angoff, Angoff Y/N, Nedelsky and Ebel standard methods were compared with the 50 T score and the current cut-off score in various aspects. Data were collected from 448 students who took Module B1+ English Exit Exam IV and 14 experts. It was seen that while the Nedelsky method gave the lowest…
Descriptors: Standard Setting, Cutting Scores, Exit Examinations, Academic Achievement
Bramley, Tom – Research Matters, 2020
The aim of this study was to compare, by simulation, the accuracy of mapping a cut-score from one test to another by expert judgement (using the Angoff method) versus the accuracy with a small-sample equating method (chained linear equating). As expected, the standard-setting method resulted in more accurate equating when we assumed a higher level…
Descriptors: Cutting Scores, Standard Setting (Scoring), Equated Scores, Accuracy
O'Neill, Thomas R.; Peabody, Michael R.; Stelter, Keith L.; Hagen, Michael D. – Online Submission, 2015
(Purpose) The purpose of our study was to assess the need for an external searchable resource to be used in conjunction with the American Board of Family Medicine's (ABFM) Maintenance of Certification for Family Physicians (MC-FP) Examination, discuss the philosophical question of whether an ESR should be allowed on the examination, and outline…
Descriptors: Licensing Examinations (Professions), Family Practice (Medicine), Physicians, Online Searching
Çetin, Sevda; Gelbal, Selahattin – Educational Sciences: Theory and Practice, 2013
In this research, the cut score of a foundation university was re-calculated with bookmark method and with Angoff method, each of which is a standard setting method; and the cut scores found were compared with the current proficiency score. Thus, the final cut score was found to be 27.87 with the cooperative work of 17 experts through the Angoff…
Descriptors: Standard Setting (Scoring), Comparative Analysis, Cutting Scores, Correlation
Homer, Matt; Darling, Jonathan; Pell, Godfrey – Assessment & Evaluation in Higher Education, 2012
Over recent years, UK medical schools have moved to more integrated summative examinations. This paper analyses data from the written assessment of undergraduate medical students to investigate two key psychometric aspects of this type of high-stakes assessment. Firstly, the strength of the relationship between examiner predictions of item…
Descriptors: Foreign Countries, Medical Schools, Summative Evaluation, High Stakes Tests
Kroopnick, Marc Howard – ProQuest LLC, 2010
When Item Response Theory (IRT) is operationally applied for large scale assessments, unidimensionality is typically assumed. This assumption requires that the test measures a single latent trait. Furthermore, when tests are vertically scaled using IRT, the assumption of unidimensionality would require that the battery of tests across grades…
Descriptors: Simulation, Scaling, Standard Setting, Item Response Theory
Cope, Ronald T. – 1987
This study used generalizability theory and other statistical concepts to assess the application of the Angoff method to setting cutoff scores on two professional certification tests. A panel of ten judges gave pre- and post-feedback Angoff probability ratings of items of two forms of a professional certification test, and another panel of nine…
Descriptors: Certification, Correlation, Cutting Scores, Error of Measurement

Peer reviewed
Direct link
