NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 35 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020
Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high stake consequences for the public. Denying graduation or forcing…
Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Cuhadar, Ismail; Gelbal, Selahattin – International Journal of Assessment Tools in Education, 2021
The institutions in education use various assessment methods to decide on the proficiency levels of students in a particular construct. This study investigated whether the decisions differed based on the type of assessment: norm-and criterion-referenced assessment. An achievement test with 20 multiple-choice items was administered to 107 students…
Descriptors: Norm Referenced Tests, Criterion Referenced Tests, Decision Making, Achievement Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Sivakorn Tangsakul; Kornwipa Poonpon – rEFLections, 2024
Given the significant global influence of the Common European Framework of Reference for Languages: Teaching, Learning, and Assessment (CEFR) on English language education, this study deals with aligning a university's academic reading tests to the CEFR. It aimed at validating the test construct of the academic reading tests in relation to the…
Descriptors: Alignment (Education), Reading Tests, Second Language Learning, Language Proficiency
Peer reviewed Peer reviewed
Direct linkDirect link
Liu, Ren; Qian, Hong; Luo, Xiao; Woo, Ada – Educational and Psychological Measurement, 2018
Subscore reporting under item response theory models has always been a challenge partly because the test length of each subdomain is limited for precisely locating individuals on multiple continua. Diagnostic classification models (DCMs), providing a pass/fail decision and associated probability of pass on each subdomain, are promising…
Descriptors: Classification, Probability, Pass Fail Grading, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Min, Shangchao; He, Lianzhen – Language Testing, 2022
In this study, we present the development of individualized feedback for a large-scale listening assessment by combining standard setting and cognitive diagnostic assessment (CDA) approaches. We used the performance data from 3,358 students' item-level responses to a field test of a national EFL test primarily intended for tertiary-level EFL…
Descriptors: Feedback (Response), Second Language Learning, Second Language Instruction, English (Second Language)
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Ozarkan, Hatun Betul; Dogan, Celal Deha – Eurasian Journal of Educational Research, 2020
Purpose: This study aimed to compare the cut scores obtained by the Extended Angoff and Contrasting Groups methods for an achievement test consisting of constructed-response items. Research Methods: This study was based on survey research design. In the collection of data, the study group of the research consisted of eight mathematics teachers for…
Descriptors: Standard Setting (Scoring), Responses, Test Items, Cutting Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Clauser, Jerome C.; Hambleton, Ronald K.; Baldwin, Peter – Educational and Psychological Measurement, 2017
The Angoff standard setting method relies on content experts to review exam items and make judgments about the performance of the minimally proficient examinee. Unfortunately, at times content experts may have gaps in their understanding of specific exam content. These gaps are particularly likely to occur when the content domain is broad and/or…
Descriptors: Scores, Item Analysis, Classification, Decision Making
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Papageorgiou, Spiros; Wu, Sha; Hsieh, Ching-Ni; Tannenbaum, Richard J.; Cheng, Mengmeng – ETS Research Report Series, 2019
The past decade has seen an emerging interest in mapping (aligning or linking) test scores to language proficiency levels of external performance scales or frameworks, such as the Common European Framework of Reference (CEFR), as well as locally developed frameworks, such as China's Standards of English Language Ability (CSE). Such alignment is…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Eckes, Thomas – Language Testing, 2017
This paper presents an approach to standard setting that combines the prototype group method (PGM; Eckes, 2012) with a receiver operating characteristic (ROC) analysis. The combined PGM-ROC approach is applied to setting cut scores on a placement test of English as a foreign language (EFL). To implement the PGM, experts first named learners whom…
Descriptors: English (Second Language), Language Tests, Cutting Scores, Standard Setting (Scoring)
Peer reviewed Peer reviewed
Direct linkDirect link
Pill, John; McNamara, Tim – Language Testing, 2016
This paper considers how to establish the minimum required level of professionally relevant oral communication ability in the medium of English for health practitioners with English as an additional language (EAL) to gain admission to practice in jurisdictions where English is the dominant language. A theoretical concern is the construct of…
Descriptors: Specialists, Standard Setting, Language Tests, English (Second Language)
Peer reviewed Peer reviewed
Direct linkDirect link
Hoepner, Andreas G. F.; Unerman, Jeffrey – Accounting Education, 2012
This paper addresses issues raised in two recent papers published in this journal about the UK "Association of Business Schools' Journal Quality Guide (ABS Guide)". While much of the debate about journal rankings in general, and the "ABS Guide" in particular, has focused on the construction, power and (mis)use of these…
Descriptors: Bias, Classification, Quality Assurance, Periodicals
Peer reviewed Peer reviewed
Direct linkDirect link
Hirner, Leo; Kochtanek, Thomas – Community College Journal of Research and Practice, 2012
The continued growth of online programs in higher education has resulted in concerns about how institutions monitor the quality of their online programs. These concerns indicate a need for a process by which online programs may be evaluated and compared. They provided the impetus for this study, the goals of which were to identify quality…
Descriptors: Delphi Technique, Distance Education, Online Courses, Educational Indicators
Peer reviewed Peer reviewed
Direct linkDirect link
Swail, Watson Scott – College and University, 2011
College rankings create much talk and discussion in the higher education arena. This love/hate relationship has not necessarily resulted in better rankings, but rather, more rankings. This paper looks at some of the measures and pitfalls of the current rankings systems, and proposes areas for improvement through a better focus on teaching and…
Descriptors: Higher Education, Measurement Objectives, Measurement Techniques, Classification
Peer reviewed Peer reviewed
Direct linkDirect link
Pill, John; Harding, Luke – Language Testing, 2013
This study identifies a unique context for exploring lay understandings of language testing and, by extension, for characterizing the nature of language assessment literacy among non-practitioners, stemming from data in an inquiry into the registration processes and support for overseas trained doctors by the Australian House of Representatives…
Descriptors: Language Tests, Testing, Foreign Nationals, Foreign Medical Graduates
Peer reviewed Peer reviewed
Direct linkDirect link
Plake, Barbara S.; Huff, Kristen; Reshetar, Rosemary – Applied Measurement in Education, 2010
In many large-scale assessment programs, achievement level descriptors (ALDs) provide a critical role in communicating what scores on the assessment mean and in interpreting what examinees know and are able to do based on their test performance. Based on their test performance, examinees are often classified into performance categories. The…
Descriptors: Evidence, Test Construction, Measurement, Standard Setting
Previous Page | Next Page »
Pages: 1  |  2  |  3