NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20260
Since 20250
Since 2022 (last 5 years)0
Since 2017 (last 10 years)0
Since 2007 (last 20 years)12
Audience
Laws, Policies, & Programs
No Child Left Behind Act 20012
What Works Clearinghouse Rating
Showing 1 to 15 of 27 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Baird, Jo-Anne; Gray, Lena – Oxford Review of Education, 2016
The ways in which examination standards are conceptualised and operationalised differently across nations has not been given sufficient attention. The international literature on standard-setting has been dominated by the psychometrics tradition. Broader conceptualisations of examination standards have been discussed in the literature in England,…
Descriptors: Foreign Countries, Academic Standards, Position Papers, Educational Policy
Peer reviewed Peer reviewed
Direct linkDirect link
Davis-Becker, Susan L.; Buckendahl, Chad W. – International Journal of Testing, 2013
A critical component of the standard setting process is collecting evidence to evaluate the recommended cut scores and their use for making decisions and classifying students based on test performance. Kane (1994, 2001) proposed a framework by which practitioners can identify and evaluate evidence of the results of the standard setting from (1)…
Descriptors: Standard Setting (Scoring), Evidence, Validity, Cutting Scores
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Tingting, Xu; Hua, Ma; Xiujuan, Wang; Jing, Wang – Higher Education Studies, 2015
The traditional JAVA course examination is just a list of questions from which we cannot know students' skills of programming. According to the eight abilities in curriculum objectives, we designed an assessment standard of JAVA programming course that is based on employment orientation and apply it to practical teaching to check the teaching…
Descriptors: Programming Languages, Programming, Behavioral Objectives, Labor Needs
Peer reviewed Peer reviewed
Direct linkDirect link
Allen, Martin – FORUM: for promoting 3-19 comprehensive education, 2013
Well before the examinations grade crisis of 2012, Michael Gove had set out clear intentions for reforming public examinations. Though he claimed to be improving examinations and assessment by replicating practices that took place in high-performing countries and thus improving the ability of the UK economy to "compete", this…
Descriptors: Foreign Countries, Academic Standards, Educational Change, Standardized Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Wyse, Adam E. – Educational and Psychological Measurement, 2011
Standard setting is a method used to set cut scores on large-scale assessments. One of the most popular standard setting methods is the Bookmark method. In the Bookmark method, panelists are asked to envision a response probability (RP) criterion and move through a booklet of ordered items based on a RP criterion. This study investigates whether…
Descriptors: Testing Programs, Standard Setting (Scoring), Cutting Scores, Probability
Peer reviewed Peer reviewed
Direct linkDirect link
Lissitz, Robert W.; Wei, Hua – Educational Measurement: Issues and Practice, 2008
In this article we address the issue of consistency in standard setting in the context of an augmented state testing program. Information gained from the external NRT scores is used to help make an informed decision on the determination of cut scores on the state test. The consistency of cut scores on the CRT across grades is maintained by forcing…
Descriptors: Testing Programs, State Programs, Standard Setting, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Davis, Susan L.; Buckendahl, Chad W.; Plake, Barbara S. – Journal of Educational Measurement, 2008
As an alternative to adaptation, tests may also be developed simultaneously in multiple languages. Although the items on such tests could vary substantially, scores from these tests may be used to make the same types of decisions about different groups of examinees. The ability to make such decisions is contingent upon setting performance…
Descriptors: Test Results, Testing Programs, Multilingualism, Standard Setting
Peer reviewed Peer reviewed
Direct linkDirect link
Alderson, J. Charles – Language Assessment Quarterly, 2011
The International Civil Aviation Association has developed a set of Language Proficiency Requirements (LPRs) and a Language Proficiency Rating Scale, which seeks to define proficiency in the language needed for aviation purposes at six different levels. Pilots, air traffic controllers and aeronautical station operators are required to achieve at…
Descriptors: Business Communication, Rating Scales, Language Proficiency, Educational Policy
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, Guemin; Lewis, Daniel M. – Educational and Psychological Measurement, 2008
The bookmark standard-setting procedure is an item response theory-based method that is widely implemented in state testing programs. This study estimates standard errors for cut scores resulting from bookmark standard settings under a generalizability theory model and investigates the effects of different universes of generalization and error…
Descriptors: Generalizability Theory, Testing Programs, Error of Measurement, Cutting Scores
Peer reviewed Peer reviewed
Direct linkDirect link
O'Neill, Thomas R.; Buckendahl, Chad W.; Plake, Barbara S.; Taylor, Lynda – Language Assessment Quarterly, 2007
Licensure testing programs in the United States (e.g., nursing) face an increasing challenge of measuring the competency of internationally trained candidates, both in relation to their clinical competence and their English language competence. To assist with the latter, professional licensing bodies often adopt well-established and widely…
Descriptors: Testing Programs, Testing, Language Tests, Standard Setting
Peer reviewed Peer reviewed
Direct linkDirect link
Ferrara, Steve; Perie, Marianne; Johnson, Eugene – Journal of Applied Testing Technology, 2008
Psychometricians continue to introduce new approaches to setting cut scores for educational assessments in an attempt to improve on current methods. In this paper we describe the Item-Descriptor (ID) Matching method, a method based on IRT item mapping. In ID Matching, test content area experts match items (i.e., their judgments about the knowledge…
Descriptors: Test Results, Test Content, Testing Programs, Educational Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Childs, Ruth A.; Jaciw, Andrew P.; Saunders, Kelsey – International Journal of Testing, 2007
Many approaches to standard-setting use item calibration and student score estimation results to structure panelists' tasks. However, this requires collecting standard-setting judgments after the item analysis results are available. The Scoring Guide Alignment approach collects standard-setting judgments during the scoring sessions from teachers…
Descriptors: Testing Programs, Scoring, Item Analysis, Test Items
Peer reviewed Peer reviewed
Green, Donald Ross; Trimble, C. Scott; Lewis, Daniel M. – Educational Measurement: Issues and Practice, 2003
Describes the procedures by which Kentucky's state assessment program synthesized results from three standard setting procedures (Contrasting Groups, Bookmark, and Jaeger-Mills) for the 2000 state assessment. Shows the value of using multiple standard-setting approaches to gather information from each. (SLD)
Descriptors: Achievement Tests, Standard Setting, State Programs, Synthesis
Peer reviewed Peer reviewed
Sireci, Stephen G.; Robin, Frederic; Patelis, Thanos – Applied Measurement in Education, 1999
Presents a procedure for standard setting that involves the cluster analysis of test takers to discover examinee groups that are useful for envisioning marginally competent performance or defining borderline or contrasting groups. Illustrates use of the procedure with a statewide mathematics test, and concludes that cluster analysis is useful in…
Descriptors: Cluster Analysis, Mathematics Tests, Standard Setting (Scoring), Standards
Peer reviewed Peer reviewed
Engelhard, George, Jr.; Anderson, David W. – Applied Measurement in Education, 1998
A new approach for examining the quality of judgments from standard-setting judges using a Binomial Trials Model (BTM) is presented and illustrated with 26 judges from the Georgia High School Graduation Test. Results suggest that the BTM provides information not available from other methods. (SLD)
Descriptors: Graduation Requirements, High Schools, Judges, Standard Setting (Scoring)
Previous Page | Next Page ยป
Pages: 1  |  2