NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Type
Reports - Research15
Journal Articles10
Speeches/Meeting Papers4
Opinion Papers1
Audience
Laws, Policies, & Programs
Assessments and Surveys
Alabama High School…1
Praxis Series1
What Works Clearinghouse Rating
Showing all 15 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Daniel McNeish; Melissa G. Wolf – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Despite the popularity of traditional fit index cutoffs like RMSEA [less than or equal to] 0.06 and CFI [greater than or equal to] 0.95, several studies have noted issues with overgeneralizing traditional cutoffs. Computational methods have been proposed to avoid overgeneralization by deriving cutoffs specifically tailored to the characteristics…
Descriptors: Structural Equation Models, Cutting Scores, Generalizability Theory, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Nicolas Rochat; Laurent Lima; Pascal Bressoux – Journal of Psychoeducational Assessment, 2025
Inference is considered an important factor in comprehension models and has been described as a causal factor in predicting comprehension. To date, specific tests for inference are rare and often rely on specific thematic texts. This reliance on thematic inference may raise some concerns as inference is related to prior text-specific knowledge.…
Descriptors: Inferences, Reading Comprehension, Reading Tests, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Maria Treadaway; John Read – Language Testing, 2024
Standard-setting is an essential component of test development, supporting the meaningfulness and appropriate interpretation of test scores. However, in the high-stakes testing environment of aviation, standard-setting studies are underexplored. To address this gap, we document two stages in the standard-setting procedures for the Overseas Flight…
Descriptors: Standard Setting, Diagnostic Tests, High Stakes Tests, English for Special Purposes
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Sridhanyarat, Kietnawin; Pathong, Supakarn; Suranakkharin, Todsapon; Ammaralikit, Amornrat – English Language Teaching, 2021
This study aimed at developing the Silpakorn Test of English Proficiency (STEP), in alignment with the Common European Framework of Reference for Languages (CEFR), and in accordance with the theoretical framework established by Alderson et al. (2006). Four major steps were involved in the test construction. First, English language lecturers who…
Descriptors: Language Tests, Language Proficiency, Second Language Learning, Second Language Instruction
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Akcamete, Gonul; Kayhan, Nilay; Yildirim, A. Emel Sardohan – Cypriot Journal of Educational Sciences, 2017
Professional ethics includes the principles set forth by professional associations and accepted as correct by discussions over time, and which has become the sine qua non of a profession today. Professional ethics are established to increase the quality of professional practices and ensure correct and honest conduct. Not having professional…
Descriptors: Ethics, Special Education, Special Education Teachers, Professional Associations
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Bichi, Ado Abdu; Hafiz, Hadiza; Bello, Samira Abdullahi – International Journal of Evaluation and Research in Education, 2016
High-stakes testing is used for the purposes of providing results that have important consequences. Validity is the cornerstone upon which all measurement systems are built. This study applied the Item Response Theory principles to analyse Northwest University Kano Post-UTME Economics test items. The developed fifty (50) economics test items was…
Descriptors: Item Response Theory, Test Items, Difficulty Level, Statistical Analysis
Kane, Michael; Wilson, Jennifer – 1982
This paper evaluates the magnitude of the total error in estimates of the difference between an examinee's domain score and the cutoff score. An observed score based on a random sample of items from the domain, and an estimated cutoff score derived from a judgmental standard setting procedure are assumed. The work of Brennan and Lockwood (1980) is…
Descriptors: Criterion Referenced Tests, Cutting Scores, Error of Measurement, Mastery Tests
Peer reviewed Peer reviewed
Gross, Leon J. – Journal of Educational Measurement, 1982
Addressing Glass' argument (EJ 198 842) that a lack of interrelater reliability is an inherent deficiency in the Nedelsky technique, poor rater training and the need for a group decision procedure are presented as standard setting problems. (CM)
Descriptors: Academic Standards, Criterion Referenced Tests, Cutting Scores, Evaluation Criteria
Peer reviewed Peer reviewed
Grosse, Martin E.; Wright, Benjamin D. – Evaluation and the Health Professions, 1986
Based on the standard setting procedures or the American Board of Preventive Medicine for their Core Test, this article describes how Rasch measurement can facilitate using test content judgments in setting a standard. Rasch measurement can then be used to evaluate and improve the precision of the standard and to hold it constant across time.…
Descriptors: Certification, Criterion Referenced Tests, Difficulty Level, Health Personnel
Livingston, Samuel A.; Sims-Gunzenhauser, Alice – 1995
A study was conducted to provide information for setting two separate standards, the accuracy score and the documentation score, for the Praxis III: Classroom Performance Assessment (Praxis III). Praxis III is intended for making instructional and licensing decisions about beginning teachers. This standard-setting study was a person-judgment…
Descriptors: Beginning Teachers, Classroom Observation Techniques, Documentation, Elementary Secondary Education
van der Linden, Wim J. – 1982
A latent trait method is presented to investigate the possibility that Angoff or Nedelsky judges specify inconsistent probabilities in standard setting techniques for objectives-based instructional programs. It is suggested that judges frequently specify a low probability of success for an easy item but a large probability for a hard item. The…
Descriptors: Criterion Referenced Tests, Cutting Scores, Error of Measurement, Interrater Reliability
Peer reviewed Peer reviewed
Lang, Harry G. – Journal of Research in Science Teaching, 1982
Reliability, validity, and standards-setting procedure for a criterion-referenced test (Test of Metric Skills) were examined for use in science curricula. Results indicate a number of factors influencing test reliability/validity and that science teachers need to be aware of these factors to enhance accuracy of their judgments. (Author/JN)
Descriptors: College Science, Criterion Referenced Tests, Higher Education, Science Education
Fitzpatrick, Steven J.; And Others – 1994
In 1991 the Measurement and Evaluation Center of the University of Texas at Austin was asked to develop a test for credit by examination in four lower division courses in Japanese. The test (in Japanese) was constructed from locally developed items provided by instructors of Japanese. The developed test consisted of 80 items distributed among…
Descriptors: College Students, Cutting Scores, Equivalency Tests, Higher Education
Halpin, Glennelle; McLean, James E. – 1991
Although the standard-setting method of W. H. Angoff (1971) has broad-based support in the research literature, inconsistencies in the resulting standards do occur. Sources of these inconsistencies are examined in a study of judges, competencies (items), rounds (replications), and the interactions among them. A modified Angoff approach was used to…
Descriptors: Analysis of Variance, Error of Measurement, Evaluators, High Schools
Peer reviewed Peer reviewed
Direct linkDirect link
Polat, Filiz – American Annals of the Deaf, 2006
The article present results of standardization of the Meadow-Kendall Social-Emotional Assessment Inventory for Deaf and Hearing-Impaired Students (Meadow, 1983), school-age version, for use in Turkey. The SEAI is a 59-item measure for assessing socioemotional adjustment of school-age deaf and hearing impaired students. A sample of 1,097 deaf…
Descriptors: Turkish, Deafness, Foreign Countries, Emotional Adjustment