Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 3 |
| Since 2007 (last 20 years) | 4 |
Descriptor
Source
| Evaluation and the Health… | 2 |
| Applied Measurement in… | 1 |
| Civil Rights Project /… | 1 |
| ETS Research Report Series | 1 |
| Journal of Educational and… | 1 |
| Nebraska Department of… | 1 |
| Studies in Educational… | 1 |
Author
Publication Type
Education Level
| Elementary Secondary Education | 2 |
| Early Childhood Education | 1 |
| Kindergarten | 1 |
Audience
| Researchers | 2 |
| Policymakers | 1 |
Location
| Arizona | 1 |
| China | 1 |
| Nebraska | 1 |
| North Carolina | 1 |
Laws, Policies, & Programs
| Lau v Nichols | 1 |
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
| National Teacher Examinations | 2 |
| Test of English as a Foreign… | 2 |
| Armed Services Vocational… | 1 |
| National Assessment of… | 1 |
What Works Clearinghouse Rating
Derek C. Briggs – Journal of Educational and Behavioral Statistics, 2024
I consider recent attempts to establish standards, principles, and goals for artificial intelligence (AI) through the lens of educational measurement. Distinctions are made between generative AI and AI-adjacent methods and applications of AI in formative versus summative assessment contexts. While expressing optimism about its possibilities, I…
Descriptors: Artificial Intelligence, Standard Setting, Standards, Measurement
Papageorgiou, Spiros; Wu, Sha; Hsieh, Ching-Ni; Tannenbaum, Richard J.; Cheng, Mengmeng – ETS Research Report Series, 2019
The past decade has seen an emerging interest in mapping (aligning or linking) test scores to language proficiency levels of external performance scales or frameworks, such as the Common European Framework of Reference (CEFR), as well as locally developed frameworks, such as China's Standards of English Language Ability (CSE). Such alignment is…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
Nebraska Department of Education, 2018
The 2018 Nebraska Student-Centered Assessment System (NSCAS) Summative technical report documents the processes and procedures implemented to support the Spring 2018 NSCAS Summative English Language Arts (ELA), Mathematics, and Science assessments by NWEA under the supervision of the Nebraska Department of Education (NDE). The technical report…
Descriptors: Summative Evaluation, Language Tests, English, Mathematics Tests
Florez, Ida Rose – Civil Rights Project / Proyecto Derechos Civiles, 2010
The Arizona English Language Learners Assessment (AZELLA) is used by the Arizona Department of Education to determine which children should receive English support services. AZELLA results are used to determine if children are either proficient in English or have English language skills in one of four pre-proficient categories (pre-emergent,…
Descriptors: Validity, Second Language Learning, Cutting Scores, Kindergarten
Phillips, Gary W., Ed. – 1996
Recently, there has been a significant expansion in the use of performance assessment in large scale testing programs. Although there has been significant support from curriculum and policy stakeholders, the technical feasibility of large scale performance assessments has remained a question. This report is intended to contribute to the debate by…
Descriptors: Comparative Analysis, Generalizability Theory, Performance Based Assessment, Psychometrics
Peer reviewedHamilton, J. S.; McLone, R. R. – Studies in Educational Evaluation, 1989
Influences on the educational validity of examinations are reviewed. Changes occurring in approaches to standard setting are traced. A view of reliability is presented, with emphasis on assessment of project work, which often involves individual investigation and design by students. A consistency index formula for grading standards is presented.…
Descriptors: Cutting Scores, Educational Assessment, Elementary Secondary Education, Standard Setting (Scoring)
Jaeger, Richard M. – 1982
The implicit definition of competence and the inferential chain that links the standard-setting process to the decision outcomes of the method are considered for two classes of standard-setting procedures: those involving data-free judgments of items and those involving data-based judgment of items. The major underlying assumptions of competence…
Descriptors: Competence, Evaluation Methods, Graduation Requirements, High Schools
Peer reviewedMeskauskas, John A. – Evaluation and the Health Professions, 1986
Two new indices of stability of content-referenced standard-setting results are presented, relating variability of judges' decisions to the variability of candidate scores and to the reliability of the test. These indices are used to indicate whether scores resulting from a standard-setting study are of sufficient precision. (Author/LMO)
Descriptors: Certification, Credentials, Error of Measurement, Generalizability Theory
Peer reviewedHambleton, Ronald K.; Slater, Sharon C. – Applied Measurement in Education, 1997
A brief history of developments in the assessment of the reliability of credentialing examinations is presented, and some new results are outlined that highlight the interactions among scoring, standard setting, and the reliability and validity of pass-fail decisions. Decision consistency is an important concept in evaluating credentialing…
Descriptors: Certification, Credentials, Decision Making, Interaction
Busch, John Christian; Jaeger, Richard M. – 1986
This study examined the contribution of five categories of variables to the standard-setting recommendations of a representative sample of 241 judges recommending standards for the General Knowledge and Communication Skills Tests of the National Teacher Examinations (NTE) for the state of North Carolina. It was based on the assumption that…
Descriptors: Achievement, Adults, Cognitive Dissonance, Educational Attitudes
PDF pending restorationPowers, Donald E.; Stansfield, Charles W. – 1983
The increasing interest in oral proficiency during the past decade prompted Educational Testing Service (ETS) to undertake the development of the Test of Spoken English (TSE), a standardized test of speaking proficiency of non-native speakers of English. The test has been validated for the selection of non-native teaching assistants applying to…
Descriptors: Certification, Communicative Competence (Languages), Cutting Scores, English (Second Language)
Peer reviewedHambleton, Ronald K; Rogers, H. Jane – Evaluation and the Health Professions, 1986
Technical advances of the last 15 years in measurement theory and practice are described, notably criterion-referenced testing, item response theory, and computers and testing. Several remaining problems concerning the development and validation of credentialing examinations are also considered. (Author/LMO)
Descriptors: Certification, Computer Assisted Testing, Credentials, Criterion Referenced Tests
Commission on Civil Rights, Washington, DC. – 1993
Because of concerns about the validity of tests used in education and employment, a consultation on June 16, 1989, focused on tests of ability, achievement, and other skills. Invited experts were asked to address a set of issues common to both education and employment testing, primarily related to test construction procedures and how to establish…
Descriptors: Achievement Tests, Aptitude Tests, Civil Rights, Educational Testing
Wigdor, Alexandra K., Ed.; Green, Bert F., Jr., Ed. – 1991
This is the sixth and final report of the National Research Council's Committee on the Performance of Military Personnel on the Joint-Service Job Performance Measurement/Enlistment Standards (JPM) Project, a project designed to develop measures of performance for entry-level military jobs so that enlistment standards could be linked to performance…
Descriptors: Aptitude Tests, Armed Forces, Cost Effectiveness, Educational Assessment
Bowman, Harry L. – 1989
Comparative data and results of validation and standard-setting studies are presented for five subject-matter tests for teacher certification in two states. The five tests evaluated were available from the Educational Testing Service (ETS): one National Teacher Examination specialty area test (special education); and four tests from a consortium…
Descriptors: College Faculty, Comparative Analysis, Content Analysis, Elementary Secondary Education

Direct link
