Papageorgiou, Spiros; Davis, Larry; Norris, John M.; Garcia Gomez, Pablo; Manna, Venessa F.; Monfils, Lora – Educational Testing Service, 2021
The "TOEFL® Essentials"™ test is a new English language proficiency test in the "TOEFL"® family of assessments. It measures foundational language skills and communication abilities in academic and general (daily life) contexts. The test covers the four language skills of reading, listening, writing, and speaking and is intended…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Language Proficiency
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018
The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…
Descriptors: Test Content, Difficulty Level, Test Items, Test Construction
Welch, Catherine J.; Dunbar, Stephen B. – Educational Measurement: Issues and Practice, 2020
The use of assessment results to inform school accountability relies on the assumption that the test design appropriately represents the content and cognitive emphasis reflected in the state's standards. Since the passage of the Every Student Succeeds Act and the certification of accountability assessments through federal peer review practices,…
Descriptors: Accountability, Test Construction, State Standards, Content Validity
Oliveri, María Elena; Nastal, Jessica; Slomp, David – ETS Research Report Series, 2020
This report discusses frameworks and assessment development approaches to consider fairness, opportunity to learn, and consequences of test use in the design and use of assessments administered to diverse populations. Examples include the integrated design and appraisal framework and the sociocognitively based evidence-centered design approach.…
Descriptors: Culture Fair Tests, Guidelines, Test Use, Test Construction
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…
Descriptors: Testing, Standards, Comparative Analysis, Test Content
Sanders, Sara – National Technical Assistance Center for the Education of Neglected or Delinquent Children and Youth (NDTAC), 2019
This guide is designed to assist States, agencies, and/or facilities who work with youth who are neglected, delinquent, or at-risk (N or D). The information in the guide will benefit those who are (a) interested in implementing pre-posttests, (b) in the process of identifying an appropriate pre-posttest, or (c) ready to evaluate current testing…
Descriptors: At Risk Students, Delinquency, Pretests Posttests, Testing
Smith, Michael K. – Phi Delta Kappan, 2010
A national test can be designed for everyone--students, workers, etc.--that would measure their achievement in mathematics and other subjects and provide a score, normed at various levels from preschool to graduate school. The Internet and computer technology would enable both widespread administration of the test and access to scores that can be…
Descriptors: Test Use, Mathematics Achievement, Test Construction, National Competency Tests
Eignor, Daniel R. – Journal of Educational Measurement, 1997 (peer reviewed)
The authors of the "Guidelines," a task force of eight, intend to present an organized list of features to be considered in reporting or evaluating computerized-adaptive assessments. Apart from a few weaknesses, the book is a useful and complete document that will be very helpful to test developers. (SLD)
Descriptors: Adaptive Testing, Computer Assisted Testing, Evaluation Methods, Guidelines
Mathieu, Cindy K. – 1997
This paper presents six steps in test construction generally recommended by measurement textbook authors. The focus is primarily on paper-and-pencil achievement tests as used by class instructors, although the discussion touches on the construction of other types of assessment. The six steps are: (1) determine the test purpose; (2) determine the…
Descriptors: Achievement Tests, Difficulty Level, Measurement Techniques, Selection
Tambini, Robert F. – 1999
The quality and the effectiveness of the 1992 New Jersey Grade 8 Early Warning Test (NJEWT) are assessed. Standardized tests possess clear advantages for educators, especially in the case of administration and scoring, but there are clear disadvantages as well, including the possibility of bias. Four criteria are applied to the NJEWT: adequacy,…
Descriptors: Achievement Tests, Grade 8, Junior High School Students, Junior High Schools
Rudner, Lawrence M. – 1994
The "Standards for Educational and Psychological Testing" of the American Educational Research Association, the American Psychological Association, and the National Council on Measurement in Education are intended to provide a comprehensive basis for evaluating tests. This digest identifies key standards applicable to most test…
Descriptors: Ability, Academic Achievement, Evaluation Methods, Norms
Reber, Anne M. – 1995
The Wechsler Intelligence Scale for Children-Third Edition (WISC-III) is an individually administered test of intelligence for assessing children aged 6 through 16 years, 11 months. The WISC-III consists of several subtests, each classified into a verbal or performance scale. The child's performance on these measures is summarized in three…
Descriptors: Children, Intelligence Quotient, Intelligence Tests, Performance Based Assessment
Gitomer, Drew H. – 1997
Assessing complex teaching performance in the National Board for Professional Teaching Standards (NBPTS) has caused the Educational Testing Service to wrestle with fundamental scoring issues that are both conceptual and technical. This report reviews the challenges encountered, how they are being addressed, and what the NBPTS effort has learned…
Descriptors: Elementary Secondary Education, Higher Education, Licensing Examinations (Professions), Performance Based Assessment
Bennett, Randy Elliot – 1998
This paper offers a scenario for how educational assessment might change in response to market forces that affect not only the future of large-scale testing but also society in general. The scenario divides into three generations distinguished by the purpose of testing, test format and content, and the extent to which testing capitalizes on new…
Descriptors: Accountability, Computer Assisted Testing, Educational Assessment, Educational Planning
LeMahieu, Paul G.; And Others – Educational Measurement: Issues and Practice, 1995 (peer reviewed)
Results are presented from a study of portfolio use for large-scale assessment in the Pittsburgh (Pennsylvania) public schools that are more encouraging for the use of this form of assessment than many other studies have been. Requirements for the psychometric integrity of portfolio assessment are discussed. (SLD)
Descriptors: Educational Assessment, Performance Based Assessment, Portfolios (Background Materials), Psychometrics