Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedGoodwin, Laura D.; Leech, Nancy L. – Measurement and Evaluation in Counseling and Development, 2003
The treatment of validity in the newest edition of "Standards for Educational and Psychological Testing" is quite different from coverage in earlier editions of the Standards and in most measurement textbooks. The view of validity in the 1999 Standards is discussed, and suggestions for instructors of measurement courses are offered. (Contains 56…
Descriptors: Educational Testing, Evaluation Methods, Psychological Testing, Standards
Somers, Marie-Andree; Zhu, Pei; Wong, Edmond – National Center for Education Evaluation and Regional Assistance, 2011
This study examines the practical implications of using state tests to measure student achievement in impact evaluations that span multiple states and grades. In particular, the study examines the sensitivity of impact findings to (1) the type of assessment used to measured achievement (state tests or an external assessment administered by the…
Descriptors: Evaluators, Grades (Scholastic), Academic Achievement, Program Effectiveness
Noble, Tracy; Suarez, Catherine; Rosebery, Ann; O'Connor, Mary Catherine; Warren, Beth; Hudicourt-Barnes, Josiane – Journal of Research in Science Teaching, 2012
Education policy in the U.S. in the last two decades has emphasized large-scale assessment of students, with growing consequences for schools, teachers, and students. Given the high stakes of such tests, it is important to understand the relationships between students' answers to test items and their knowledge and skills in the tested content…
Descriptors: Testing, Science Tests, Second Language Learning, Measures (Individuals)
Bramley, Tom; Gill, Tim – Research Papers in Education, 2010
The rank-ordering method for standard maintaining was designed for the purpose of mapping a known cut-score (e.g. a grade boundary mark) on one test to an equivalent point on the test score scale of another test, using holistic expert judgements about the quality of exemplars of examinees' work (scripts). It is a novel application of an old…
Descriptors: Scores, Psychometrics, Measurement Techniques, Foreign Countries
Sampson, Demetrios G., Ed.; Ifenthaler, Dirk, Ed.; Isaías, Pedro, Ed. – International Association for Development of the Information Society, 2021
These proceedings contain the papers of the 18th International Conference on Cognition and Exploratory Learning in the Digital Age (CELDA 2021), held virtually, due to an exceptional situation caused by the COVID-19 pandemic, from October 13-15, 2021, and organized by the International Association for Development of the Information Society…
Descriptors: Computer Simulation, Open Educational Resources, Telecommunications, Handheld Devices
Barry, Carol L.; Finney, Sara J. – Research & Practice in Assessment, 2009
The effects of gathering test scores under low-stakes conditions has been a prominent domain of research in the assessment and testing literature. One important area within this larger domain concerns the implications of a test being low-stakes on test evaluation and development. The current study examined one variable, the testing context, that…
Descriptors: Testing, Context Effect, Comparative Analysis, Test Validity
Green, Donald Ross; Draper, John F. – 1972
This paper considers the question of bias in group administered academic achievement tests, bias which is inherent in the instruments themselves. A body of data on the test of performance of three disadvantaged minority groups--northern, urban black; southern, rural black; and, southwestern, Mexican-Americans--as tryout samples in contrast to…
Descriptors: Achievement Tests, Bias, Comparative Testing, Educational Testing
National Inst. of Education (ED), Washington, DC. – 1981
Barbara Jordan served as the hearing officer for three-day adversary evaluation hearings about the pros and cons of minimum competency testing (MCT). This report is the complete transcript of the second day of proceedings. The pro team, lead by James Popham, began by presenting representatives of four states (Florida, California, Texas, and…
Descriptors: Cutting Scores, Elementary Secondary Education, Hearings, Minimum Competency Testing
Leclercq, Dieudonne – Evaluation in Education: An International Review Series, 1982
In a confidence weighting situation, the examinee is asked to indicate the correct answer, and how certain he or she is of the correctness of that answer. This paper reviews the bases for confidence marking, its validity and accuracy in evaluating students, and it's use in research. (BW)
Descriptors: Confidence Testing, Educational Research, Measurement Techniques, Models
Casey, Emmett – New Directions for Community Colleges, 1987
Offers background on issues related to the testing of disabled students. Reports on a survey about testing accommodation for disabled students provided by California community colleges. Recommends additional ways in which testing practices can be modified to meet the needs of disabled students. (DMM)
Descriptors: Accessibility (for Disabled), Community Colleges, Design Requirements, Educational Testing
Peer reviewedCurtis, Connie June; And Others – Educational and Psychological Measurement, 1979
The score distributions of the two methods of administration described in the title revealed comparable means, standard deviations, and general shape of distribution. With respect to validity coefficients, no appreciable differences were found. (JKS)
Descriptors: Comparative Testing, Educational Testing, Eye Hand Coordination, Grade 2
Allen, Nancy L.; And Others – 1992
Many testing programs include a section of optional questions in addition to mandatory parts of a test. These optional parts of a test are not often truly parallel to one another, and groups of examinees selecting each optional test section are not equivalent to one another. This paper provides a general method based on missing-data methods for…
Descriptors: Comparative Testing, Estimation (Mathematics), Graphs, Scaling
Moreno, Kathleen E.; And Others – 1983
The relationship between selected subtests from the Armed Services Vocational Aptitude Battery (ASVAB) and corresponding subtests administered as computerized adaptive tests (CAT) was investigated using a sample of Marine recruits. Results showed that the CAT subtest scores correlated as well with initial ASVAB scores as did ASVAB retest scores,…
Descriptors: Adaptive Testing, Aptitude Tests, Computer Assisted Testing, Correlation
Kristof, Walter – 1972
We are concerned with the hypothesis that two variables have a perfect disattenuated correlation, hence measure the same trait except for errors of measurement. This hypothesis is equivalent to saying, within the adopted model, that true scores of two psychological tests satisfy a linear relation. A statistical test of this hypothesis is derived…
Descriptors: Correlation, Error of Measurement, Factor Analysis, Hypothesis Testing
Osborn, William C. – 1977
Four essential dimensions of a performance test are detailed: directness of test method, type of criterion, standardization of conditions, and objectivity of scoring. For simplicity these factors are described as if each were dichotomous, when in actuality each is a continuum; a test method may be more or less direct, conditions more or less…
Descriptors: Performance Tests, Scoring, Test Reliability, Test Validity

Direct link
