Publication Date
| In 2026 | 0 |
| Since 2025 | 52 |
| Since 2022 (last 5 years) | 194 |
| Since 2017 (last 10 years) | 494 |
| Since 2007 (last 20 years) | 742 |
Descriptor
| Test Items | 1186 |
| Test Reliability | 1186 |
| Test Validity | 684 |
| Test Construction | 565 |
| Foreign Countries | 348 |
| Difficulty Level | 279 |
| Item Analysis | 252 |
| Psychometrics | 233 |
| Item Response Theory | 219 |
| Factor Analysis | 183 |
| Multiple Choice Tests | 172 |
Author
| Schoen, Robert C. | 12 |
| LaVenia, Mark | 5 |
| Liu, Ou Lydia | 5 |
| Anderson, Daniel | 4 |
| Bauduin, Charity | 4 |
| DiLuzio, Geneva J. | 4 |
| Farina, Kristy | 4 |
| Haladyna, Thomas M. | 4 |
| Huck, Schuyler W. | 4 |
| Petscher, Yaacov | 4 |
| Stansfield, Charles W. | 4 |
Audience
| Practitioners | 39 |
| Researchers | 30 |
| Teachers | 24 |
| Administrators | 13 |
| Support Staff | 3 |
| Counselors | 2 |
| Students | 2 |
| Community | 1 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 68 |
| Indonesia | 37 |
| Germany | 20 |
| Canada | 17 |
| Florida | 17 |
| China | 16 |
| Australia | 15 |
| California | 12 |
| Iran | 11 |
| India | 10 |
| New York | 9 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Gorin, Joanna S.; Embretson, Susan E. – Applied Psychological Measurement, 2006
Recent assessment research joining cognitive psychology and psychometric theory has introduced a new technology, item generation. In algorithmic item generation, items are systematically created based on specific combinations of features that underlie the processing required to correctly solve a problem. Reading comprehension items have been more…
Descriptors: Difficulty Level, Test Items, Modeling (Psychology), Paragraph Composition
Andrews, Jac J. W.; Conte, Richard – Canadian Journal of School Psychology, 2005
This article describes the development and use of a norm-referenced instrument called the Healthy School Indicator Tool (HSIT) that was designed to assist educational professionals in monitoring their progress in addressing critical health issues in schools. Factor analyses of two administrations of the survey indicated a stable factor structure. In…
Descriptors: Comprehensive School Health Education, Health Programs, Factor Structure, School Health Services
Hyers, Albert D.; Anderson, Paul S. – 1991
Using matched pairs of geography questions, a new testing method for machine-scored fill-in-the-blank, multiple-digit testing (MDT) questions was compared to the traditional multiple-choice (MC) style. Data were from 118 matched or parallel test items for 4 tests from 764 college students of geography. The new method produced superior results when…
Descriptors: College Students, Comparative Testing, Computer Assisted Testing, Difficulty Level
Ward, William C.; And Others – 1986
The keylist format (rather than the conventional multiple-choice format) for item presentation provides a machine-scorable surrogate for a truly free-response test. In this format, the examinee is required to think of an answer, look it up in a long ordered list, and enter its number on an answer sheet. The introduction of keylist items into…
Descriptors: Analogy, Aptitude Tests, Construct Validity, Correlation
Legg, Sue M.; Algina, James – 1986
This paper focuses on the questions which arise as test practitioners monitor score scales derived from latent trait theory. Large scale assessment programs are dynamic and constantly challenge the assumptions and limits of latent trait models. Even though testing programs evolve, test scores must remain reliable indicators of progress.…
Descriptors: Difficulty Level, Educational Assessment, Elementary Secondary Education, Equated Scores
New York State Div. for Youth, Albany. – 1985
This guide is designed to serve as a reference to assist providers of Job Training Partnership Act-funded programs in selecting appropriate interest, aptitude, and pre-employment and job readiness tests. Descriptions of 53 interest tests, 38 aptitude tests, and 37 pre-employment and job readiness tests are provided. Each description contains…
Descriptors: Aptitude Tests, Employment Potential, Evaluation Criteria, Guidelines
Young, Robert E. – 1982
Perspectives on testing and grading are considered; and strategies, reference materials, and sample test items are presented. Attention is directed to specific impacts of testing and grading on students and teachers, purposes of testing and grading, and common complaints of students and teachers. Extra activities proposed for the teacher include…
Descriptors: College Instruction, College Students, Criterion Referenced Tests, Educational Testing
Baker, Eva L.; And Others – 1980
The materials presented were developed for use in a series of conferences on testing and instruction sponsored by the National Institute of Education, with the United States Office of Education, the UCLA Center for the Study of Evaluation, and a network of research and development agencies. They are intended for use by school practitioners and…
Descriptors: Criterion Referenced Tests, Elementary Secondary Education, Evaluation Criteria, Item Analysis
Bernknopf, Stan; And Others – 1979
The effectiveness of a model for determining a minimal cut-off score for criterion-referenced tests was examined. The model, based upon techniques presented originally by Nedelsky and by Angoff, was first used in conjunction with a multiple choice test developed for use in certifying school counselors in Georgia. A "knowledge estimation panel" was…
Descriptors: Counselor Certification, Court Litigation, Criterion Referenced Tests, Cutting Scores
Patience, Wayne M.; Reckase, Mark D. – 1979
Simulated tailored tests were used to investigate the relationships between characteristics of the item pool and the computer program, and the reliability and bias of the resulting ability estimates. The computer program was varied to provide for various step sizes (differences in difficulty between successive steps) and different acceptance…
Descriptors: Adaptive Testing, Computer Assisted Testing, Computer Programs, Educational Testing
McKee, Barbara G.; Blake, Rowland S. – 1979
A questionnaire concerning attitudes toward the importance of communication skills and toward different modes of communication was administered to 290 incoming freshmen at the National Technical Institute for the Deaf. Students responded to one of two types of a 38-item questionnaire: a multiple choice, or Likert-type (strongly agree...strongly…
Descriptors: Attitude Measures, Communication Skills, Communication (Thought Transfer), Deafness
Paiva, Rosalia E. A.; Vu, Nu Viet – 1979
The Southern Illinois University School of Medicine applies mastery learning concepts to an objectives-based curriculum and uses the Nedelsky method to assess mastery of core objectives. Provisions are made for remediating deficiencies. The Nedelsky technique is used for determining an acceptable level of performance in multiple choice examinations…
Descriptors: Academic Standards, Competency Based Education, Cutting Scores, Difficulty Level
Godfrey, John R.; Galloway, Ann – Issues in Educational Research, 2004
This report examines the "Performance Indicators in Primary Schools" (PIPS) test as a reliable and cohesive instrument to assess early literacy and numeracy skills among Indigenous children. The process includes the examination of the reliability of the PIPS test using the Cronbach Alpha and the Split-half method with Pearson's r…
Descriptors: Numeracy, Emergent Literacy, Indigenous Populations, Test Validity
Hendrickson, Amy; Patterson, Brian; Melican, Gerald – College Board, 2008
Presented at the annual meeting of the National Council on Measurement in Education (NCME) in New York in March 2008. This presentation explores how different item weighting can affect the effective weights, validity coefficients, and test reliability of composite scores among test takers.
Descriptors: Multiple Choice Tests, Test Format, Test Validity, Test Reliability
Royeen, Charlotte Brasic; And Others – 1994
This paper is a preliminary report on the development of alternate forms of an attitude scale to assess parents' and professionals' views toward the Individualized Family Service Plan (IFSP) process, a process evolving as a result of Federal regulations regarding early-intervention services. Development of the set of attitude scales is unique in…
Descriptors: Attitude Measures, Delivery Systems, Early Intervention, Federal Legislation