Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 7 |
Descriptor
Test Construction | 70 |
Test Format | 70 |
Test Use | 70 |
Test Validity | 23 |
Elementary Secondary Education | 21 |
Computer Assisted Testing | 18 |
Test Items | 17 |
Test Reliability | 17 |
Standardized Tests | 14 |
Higher Education | 13 |
Scoring | 12 |
More ▼ |
Source
Author
Mott, Michael S. | 2 |
Roeber, Edward D. | 2 |
Straus, Murray A. | 2 |
Alderson, J. Charles | 1 |
Algozzine, Bob | 1 |
Amit Sevak | 1 |
Arter, Judith A. | 1 |
Baird, Leonard L. | 1 |
Bennett, Randy Elliot | 1 |
Burroway, Robert L. | 1 |
Buser, Karen | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 3 |
Postsecondary Education | 3 |
Location
Indonesia | 2 |
Alaska | 1 |
Canada | 1 |
Georgia | 1 |
Japan | 1 |
Maine | 1 |
Michigan | 1 |
New Jersey | 1 |
Singapore | 1 |
Utah | 1 |
Virginia | 1 |
More ▼ |
Laws, Policies, & Programs
Improving Americas Schools… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Muhammad Yoga Prabowo; Sarah Rahmadian – TEFLIN Journal: A publication on the teaching and learning of English, 2023
The outbreak of the COVID-19 pandemic has transformed the educational landscape in a way unseen before. Educational institutions are navigating between offline and online learning worldwide. Computer-based testing is rapidly taking over paper-and-pencil testing as the dominant mode of assessment. In some settings, computer-based and…
Descriptors: English (Second Language), Second Language Learning, Test Format, Language Tests
Yulianto, Ahmad; Pudjitriherwanti, Anastasia; Kusumah, Chevy; Oktavia, Dies – International Journal of Language Testing, 2023
The increasing use of computer-based mode in language testing raises concern over its similarities with and differences from paper-based format. The present study aimed to delineate discrepancies between TOEFL PBT and CBT. For that objective, a quantitative method was employed to probe into scores equivalence, the performance of male-female…
Descriptors: Computer Assisted Testing, Test Format, Comparative Analysis, Scores
Patrick Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Report Series, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…
Descriptors: Assessment Literacy, Testing, Test Bias, Test Construction
Fairman, Janet; Johnson, Amy; Mette, Ian; Wickerd, Garry; LaBrie, Sharon – Center for Education Policy, Applied Research, and Evaluation, 2018
The Maine Legislature requested the Maine Education Policy Research Institute (MEPRI) to conduct an assessment of standardized testing in Maine schools to understand the amount, cost, and usefulness of it. This report summarizes the resulting effort, which included a literature scan, document analysis, and surveys of two groups of school…
Descriptors: Standardized Tests, Educational Assessment, Educational Benefits, Screening Tests
Sriram, Rishi – NASPA - Student Affairs Administrators in Higher Education, 2014
When student affairs professionals assess their work, they often employ some type of survey. The use of surveys stems from a desire to objectively measure outcomes, a demand from someone else (e.g., supervisor, accreditation committee) for data, or the feeling that numbers can provide an aura of competence. Although surveys are effective tools for…
Descriptors: Surveys, Test Construction, Student Personnel Services, Test Use
Wah-Mei Kodimer – ProQuest LLC, 2015
The purpose of the proposed study is to add to the literature regarding the assessment of effort and malingering in the field of neuropsychology using the Rey Complex Figure Test (RCFT). The majority of the literature on this measure has been in the specific areas for which the instrument was developed, namely those of visual spatial and…
Descriptors: Psychological Evaluation, Neuropsychology, Recognition (Psychology), Test Construction
Hassan, Nurul Huda; Shih, Chih-Min – Language Assessment Quarterly, 2013
This article describes and reviews the Singapore-Cambridge General Certificate of Education Advanced Level General Paper (GP) examination. As a written test that is administered to preuniversity students, the GP examination is internationally recognised and accepted by universities and employers as proof of English competence. In this article, the…
Descriptors: Foreign Countries, College Entrance Examinations, English (Second Language), Writing Tests

Stern, Paul C.; Guagnano, Gregory A.; Dietz, Thomas – Educational and Psychological Measurement, 1998
A brief version of the instrument developed by S. Schwartz (1992, 1994) to measure the structure and content of human values was developed. Studies with 199 adults and 420 adults support the reliability of scores produced by the brief inventory's four three-item scales. Uses of the brief form are discussed. (SLD)
Descriptors: Adults, Reliability, Scores, Test Construction
Kame'enui, Edward; Simmons, Deborah; Cornachione, Cheri – 2001
This guide is designed to provide teachers and reading tutors with an easy-to-use and practical guide to selecting and using reading assessment tools that (1) provides descriptions of reading assessments for English and Spanish speaking students that can be used to diagnose and identify their reading skills and abilities; (2) helps teachers find…
Descriptors: Elementary Education, English, Reading Tests, Spanish Speaking

Rabiner, Donna J.; And Others – Evaluation and Program Planning, 1994
A 14-item instrument, the Dentist Satisfaction Survey-14, a form of a previously validated instrument, is described. Use with 522 dentists, and 29 in a follow-up, indicates that the short form is a parsimonious tool for general evaluation of dentists' job satisfaction. (SLD)
Descriptors: Attitude Measures, Dentists, Evaluation Methods, Followup Studies
Wood, Karen; Algozzine, Bob – Diagnostique, 1990
A format is suggested for testing using free association techniques. In this system, students are given key words or phrases and asked to relate as much relevant information as they can. Scoring techniques, strengths, and weaknesses of such open-ended approaches are discussed, along with possible suggestions for implementation. (PB)
Descriptors: Elementary Secondary Education, Learning Problems, Response Style (Tests), Teaching Methods

Downing, Steven M. – Educational Measurement: Issues and Practice, 1992
Research on true-false (TF), multiple-choice, and alternate-choice (AC) tests is reviewed, discussing strengths, weaknesses, and the usefulness in classroom and large-scale testing of each. Recommendations are made for improving use of AC items to overcome some of the problems associated with TF items. (SLD)
Descriptors: Comparative Analysis, Educational Research, Multiple Choice Tests, Objective Tests
Carlson, Sybil B.; Ward, William C. – 1988
Issues concerning the cost and feasibility of using Formulating Hypotheses (FH) test item types for the Graduate Record Examinations have slowed research into their use. This project focused on two major issues that need to be addressed in considering FH items for operational use: the costs of scoring and the assignment of scores along a range of…
Descriptors: Adaptive Testing, Computer Assisted Testing, Costs, Pilot Projects
Martinez, Michael E.; And Others – 1990
Large-scale testing is dominated by the multiple-choice question format. Widespread use of the format is due, in part, to the ease with which multiple-choice items can be scored automatically. This paper examines automatic scoring procedures for an alternative item type: figural response. Figural response items call for the completion or…
Descriptors: Automation, Computer Assisted Testing, Educational Technology, Multiple Choice Tests

Johnson, Nancy E.; And Others – Assessment, 1994
Development of an alternate form of Raven's Standard Progressive Matrices Test is described. Reliability analysis with 449 children of differing racial/ethnic backgrounds showed good reliability and comparable predictive validity. The alternate form is a promising research tool. (SLD)
Descriptors: Children, Ethnic Groups, Intelligence Tests, Matrices