Publication Date
In 2025 | 34 |
Since 2024 | 128 |
Since 2021 (last 5 years) | 467 |
Since 2016 (last 10 years) | 873 |
Since 2006 (last 20 years) | 1353 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Practitioners | 195 |
Teachers | 159 |
Researchers | 92 |
Administrators | 49 |
Students | 34 |
Policymakers | 14 |
Parents | 12 |
Counselors | 2 |
Community | 1 |
Media Staff | 1 |
Support Staff | 1 |
More ▼ |
Location
Canada | 62 |
Turkey | 59 |
Germany | 40 |
United Kingdom | 36 |
Australia | 35 |
Japan | 35 |
China | 32 |
United States | 32 |
California | 25 |
United Kingdom (England) | 25 |
Netherlands | 24 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating

Stevenson, Douglas K. – Language Testing, 1985
Discusses authenticity in language testing in relation to the language proficiency movement. Looks at both sociolinguistic and psychometric principles as they are concerned with authenticity and validity as well as the inferential distance that separates face validity from techical validities. Criticizes the belief that some test type possess…
Descriptors: Language Proficiency, Measurement Techniques, Methods, Psychometrics
Wedman, John F.; Stefanich, Greg P. – Educational Technology, 1984
Offers guidelines for designing computer-based testing which demands high-level cognitive functioning of students using computer-assisted instruction as a learning mode. Examples of conceptual, principle, and procedural learning evaluation approaches, strengths and weaknesses within the formats described, and suggestions for improving the…
Descriptors: Cognitive Objectives, Computer Assisted Instruction, Computer Assisted Testing, Evaluation Methods

Courtenay, Bradley C.; Weidman, Craig – Gerontologist, 1985
Undergraduates (N=141) completed different versions of Palmore's Facts on Aging (FAQ) quizzes to test effects of "don't know" (DK) answers. Findings suggest the DK option yields more accurate knowledge scores, eliminates guessing, enhances the use of FAQ as a research instrument and for pre/post evaluation of training in aging.…
Descriptors: Aging (Individuals), College Students, Educational Gerontology, Guessing (Tests)

Chavez-Oller, Mary Anne; And Others – Language Learning, 1985
Considers whether scores on cloze items are generally sensitive to amounts of context in excess of 10 words on either side of them and, if not, when they are sensitive to long-range constraints. Concludes that some are sensitive to constraints that reach beyond 50 words on either side of a blank. (SED)
Descriptors: Cloze Procedure, Context Clues, Language Research, Language Tests

Pratt, C.; Hacker, R. G. – Educational and Psychological Measurement, 1984
A unidimensional latent trait model was used to test a single-factor hypothesis of the Lawson Classroom Test of Formal Reasoning. The test failed to provide a valid measure of formal reasoning. This was a result of test format which neglected aspects of formal reasoning emphasized by Inhelder and Piaget. (Author/DWH)
Descriptors: Cognitive Processes, Group Testing, Higher Education, Latent Trait Theory

Spurgin, C. B. – Physics Education, 1985
Discusses issues related to examination questions which begin by asking students to "Describe an experiment to..." Indicates that this strategy is useful when focusing on important quantities/phenomena or "celebrated" experiments and that examining boards should not request students to describe experiments which verify or…
Descriptors: Physics, Science Education, Science Experiments, Science Tests

Bieliauskas, Vytautas J.; Farragher, John – Journal of Clinical Psychology, 1983
Administered the House-Tree-Person test to male college students (N=24) to examine the effects of varying the size of the drawing form on the scores. Results suggested that use of the drawing sheet did not have a significant influence upon the quantitative aspects of the drawing. (LLL)
Descriptors: College Students, Higher Education, Intelligence Tests, Males

Katz, Barry M.; McSweeney, Maryellen – Journal of Experimental Education, 1984
This paper developed and illustrated a technique to analyze categorical data when subjects can appear in any number of categories for multigroup designs. Post hoc procedures to be used in conjunction with the presented statistical test are also developed. The technique is a large sample technique whose small sample properties are as yet unknown.…
Descriptors: Data Analysis, Hypothesis Testing, Mathematical Models, Research Methodology

Sanjivamurthy, P.T.; Kumar, V.K. – Contemporary Educational Psychology, 1983
After six weeks of testing college algebra students (n=84) either on recall or recognition tests, the test modes were changed without warning. Results showed that performance suffered when the test mode was changed for students anticipating a recognition test. Students anticipating a recall test did equally well in both test modes. (Author/PN)
Descriptors: Algebra, Higher Education, Long Term Memory, Recall (Psychology)

Kiewra, Kenneth A. – Contemporary Educational Psychology, 1983
No differences in immediate recognition performance were found for 30 undergraduate students who reorganized notes into an instructor-generated matrix versus subjects who reviewed in their typical manner. Reorganization during review resulted in relatively higher achievement on a free recall test, while unstructured review produced higher…
Descriptors: Cues, Encoding (Psychology), Higher Education, Notetaking
Swygert, Kimberly A. – 2003
In this study, data from an operational computerized adaptive test (CAT) were examined in order to gather information concerning item response times in a CAT environment. The CAT under study included multiple-choice items measuring verbal, quantitative, and analytical reasoning. The analyses included the fitting of regression models describing the…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Response Theory, Participant Characteristics
DeVito, Pasquale J., Ed.; Koenig, Judith A., Ed. – 2001
A committee of the National Research Council studied the desirability, feasibility, and potential impact of two reporting practices for National Assessment of Educational Progress (NAEP) results: district-level reporting and market-basket reporting. NAEP's sponsors believe that reporting district-level NAEP results would support state and local…
Descriptors: Elementary Secondary Education, Research Methodology, Research Reports, School Districts
Tobias, Sheila; Raphael, Jacqueline – 1997
This volume, part two of "The Hidden Curriculum," is premised on the belief that testing practices influence educational procedures and learning outcomes. Graduate level science educators shared their assessment techniques in terms of the following categories: (1) exam design; (2) exam format; (3) exam environment; and (4) grading practices.…
Descriptors: College Science, Educational Change, Evaluation, Higher Education
Woldbeck, Tanya – 1998
This paper summarizes some of the basic concepts in test equating. Various types of equating methods, as well as data collection designs, are outlined, with attempts to provide insight into preferred methods and techniques. Test equating describes a group of methods that enable test constructors and users to compare scores from two different forms…
Descriptors: Comparative Analysis, Data Collection, Difficulty Level, Equated Scores
Schulz, E. Matthew; Wang, Lin – 2001
In this study, items were drawn from a full-length test of 30 items in order to construct shorter tests for the purpose of making accurate pass/fail classifications with regard to a specific criterion point on the latent ability metric. A three-item parameter Item Response Theory (IRT) framework was used. The criterion point on the latent ability…
Descriptors: Ability, Classification, Item Response Theory, Pass Fail Grading