Showing 1 to 15 of 17 results
Peer reviewed
Sibic, Okan; Sesen, Burcin Acar – International Journal of Assessment Tools in Education, 2022
One of the main goals of science education is to make students gain science process skills. Thus, it is significant to measure whether students gain those skills or not. For this purpose, various tests have been produced and used in various studies. This study aims to examine science process skills tests which have been used in the theses produced…
Descriptors: Foreign Countries, Science Education, Science Process Skills, Masters Theses
Berk, Ronald A. – 1980
Two approaches to criterion-referenced measurement are described and contrasted--domain-referenced testing and mastery testing. This paper is organized according to ten issues or stages in test construction: (1) content domain specification; (2) item construction; (3) item domain specification; (4) item analysis; (5) item selection; (6) parallel…
Descriptors: Classification, Criterion Referenced Tests, Mastery Tests, Measurement Objectives
Peer reviewed
Albanese, Mark A. – Educational Measurement: Issues and Practice, 1993
A comprehensive review is given of evidence, with a bearing on the recommendation to avoid use of complex multiple choice (CMC) items. Avoiding Type K items (four primary responses and five secondary choices) seems warranted, but evidence against CMC in general is less clear. (SLD)
Descriptors: Cues, Difficulty Level, Multiple Choice Tests, Responses
Woldbeck, Tanya – 1998
This paper summarizes some of the basic concepts in test equating. Various types of equating methods, as well as data collection designs, are outlined, with attempts to provide insight into preferred methods and techniques. Test equating describes a group of methods that enable test constructors and users to compare scores from two different forms…
Descriptors: Comparative Analysis, Data Collection, Difficulty Level, Equated Scores
Peer reviewed
Downing, Steven M. – Educational Measurement: Issues and Practice, 1992
Research on true-false (TF), multiple-choice, and alternate-choice (AC) tests is reviewed, discussing strengths, weaknesses, and the usefulness in classroom and large-scale testing of each. Recommendations are made for improving use of AC items to overcome some of the problems associated with TF items. (SLD)
Descriptors: Comparative Analysis, Educational Research, Multiple Choice Tests, Objective Tests
Brittain, Mary M.; Brittain, Clay V. – 1981
A behavioral domain is well-defined when it is clear to both test developers and test users which categories of performance should or should not be considered for potential test items. Only those tests that are keyed to well-defined domains meet the definition of criterion-referenced tests. The greatest proliferation of criterion-referenced tests…
Descriptors: Criterion Referenced Tests, Reading Achievement, Reading Tests, Test Construction
Rosner, Frieda C.; Weber, Wilford A. – 1982
A review of the National Teacher Examinations (NTE) has focused on the Commons Examinations component. The Commons, now named the National Teacher Examinations Core Battery, tests general knowledge, communication skills, and professional knowledge in three separate tests. Users may select portions of the tests that best suit their needs at various…
Descriptors: Classroom Techniques, Higher Education, Program Evaluation, Standardized Tests
Gonzalez-Pino, Barbara – 1987
Testing guidelines based on teachers' experience and on a review of literature concerning approaches to the testing of oral second language skills are presented in this paper. Considerations in developing the test included coordination with the syllabus, choice of format, grading, and administration. Suggestions are offered from the practices of…
Descriptors: Evaluation Criteria, Grading, Language Tests, Oral Language
Rinck, Christine – 1986
Steps are outlined in the construction of two self-report assessment scales designed to measure general life satisfaction and leisure activity levels of developmentally disabled adults. The potential usefulness of a reliable, valid self-report instrument as a measurement tool in client programming is noted in the context of a review of similar…
Descriptors: Developmental Disabilities, Leisure Time, Life Satisfaction, Middle Aged Adults
Hambleton, Ronald K.; Eignor, Daniel R. – 1978
In light of the widespread use of competency testing, the authors consider that it is important to determine ways of developing and using competency testing to insure that it achieves its full potential. The paper, in three parts, introduces a model for the development and validation of competency tests, reviews several methods for setting…
Descriptors: Competence, Criterion Referenced Tests, Cutting Scores, Elementary Secondary Education
Buser, Karen – 1996
Most seasoned test developers recognize the importance of thoughtful decision making when constructing a test. Unfortunately, many classroom achievement tests are created by novice test developers who have not received sufficient instruction in item writing (G. Gulliksen, 1986; R. J. Stiggins, 1991). The result is often a test that is poorly…
Descriptors: Achievement Tests, Decision Making, Educational Planning, Evaluation Methods
Bolden, Bernadine J.; Stoddard, Ann – 1980
This study examined the effect of two ways of question phrasing as related to three styles of expository writing on the test performance of elementary school children. Multiple-choice questions were developed for sets of passages which were written using three different syntactic structures and which had different levels of difficulty. The…
Descriptors: Difficulty Level, Elementary Education, Kernel Sentences, Multiple Choice Tests
Peer reviewed
Lundeberg, Mary A.; Fox, Paul W. – Review of Educational Research, 1991
A meta-analysis was conducted of 107 classroom and laboratory studies concerning the effects of expecting a recall, recognition, essay, multiple-choice, or true-false test on students' subsequent achievement. Laboratory studies did not generalize well to classrooms. In classroom studies, subjects performed better when they knew which test type to…
Descriptors: Classroom Research, Comparative Analysis, Elementary Secondary Education, Generalizability Theory
Perez, Kristina M. – 1996
The KeyMath Revised is a power test that measures the understanding and application of mathematics skills and concepts. It is individually administered and is intended for students from kindergarten through the ninth grade to determine student mastery of mathematics concepts. The revised version is designed to be user-friendly for the student and…
Descriptors: Comprehension, Curriculum Development, Diagnostic Tests, Educational Diagnosis
Straus, Murray A.; Hamby, Sherry L. – 1997
The Conflict Tactics Scales (CTS) are intended to measure use of nonviolent discipline, psychological aggression, and physical assault in parent-child and other family relationships. The latter two scales provide a basis for identifying psychological and physical maltreatment. Two revisions of the CTS became available in 1996. One, the CTS2 is…
Descriptors: Child Abuse, Conflict, Data Collection, Emotional Abuse