Publication Date
In 2025 | 34 |
Since 2024 | 128 |
Since 2021 (last 5 years) | 467 |
Since 2016 (last 10 years) | 873 |
Since 2006 (last 20 years) | 1353 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Practitioners | 195 |
Teachers | 159 |
Researchers | 92 |
Administrators | 49 |
Students | 34 |
Policymakers | 14 |
Parents | 12 |
Counselors | 2 |
Community | 1 |
Media Staff | 1 |
Support Staff | 1 |
More ▼ |
Location
Canada | 62 |
Turkey | 59 |
Germany | 40 |
United Kingdom | 36 |
Australia | 35 |
Japan | 35 |
China | 32 |
United States | 32 |
California | 25 |
United Kingdom (England) | 25 |
Netherlands | 24 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
FitzPatrick, Kathleen A. – Journal of College Science Teaching, 2004
I altered the format of an exercise physiology course from traditional lecture to emphasizing daily reading quizzes and group problem-solving activities. I used the SALGains evaluation to compare the two approaches and saw significant improvements in the evaluation ratings of students who were taught using the new format. Narrative responses…
Descriptors: Test Format, Exercise Physiology, Student Attitudes, Student Evaluation
Read, John – International Journal of English Studies, 2007
This paper surveys some current developments in second language vocabulary assessment, with particular attention to the ways in which computer corpora can provide better quality information about the frequency of words and how they are used in specific contexts. The relative merits of different word lists are discussed, including the Academic Word…
Descriptors: Second Language Learning, Second Language Programs, Vocabulary, Educational Development
Yao, Lihua; Schwarz, Richard D. – Applied Psychological Measurement, 2006
Multidimensional item response theory (IRT) models have been proposed for better understanding the dimensional structure of data or to define diagnostic profiles of student learning. A compensatory multidimensional two-parameter partial credit model (M-2PPC) for constructed-response items is presented that is a generalization of those proposed to…
Descriptors: Models, Item Response Theory, Markov Processes, Monte Carlo Methods
Labi, Aisha – Chronicle of Higher Education, 2006
Educators in Europe are complaining about mishandled changes in a key English-language test and are demanding that the Educational Testing Service (ETS), which runs the examination, delay plans to introduce its new online format in March 2006 in more than 100 countries. Critics of the planned change in the Test of English as a foreign Language…
Descriptors: Foreign Countries, Universities, Foreign Students, Educational Testing
Adams, Richard; And Others – 1993
The purpose of this study was to determine whether it is both possible and cost-effective to revise middle-difficulty Graduate Record Examinations (GRE) discrete items in order to produce items of higher or lower difficulty. The basic procedure was to select items of a given difficulty and, by revising the distractors, make them easier or more…
Descriptors: Analogy, College Entrance Examinations, Cost Effectiveness, Difficulty Level
Bardovi-Harlig, Kathleen; Hartford, Beverly S. – 1993
A study compared the influence of two forms of discourse completion test (DCT) on the elicitation of rejection of advice. An open questionnaire providing scenarios alone was compared with a classic dialogue completion task in which a conversational turn is provided. The tasks were given to 32 graduate students, 19 native and 13 non-native…
Descriptors: Comparative Analysis, Dialogs (Language), Interpersonal Communication, Language Research
Maimon, Lia F. – 1994
Two studies addressed the effects of failure in reading test performance. In experiment 1, 36 students in 3 intact reading and study skills courses at an upstate New York community college completed a questionnaire, were administered an "unsolvable" reading test, were either given no feedback or "failure feedback," an…
Descriptors: Community Colleges, Failure, Reading Research, Reading Tests
Campbell, Todd; And Others – 1995
In the early 1970s A. Constantinople wrote a seminal article that led to the development of the construct of psychological androgyny. The Bem Sex-Role Inventory is a popular measure of the construct, but the measure remains controversial. The construct validity of scores from the measure was explored using confirmatory factor analysis on data from…
Descriptors: Androgyny, College Students, Construct Validity, Factor Structure
Berne, Jane E. – 1993
A study investigated the comparability of second language (L2) listening comprehension tests by examining the effects of varying text type and assessment task on student performance. Student target language experience was an additional variable considered. Subjects were 107 beginning and 64 advanced-intermediate college-level Spanish second…
Descriptors: Evaluation Methods, Language Role, Language Tests, Listening Comprehension
Price, Larry R.; Oshima, T. C. – 1998
Often, educational and psychological measurement instruments must be translated from one language to another when they are administered to different cultural groups. The translation process often necessarily introduces measurement inequivalence. Therefore, an examination may be said to exhibit differential functioning if the test provides a…
Descriptors: Certification, Cross Cultural Studies, Cultural Differences, Diving
Williams, Jane M. – 1991
This manual is designed to assist both special and regular educators with mastering the skills for developing quality teacher-made tests consistent with content-oriented instruction. The manual presents tips for constructing both objective and subjective, supply and select test questions--namely, short answer, essay, fill in the blanks or…
Descriptors: Cognitive Processes, Disabilities, Elementary Secondary Education, Item Analysis
Allen, Nancy L.; And Others – 1992
Many testing programs include a section of optional questions in addition to mandatory parts of a test. These optional parts of a test are not often truly parallel to one another, and groups of examinees selecting each optional test section are not equivalent to one another. This paper provides a general method based on missing-data methods for…
Descriptors: Comparative Testing, Estimation (Mathematics), Graphs, Scaling
Stansfield, Charles W.; Kenyon, Dorry Mann – 1990
This project extended the application of a model for development of semi-direct tests of oral proficiency, originally developed for Chinese, to a diverse set of less commonly taught languages spanning various language families and representing diverse cultural backgrounds. This second, final report covers development of tests for Hebrew, Hausa,…
Descriptors: Hausa, Hebrew, Indonesian, Language Proficiency
Donoghue, John R.; Allen, Nancy L. – 1991
This Monte Carlo study examined strategies for forming the matching variable for the Mantel-Haenszel (MH) differential item functioning (DIF) procedure. Data were generated using a three-parameter logistic item response theory model, with common guessing parameters. The number of subjects and test length were manipulated, as were the difficulty,…
Descriptors: Comparative Analysis, Difficulty Level, Equations (Mathematics), Item Bias
Thiede, Keith W.; And Others – 1991
A correlational analysis was performed to examine the relationship between recognition and recall test formats. A total of 236 college students completed one of four 80-item general knowledge tests; the forms contained 20 items of each of four formats: (1) true; (2) false; (3) multiple-choice; and (4) free response. Ninety-three of the subjects…
Descriptors: Cognitive Processes, College Students, Comparative Testing, Correlation