NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
No Child Left Behind Act 20013
What Works Clearinghouse Rating
Does not meet standards1
Showing 676 to 690 of 728 results Save | Export
Choppin, Bruce – 1982
A strategy for overcoming problems with the Rasch model's inability to handle missing data involves a pairwise algorithm which manipulates the data matrix to separate out the information needed for the estimation of item difficulty parameters in a test. The method of estimation compares two or three items at a time, separating out the ability…
Descriptors: Difficulty Level, Estimation (Mathematics), Goodness of Fit, Item Analysis
Berk, Ronald A. – 1978
Sixteen item statistics recommended for use in the development of criterion-referenced tests were evaluated. There were two major criteria: (1) practicability in terms of ease of computation and interpretation and (2) meaningfulness in the context of the development process. Most of the statistics were based on a comparison of performance changes…
Descriptors: Achievement Tests, Criterion Referenced Tests, Difficulty Level, Guides
Smilansky, Jonathan; Guerin, Robert O. – 1976
A method for establishing minimal acceptable performance levels for criterion referenced multiple choice examinations is described and two procedures for evaluating its validity are discussed. The method involves assigning a priori item difficulties based on the similarity between the distractor and the correct response. In estimating the validity…
Descriptors: Criteria, Criterion Referenced Tests, Cutting Scores, Difficulty Level
Kreines, David C.; Mead, Ronald J. – 1979
An explanation is given of what is meant by "sample-free" item calibration and by "item-free" person measurement as these terms are applied to the one-parameter logistic test theory model of Georg Rasch. When the difficulty of an item is calibrated separately for two different samples the results may differ; but, according the…
Descriptors: Difficulty Level, Equated Scores, Goodness of Fit, Item Analysis
Peer reviewed Peer reviewed
Kaufman, Nancy H. – Journal of Legal Education, 1994
A survey of 120 law schools investigated grading practices for different course types, use of different kinds of grading curves to standardize assessment at several instructional levels, and recent experience with institutional changes in grading systems. (MSE)
Descriptors: Academic Standards, Change Strategies, Difficulty Level, Educational Change
Peer reviewed Peer reviewed
Direct linkDirect link
Xu, Yuejin; Iran-Nejad, Asghar; Thoma, Stephen J. – Journal of Interactive Online Learning, 2007
The purpose of the study was to determine comparability of an online version to the original paper-pencil version of Defining Issues Test 2 (DIT2). This study employed methods from both Classical Test Theory (CTT) and Item Response Theory (IRT). Findings from CTT analyses supported the reliability and discriminant validity of both versions.…
Descriptors: Computer Assisted Testing, Test Format, Comparative Analysis, Test Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Sgoutas-Emch, Sandra A.; Nagel, Erik; Flynn, Scott – Journal of Instructional Psychology, 2007
Undergraduate students routinely rated science-related courses such as biopsychology as intimidating and very difficult. Identification of factors that may contribute to success in these types of courses is important in order to help increase performance and interest in these topics. To examine what variables are related to performance, we studied…
Descriptors: Identification (Psychology), Undergraduate Students, Test Anxiety, Grade Point Average
Livingston, Samuel A. – 1986
This paper deals with test fairness regarding a test consisting of two parts: (1) a "common" section, taken by all students; and (2) a "variable" section, in which some students may answer a different set of questions from other students. For example, a test taken by several thousand students each year contains a common multiple-choice portion and…
Descriptors: Difficulty Level, Error of Measurement, Essay Tests, Mathematical Models
Lancaster, Diana M.; And Others – 1987
Difficulty and discrimination ability were compared between multiple choice and short answer items in midterm and final examinations for the internal medicine course at Louisiana State University School of Dentistry. The examinations were administered to 67 sophomore dental students in that course. Additionally, the impact of the source of the…
Descriptors: Dental Schools, Dentistry, Difficulty Level, Discriminant Analysis
Ackerman, Phillip L.; And Others – 1982
Statistical methods employed to test individual differences in dual-task performance and the existence of a general time-sharing ability are reviewed and critiqued. Specifically, both the types of data being collected and the types of data analysis procedures have been inadequate to the critical evaluation of a hypothetical…
Descriptors: Attention Control, Cognitive Processes, Difficulty Level, Factor Analysis
Scheetz, James P.; Forsyth, Robert A. – 1977
Empirical evidence is presented related to the effects of using a stratified sampling of items in multiple matrix sampling on the accuracy of estimates of the population mean. Data were obtained from a sample of 600 high school students for a 36-item mathematics test and a 40-item vocabulary test, both subtests of the Iowa Tests of Educational…
Descriptors: Achievement Tests, Difficulty Level, Item Analysis, Item Sampling
Peer reviewed Peer reviewed
Direct linkDirect link
Oxford, Rebecca; Cho, Yunkyoung; Leung, Santoi; Kim, Hae-Jin – International Review of Applied Linguistics in Language Teaching (IRAL), 2004
Assessing use of language learning strategies has become commonplace around the world. One strategy assessment tool is the questionnaire, which usually asks students to report on their typical, general use of language learning strategies. Because of this general focus, most questionnaires do not require respondents to complete an actual language…
Descriptors: Learning Strategies, Second Language Learning, Task Analysis, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Al-A'ali, Mansoor – Educational Technology & Society, 2007
Computer adaptive testing is the study of scoring tests and questions based on assumptions concerning the mathematical relationship between examinees' ability and the examinees' responses. Adaptive student tests, which are based on item response theory (IRT), have many advantages over conventional tests. We use the least square method, a…
Descriptors: Educational Testing, Higher Education, Elementary Secondary Education, Student Evaluation
Cope, Ronald T. – 1995
This paper deals with the problems that arise in performance assessment from the granularity that results from having a small number of tasks or prompts and raters of responses to these tasks or prompts. Two problems are discussed in detail: (1) achieving a satisfactory degree of reliability; and (2) equating or adjusting for differences of…
Descriptors: Difficulty Level, Educational Assessment, Equated Scores, High Stakes Tests
de Jong, John H. A. L. – 1984
The Netherlands' secondary education system is highly differentiated, with four different school types for four scholastic ability levels. Final examinations must accommodate these four levels, and require a test-independent definition of the intended final ability levels as well as a sample-free evaluation of the range of ability levels at which…
Descriptors: Difficulty Level, Efficiency, Equated Scores, Foreign Countries
Pages: 1  |  ...  |  39  |  40  |  41  |  42  |  43  |  44  |  45  |  46  |  47  |  48  |  49