Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 1 |
Descriptor
Author
| Frary, Robert B. | 2 |
| Kelderman, Henk | 2 |
| Aiken, Lewis R. | 1 |
| Algina, James | 1 |
| Carifio, James | 1 |
| Cramer, Stephen E. | 1 |
| Gardner, Eric | 1 |
| Harwell, Michael | 1 |
| Hutchinson, T.P. | 1 |
| Kohr, Richard L. | 1 |
| Legg, Sue M. | 1 |
| More ▼ | |
Publication Type
| Reports - Evaluative | 17 |
| Speeches/Meeting Papers | 9 |
| Journal Articles | 4 |
| Reports - Research | 2 |
| ERIC Digests in Full Text | 1 |
| ERIC Publications | 1 |
Education Level
Audience
| Researchers | 1 |
Location
| Netherlands | 4 |
Laws, Policies, & Programs
Assessments and Surveys
| Pennsylvania Educational… | 1 |
What Works Clearinghouse Rating
Harwell, Michael – Journal of Experimental Education, 2019
Measures of socioeconomic status (SES) are widely used in educational research and policy applications in no small part because of a deeply rooted belief of the importance of SES. This paper argues that the usefulness of common SES measures can be undermined by (a) an atheoretical approach to conceptualizing SES and selecting measures, which…
Descriptors: Socioeconomic Status, Measures (Individuals), Testing Problems, Educational Research
Wollack, James A. – Applied Measurement in Education, 2006
Many of the currently available statistical indexes to detect answer copying lack sufficient power at small [alpha] levels or when the amount of copying is relatively small. Furthermore, there is no one index that is uniformly best. Depending on the type or amount of copying, certain indexes are better than others. The purpose of this article was…
Descriptors: Statistical Analysis, Item Analysis, Test Length, Sample Size
Gardner, Eric – 1989
Five of the common misuses of tests are reviewed: (1) acceptance of the test title as an accurate and complete description of the variable being measured (failure to examine the manual and the items carefully to know the specific aspects to be tested can result in misuse through selection of an inappropriate test for a particular purpose or…
Descriptors: Error of Measurement, Evaluation Problems, Examiners, Scoring
Nandakumar, Ratna – 1989
The theoretical differences between the traditional definition of dimensionality and the more recently defined notion of essential dimensionality are presented. Monte Carlo simulations are used to demonstrate the utility of W. F. Stout's procedure to assess the essential unidimensionality of the latent space underlying a set of terms. The…
Descriptors: Definitions, Educational Assessment, Latent Trait Theory, Mathematical Models
Peer reviewedAiken, Lewis R. – Journal of Experimental Education, 1988
Several statistical rational and empirical procedures are presented for dealing with the problems of non-response or low return rates in surveys, namely mail surveys. Advantages and shortcomings of these procedures in educational research are discussed. (SLD)
Descriptors: Data Collection, Educational Research, Mail Surveys, Response Style (Tests)
Carifio, James; And Others – 1990
Possible bias due to sampling problems or low response rates has been a troubling "nuisance" variable in empirical research since seminal and classical studies were done on these problems at the beginning of this century. Recent research suggests that: (1) earlier views of the alleged bias problem were misleading; (2) under a variety of fairly…
Descriptors: Data Collection, Evaluation Methods, Research Problems, Response Rates (Questionnaires)
Willingness to Answer Multiple-Choice Questions as Manifested Both in Genuine and in Nonsense Items.
Peer reviewedFrary, Robert B.; Hutchinson, T.P. – Educational and Psychological Measurement, 1982
Alternate versions of Hutchinson's theory were compared, and one which implies the existence of partial knowledge was found to be better than one which implies that an appropriate measure of ability is obtained by applying the conventional correction for guessing. (Author/PN)
Descriptors: Guessing (Tests), Latent Trait Theory, Multiple Choice Tests, Scoring Formulas
Wilcox, Rand R. – 1978
Two fundamental problems in mental test theory are to estimate true score and to estimate the amount of error when testing an examinee. In this report, three probability models which characterize a single test item in terms of a population of examinees are described. How these models may be modified to characterize a single examinee in terms of an…
Descriptors: Achievement Tests, Comparative Analysis, Error of Measurement, Mathematical Models
Frary, Robert B. – 1992
Practical and effective methods for detecting copying of multiple-choice test responses have been available for many years. These methods have been used routinely by large admissions and licensing testing programs. However, these methods are seldom applied in the areas of standardized or classroom testing in schools or colleges, and knowledge…
Descriptors: Cheating, College Entrance Examinations, Ethics, Evaluation Methods
Kelderman, Henk – 1986
A method is proposed for the detection of item bias with respect to observed or unobserved subgroups. The method uses quasi-loglinear models for the incomplete subgroup x test score x item 1 x ... x item k contingency table. If the subgroup membership is unknown, the models are the incomplete-latent-class models of S. J. Haberman (1979). The…
Descriptors: Foreign Countries, Higher Education, Latent Trait Theory, Mathematical Models
Kelderman, Henk; Macready, George B. – 1988
The use of loglinear latent class models to detect item bias was studied. Purposes of the study were to: (1) develop procedures for use in assessing item bias when the grouping variable with respect to which bias occurs is not observed; (2) develop bias detection procedures that relate to a conceptually different assessed trait--a categorical…
Descriptors: Foreign Countries, Higher Education, Latent Trait Theory, Mathematical Models
Theunissen, Phiel J. J. M. – 1983
Any systematic approach to the assessment of students' ability implies the use of a model. The more explicit the model is, the more its users know about what they are doing and what the consequences are. The Rasch model is a strong model where measurement is a bonus of the model itself. It is based on four ideas: (1) separation of observable…
Descriptors: Ability Grouping, Difficulty Level, Evaluation Criteria, Item Sampling
PDF pending restorationKohr, Richard L. – 1982
A statistical analysis was conducted to examine changes across time in some school districts of Pennsylvania. Comparisons were made for grades 5, 8, and 11 during two assessment periods. The Educational Quality Assessment data provided the basis for analysis. Correlation coefficients for school mean goal scores are displayed. Changes at the three…
Descriptors: Academic Achievement, Educational Assessment, Educational Trends, Elementary Secondary Education
Cramer, Stephen E. – 1990
A standard-setting procedure was developed for the Georgia Teacher Certification Testing Program as tests in 30 teaching fields were revised. A list of important characteristics of a standard-setting procedure was derived, drawing on the work of R. A. Berk (1986). The best method was found to be a highly formalized judgmental, empirical Angoff…
Descriptors: Computer Assisted Testing, Cutting Scores, Data Collection, Elementary Secondary Education
Samson, Digna M. M. – 1983
The traditional multiple-choice reading comprehension test of English as a second language, used in the Dutch school-leaving examinations, has been criticized for its apparent lack of construct validity. The Dutch National Institute for Educational Measurement has conducted a number of studies to determine whether there is a different skill…
Descriptors: English (Second Language), Foreign Countries, Language Tests, Multiple Choice Tests
Previous Page | Next Page ยป
Pages: 1 | 2
Direct link
