McKinley, Robert L.; Reckase, Mark D. – 1981
A study was conducted to compare tailored testing procedures based on a Bayesian ability estimation technique and on a maximum likelihood ability estimation technique. The Bayesian tailored testing procedure selected items so as to minimize the posterior variance of the ability estimate distribution, while the maximum likelihood tailored testing…
Descriptors: Academic Ability, Adaptive Testing, Bayesian Statistics, Comparative Analysis
Schau, Candace Garrett; Kahn, Lynne – 1978
A 21-item group-administered rating scale was designed to measure elementary school students' sex-stereotyped attitudes about adult occupations. The students responded to items by designating who could, as well as who should, fill certain occupations. By choosing from five responses (only women, more women than men, about the same number of women…
Descriptors: Attitude Measures, Career Choice, Elementary Education, Elementary School Students
Campbell, Noma Jo; Grissom, Stephen – 1979
To investigate the effects of wording in attitude test items, a five-point Likert-type rating scale was administered to 173 undergraduate education majors. The test measured attitudes toward college and self, and contained 38 positively-worded items. Thirty-eight negatively-worded items were also written to parallel the positive statements.…
Descriptors: Affective Measures, Attitude Measures, Higher Education, Rating Scales
A Comparison of Three Types of Test Development Procedures Using Classical and Latent Trait Methods.
Benson, Jeri; Wilson, Michael – 1979
Three methods of item selection were used to select sets of 38 items from a 50-item verbal analogies test and the resulting item sets were compared for internal consistency, standard errors of measurement, item difficulty, biserial item-test correlations, and relative efficiency. Three groups of 1,500 cases each were used for item selection. First…
Descriptors: Comparative Analysis, Difficulty Level, Efficiency, Error of Measurement
Merz, William R.; Grossen, Neal E. – 1978
Six approaches to assessing test item bias were examined: transformed item difficulty, point biserial correlations, chi-square, factor analysis, one parameter item characteristic curve, and three parameter item characteristic curve. Data sets for analysis were generated by a Monte Carlo technique based on the three parameter model; thus, four…
Descriptors: Difficulty Level, Evaluation Methods, Factor Analysis, Item Analysis
Sullivan, Arthur P. – 1978
Sullivan's Ethical Reasoning Scale contains three dilemmas with response pairs representing Kohlberg's stages of moral development. In Kohlberg's first three stages, goodness is equated with lack of punishment, usefulness, and approval, respectively. Good is seen as conformity to rule and ruler in stage four, and stage five comprises…
Descriptors: Adolescents, Adults, Attitude Measures, Conflict Resolution
Budescu, David V. – Applied Psychological Measurement, 1988 (peer reviewed)
A multiple matching test--a 24-item Hebrew vocabulary test--was examined, in which distractors from several items are pooled into one list at the test's end. Construction of such tests was feasible. Reliability, validity, and reduction of random guessing were satisfactory when applied to data from 717 applicants to Israeli universities. (SLD)
Descriptors: College Applicants, Feasibility Studies, Foreign Countries, Guessing (Tests)
Zeidner, Moshe – Higher Education, 1986 (peer reviewed)
A study of possible test bias in the Arabic and Hebrew versions of a standardized scholastic aptitude test used in Israel found a slight overprediction of performance for Arabs, but the findings appear to be more consistent with psychometric than cultural bias. (Author/MSE)
Descriptors: Aptitude Tests, Arabic, Arabs, College Bound Students
Cochran, H. Keith – 1997
The purpose of this study was to develop a psychometrically sound instrument to measure teachers' attitudes toward students with special needs, the Scale of Teacher's Attitudes Toward Inclusion (STATIC). Approximately 1,440 inservice teachers were asked to complete the STATIC. There were 516 respondents from 5 school districts in Alabama. Various…
Descriptors: Attitude Measures, Attitudes toward Disabilities, Disabilities, Elementary Secondary Education
Trevisan, Michael S.; And Others – Educational and Psychological Measurement, 1991 (peer reviewed)
The reliability and validity of multiple-choice tests were computed as a function of the number of options per item and student ability for 435 parochial high school juniors, who were administered the Washington Pre-College Test Battery. Results suggest the efficacy of the three-option item. (SLD)
Descriptors: Ability, Comparative Testing, Distractors (Tests), Grade Point Average
Wheldall, Kevin; Madelaine, Alison – Australasian Journal of Special Education, 2006
The aim of this study was to develop a means of tracking the reading performance of low-progress readers on a weekly basis, so as to inform instructional decision-making. A representative sample of 261 primary school children from Years 1 to 5 were tested on 21 different text passages taken from a developing passage reading test, the Wheldall…
Descriptors: Reading Tests, Test Reliability, Test Validity, Difficulty Level
Bliss, Leonard B.; And Others – 1996
The Inventario de Comportamiento de Estudio (ICE), a Spanish translation of the Study Behavior Inventory (SBI), was developed and tested using a group of 594 undergraduate students from randomly selected classes at a private comprehensive university in Mexico. Both instruments were designed to assess the study behaviors of students in institutions…
Descriptors: Behavior Patterns, Bilingual Education, Factor Analysis, Factor Structure
Lett, Nancy J.; Kamphaus, Randy W. – 1992
Results of the Behavior Assessment System for Children (BASC) Student Observation Scale (SOS), a measure of classroom behavior, were correlated with results of the BASC Teacher Rating Scale (TRS). Two classroom observations were made of each of 30 students (21 males and 9 females) aged 5 to 11 years. Teachers of those students completed the TRS.…
Descriptors: Children, Classroom Observation Techniques, Classroom Research, Comparative Testing
Salies, Tania Gastao – 1998
A discussion of the evaluation of writing, particularly in English as a Second Language, argues for a communicative approach reflecting the current approach to language teaching and learning. The movement toward more communication-oriented and more valid language testing is examined briefly, and direct assessment is chosen as the preferred format…
Descriptors: Communicative Competence (Languages), English (Second Language), Evaluation Criteria, Foreign Countries
Stansfield, Charles W.; And Others – 1992
The development of the Polish Proficiency Test, a standardized, nationally-normed test of listening and reading comprehension for English-speaking learners of Polish, is reported. An introductory chapter provides background information about the test's development, including discussion of the relationship between the test and Polish language…
Descriptors: Language Proficiency, Language Tests, Listening Comprehension, Polish

