Publication Date
| In 2026 | 0 |
| Since 2025 | 200 |
| Since 2022 (last 5 years) | 1070 |
| Since 2017 (last 10 years) | 2580 |
| Since 2007 (last 20 years) | 4941 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Badgett, John L.; Christmann, Edwin P. – Corwin, 2009
While today's curriculum is largely driven by standards, many teachers find the lack of specificity in the standards to be confounding and even intimidating. Now this practical book provides middle and high school teachers with explicit guidance on designing specific objectives and developing appropriate formative and summative assessments to…
Descriptors: Test Items, Student Evaluation, Knowledge Level, National Standards
Abedi, Jamal – Journal of Applied Testing Technology, 2009
English language learners with disabilities (ELLWD) face many challenges in their academic career. Learning a new language and coping with their disabilities create obstacles in their academic progress. Variables relegating accessibility of assessments for students with disabilities and ELL students may seriously hinder the academic performance of…
Descriptors: Reading Achievement, Second Language Learning, Disabilities, Classification
Emmerich, Walter; And Others – 1991
The aim of this research was to identify, develop, and evaluate empirically new reasoning item types that might be used to broaden the analytical measure of the Graduate Record Examinations (GRE) General Test and to strengthen its construct validity. Six item types were selected for empirical evaluation, including the two currently used in the GRE…
Descriptors: Construct Validity, Correlation, Evaluation Methods, Sex Differences
Kaplan, Randy M.; Bennett, Randy Elliot – 1994
This study explores the potential for using a computer-based scoring procedure for the formulating-hypotheses (F-H) item. This item type presents a situation and asks the examinee to generate explanations for it. Each explanation is judged right or wrong, and the number of creditable explanations is summed to produce an item score. Scores were…
Descriptors: Automation, Computer Assisted Testing, Correlation, Higher Education
Stocking, Martha L.; And Others – 1991
This paper presents a new heuristic approach to interactive test assembly that is called the successive item replacement algorithm. This approach builds on the work of W. J. van der Linden (1987) and W. J. van der Linden and E. Boekkooi-Timminga (1989) in which methods of mathematical optimization are combined with item response theory to…
Descriptors: Algorithms, Automation, Computer Selection, Heuristics
D'Costa, Ayres – 1993
The Sato Caution Index takes into account the number and difficulty of items gotten wrong by a student within his or her ability, as well as the number and difficulty of items gotten right beyond his or her ability. Sato subtracts the two components to define a single Caution Index. In this study, the components are kept separate, defining a…
Descriptors: Ability, College Students, Error Patterns, Factor Analysis
Sheehan, Kathleen M.; Mislevy, Robert J. – 1988
In many practical applications of item response theory, the parameters of overlapping subsets of test items are estimated from different samples of examinees. A linking procedure is then employed to place the resulting item parameter estimates onto a common scale. It is standard practice to ignore the uncertainty associated with the linking step…
Descriptors: Error of Measurement, Estimation (Mathematics), Item Response Theory, Measurement Techniques
Kehoe, Jerard – 1995
This digest presents a list of recommendations for writing multiple-choice test items, based on psychometrics statistics are typically provided by a measurement, or test scoring, service, where tests are machine-scored or by testing software packages. Test makers can capitalize on the fact that "bad" items can be differentiated from…
Descriptors: Item Analysis, Item Banks, Measurement Techniques, Multiple Choice Tests
Tang, K. Linda – 1996
The average Kullback-Keibler (K-L) information index (H. Chang and Z. Ying, in press) is a newly proposed statistic in Computerized Adaptive Testing (CAT) item selection based on the global information function. The objectives of this study were to improve understanding of the K-L index with various parameters and to compare the performance of the…
Descriptors: Ability, Adaptive Testing, Comparative Analysis, Computer Assisted Testing
Stocking, Martha L. – 1988
The relationship between examinee ability and the accuracy of maximum likelihood item parameter estimation is explored in terms of the expected (Fisher) information. Information functions are used to find the optimum ability levels and maximum contributions to information for estimating item parameters in three commonly used logistic item response…
Descriptors: Ability, Adaptive Testing, Estimation (Mathematics), Item Response Theory
Lyu, C. Felicia; And Others – 1995
A smoothed version of standardization, which merges kernel smoothing with the traditional standardization differential item functioning (DIF) approach, was used to examine DIF for student-produced response (SPR) items on the Scholastic Assessment Test (SAT) I mathematics test at both the item and testlet levels. This nonparametric technique avoids…
Descriptors: Aptitude Tests, Item Bias, Mathematics Tests, Multiple Choice Tests
Wingersky, Marilyn S. – 1989
In a variable-length adaptive test with a stopping rule that relied on the asymptotic standard error of measurement of the examinee's estimated true score, M. S. Stocking (1987) discovered that it was sufficient to know the examinee's true score and the number of items administered to predict with some accuracy whether an examinee's true score was…
Descriptors: Adaptive Testing, Bayesian Statistics, Error of Measurement, Estimation (Mathematics)
Kim, Seock-Ho – 1997
Hierarchical Bayes procedures for the two-parameter logistic item response model were compared for estimating item parameters. Simulated data sets were analyzed using two different Bayes estimation procedures, the two-stage hierarchical Bayes estimation (HB2) and the marginal Bayesian with known hyperparameters (MB), and marginal maximum…
Descriptors: Bayesian Statistics, Difficulty Level, Estimation (Mathematics), Item Bias
Ingebo, George S. – 1993
Item response theory (IRT) is based on the assumption that a direct relationship exists between an examinee's total performance on a set of items and the difficulty of each item on the test. The Rasch model represents this relationship mathematically on an equal interval scale. This paper argues that IRT, under the required conditions, provides…
Descriptors: Achievement Tests, Difficulty Level, Item Banks, Item Response Theory
Flowers, Claudia P.; And Others – 1997
An item response theory-based parametric procedure proposed by N. S. Raju, W. J. van der Linden, and P. F. Fleer (1995) known as differential functioning of items and tests (DFIT) can be used with unidimensional and multidimensional data with dichotomous or polytomous scoring. This study describes the polytomous DFIT framework and evaluates and…
Descriptors: Chi Square, Computer Simulation, Item Bias, Item Response Theory

Direct link
Peer reviewed
