Publication Date
In 2025 | 42 |
Since 2024 | 165 |
Since 2021 (last 5 years) | 588 |
Since 2016 (last 10 years) | 1225 |
Since 2006 (last 20 years) | 2731 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 169 |
Practitioners | 49 |
Teachers | 32 |
Administrators | 8 |
Policymakers | 8 |
Counselors | 4 |
Students | 4 |
Media Staff | 1 |
Location
Turkey | 172 |
Australia | 81 |
Canada | 79 |
China | 70 |
United States | 55 |
Germany | 43 |
Taiwan | 43 |
Japan | 40 |
United Kingdom | 38 |
Iran | 36 |
Spain | 33 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Does not meet standards | 1 |
Fowell, S. L.; Fewtrell, R.; McLaughlin, P. J. – Advances in Health Sciences Education, 2008
Absolute standard setting procedures are recommended for assessment in medical education. Absolute, test-centred standard setting procedures were introduced for written assessments in the Liverpool MBChB in 2001. The modified Angoff and Ebel methods have been used for short answer question-based and extended matching question-based papers,…
Descriptors: Medical Education, Standard Setting (Scoring), Judges, Interrater Reliability
Yi, Qing; Zhang, Jinming; Chang, Hua-Hua – Applied Psychological Measurement, 2008
Criteria had been proposed for assessing the severity of possible test security violations for computerized tests with high-stakes outcomes. However, these criteria resulted from theoretical derivations that assumed uniformly randomized item selection. This study investigated potential damage caused by organized item theft in computerized adaptive…
Descriptors: Test Items, Simulation, Item Analysis, Safety
Carle, Adam C. – Hispanic Journal of Behavioral Sciences, 2008
Confirmatory factor analyses for ordered-categorical measures probed for differential item functioning on a standardized measure of alcohol dependence across Hispanics (n = 834) and non-Hispanic Caucasians (n = 14,001) in a nationally representative survey of alcohol use in the United States conducted in 1992. Analyses investigated whether 30…
Descriptors: Test Bias, Validity, Drinking, Factor Analysis
Vock, Miriam; Holling, Heinz – Intelligence, 2008
The objective of this study is to explore the potential for developing IRT-based working memory scales for assessing specific working memory components in children (8-13 years). These working memory scales should measure cognitive abilities reliably in the upper range of ability distribution as well as in the normal range, and provide a…
Descriptors: Test Items, Academic Achievement, Factor Structure, Factor Analysis
McIntosh, Kent; Brown, Jacqueline A.; Borgmeier, Christopher J. – Assessment for Effective Intervention, 2008
This article discusses the evidence for intervention validity of Functional Behavior Assessment (FBA) in designing support for students with intensive behavioral needs. Since its inclusion into the Individuals With Disabilities Education Act nearly a decade ago, FBA has been the subject of significant research investigating its use and…
Descriptors: Intervention, Functional Behavioral Assessment, Item Analysis, Program Validation
Vigneau, Francois; Bors, Douglas A. – Intelligence, 2008
Various taxonomies of Raven's Advanced Progressive Matrices (APM) items have been proposed in the literature to account for performance on the test. In the present article, three such taxonomies based on information processing, namely Carpenter, Just and Shell's [Carpenter, P.A., Just, M.A., & Shell, P., (1990). What one intelligence test…
Descriptors: Intelligence, Intelligence Tests, Factor Analysis, Classification
Carretero-Dios, Hugo; Macarena, De los Santos-Roig; Buela-Casal, Gualberto – Learning and Individual Differences, 2008
This study is an item analysis of the Matching Familiar Figures Test-20. We examined error scores in the Matching Familiar Figures Test-20 to determine the influence of the difficulty of the test on the assessment of reflection-impulsivity. The sample included 700 participants aged between 6 and 12 years. The results obtained from the corrected…
Descriptors: Conceptual Tempo, Individual Differences, Item Analysis, Children
Meath, Sian E.; Aye, Lu; Haritos, Nicholas – Bulletin of Science, Technology & Society, 2008
This article focuses on the accuracy of satellite data, which may then be used in wave power applications. The satellite data are compared to data from wave buoys, which are currently considered to be the most accurate of the devices available for measuring wave characteristics. This article presents an analysis of satellite- (Topex/Poseidon) and…
Descriptors: Spectroscopy, Structural Analysis (Science), Satellites (Aerospace), Program Validation
El-Alfy, El-Sayed M.; Abdel-Aal, Radwan E. – Computers & Education, 2008
Recent advances in educational technologies and the wide-spread use of computers in schools have fueled innovations in test construction and analysis. As the measurement accuracy of a test depends on the quality of the items it includes, item selection procedures play a central role in this process. Mathematical programming and the item response…
Descriptors: Test Items, Item Analysis, Educational Technology, Test Construction
Heh, Peter – ProQuest LLC, 2009
The current study examined the validation and alignment of the PASA-Science by determining whether the alternate science assessment anchors linked to the regular education science anchors; whether the PASA-Science assessment items are science; whether the PASA-Science assessment items linked to the alternate science eligible content, and what…
Descriptors: Program Effectiveness, Special Education, Science Education, Science Tests
Setzer, J. Carl; He, Yi – GED Testing Service, 2009
Reliability Analysis for the Internationally Administered 2002 Series GED (General Educational Development) Tests Reliability refers to the consistency, or stability, of test scores when the authors administer the measurement procedure repeatedly to groups of examinees (American Educational Research Association [AERA], American Psychological…
Descriptors: Educational Research, Error of Measurement, Scores, Test Reliability
Costagliola, Gennaro; Fuccella, Vittorio – International Journal of Distance Education Technologies, 2009
To correctly evaluate learners' knowledge, it is important to administer tests composed of good quality question items. By the term "quality" we intend the potential of an item in effectively discriminating between skilled and untrained students and in obtaining tutor's desired difficulty level. This article presents a rule-based e-testing system…
Descriptors: Difficulty Level, Test Items, Computer Assisted Testing, Item Response Theory
Carr, W. David; Frey, Bruce B.; Swann, Elizabeth – Athletic Training Education Journal, 2009
Objective: To establish the validity and reliability of an online assessment instrument's items developed to track educational outcomes over time. Design and Setting: A descriptive study of the validation arguments and reliability testing of the assessment items. The instrument is available to graduating students enrolled in entry-level Athletic…
Descriptors: Athletics, Educational Objectives, Outcomes of Education, Validity
Kuntsche, Emmanuel; Kuntsche, Sandra – Journal of Clinical Child and Adolescent Psychology, 2009
A short form of the Drinking Motive Questionnaire Revised (DMQ-R; Cooper, 1994) was developed, using different item selection strategies based on a national representative sample of 5,617 12- to 18-year-old students in Switzerland. To confirm the concurrent validity of the short-form questionnaire, or DMQ-R SF, data from a second national sample…
Descriptors: Structural Equation Models, International Studies, Test Validity, Drinking
Ariel, Robert; Dunlosky, John; Bailey, Heather – Journal of Experimental Psychology: General, 2009
Theories of self-regulated study assume that learners monitor item difficulty when making decisions about which items to select for study. To complement such theories, the authors propose an agenda-based regulation (ABR) model in which learners' study decisions are guided by an agenda that learners develop to prioritize items for study, given…
Descriptors: Test Items, Time Management, Item Analysis, Rewards