Publication Date
| In 2026 | 2 |
| Since 2025 | 97 |
| Since 2022 (last 5 years) | 471 |
| Since 2017 (last 10 years) | 1288 |
| Since 2007 (last 20 years) | 3642 |
Descriptor
| Testing | 12711 |
| Higher Education | 1800 |
| Foreign Countries | 1651 |
| Elementary Secondary Education | 1513 |
| Language Tests | 1438 |
| Academic Achievement | 1413 |
| Student Evaluation | 1413 |
| Test Construction | 1274 |
| Evaluation Methods | 1257 |
| Second Language Learning | 1172 |
| Test Validity | 1090 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 853 |
| Teachers | 528 |
| Researchers | 193 |
| Administrators | 185 |
| Policymakers | 172 |
| Students | 57 |
| Parents | 50 |
| Counselors | 36 |
| Community | 20 |
| Media Staff | 7 |
| Support Staff | 4 |
| More ▼ | |
Location
| Canada | 192 |
| Australia | 174 |
| United States | 149 |
| United Kingdom (England) | 134 |
| California | 132 |
| New York | 122 |
| United Kingdom | 119 |
| Texas | 94 |
| China | 85 |
| Florida | 84 |
| United Kingdom (Great Britain) | 79 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 2 |
Clapper, John P. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2012
This article describes 5 experiments investigating the role of prior knowledge in incidental category learning. Experiments 1 to 3 showed that prior knowledge improved learning only if the categories in a given set were related to contrasting themes; there was no consistent knowledge effect if the categories were related to the same theme.…
Descriptors: Memory, Testing, Prior Learning, Role
Mattern, Krista D.; Kobrin, Jennifer L.; Camara, Wayne J. – Measurement: Interdisciplinary Research and Perspectives, 2012
As researchers at a testing organization concerned with the appropriate uses and validity evidence for our assessments, we provide an applied perspective related to the issues raised in the focus article. Newton's proposal for elaborating the consensus definition of validity is offered with the intention to reduce the risks of inadequate…
Descriptors: Evidence, Validity, Tests, Testing
Brewer, Gene A.; Unsworth, Nash – Journal of Memory and Language, 2012
The current study examined individual differences in the effects of retrieval from long-term memory (i.e., the testing effect). The effects of retrieving from memory make tested information more accessible for future retrieval attempts. Despite the broad applied ramifications of such a potent memorization technique there is a paucity of research…
Descriptors: Individual Differences, Long Term Memory, Testing, Attention Control
Robelen, Erik W. – Education Week, 2012
At a time when U.S. political and business leaders are raising concerns about the need to better nurture creativity and innovative thinking among young people, several states are exploring the development of an index that would gauge the extent to which schools provide opportunities to foster those qualities. In Massachusetts, a new state…
Descriptors: Creativity, Measures (Individuals), Public Schools, Testing
Pepper, Mark – Mathematics Teaching, 2012
Assessment of performance, when national standards are required, has to be sustained by confidence. Confidence in the learner that they are engaged in learning that will provide them with a qualification that is worthwhile, and the confidence of the end user that the assessment is rigorous, equitable institution to institution, and accurate.…
Descriptors: National Standards, Mathematics, Mathematics Instruction, Accreditation (Institutions)
Eastwell, Peter – School Science Review, 2012
This paper defines the terms "hypothesis," "prediction," and "conclusion" and shows how to use the terms correctly in scientific investigations in both the school and science education research contexts. The scientific method, or hypothetico-deductive (HD) approach, is described and it is argued that an understanding of the scientific method,…
Descriptors: Prediction, Science Education, Educational Research, Scientific Methodology
Guo, Hongwen – Psychometrika, 2010
After many equatings have been conducted in a testing program, equating errors can accumulate to a degree that is not negligible compared to the standard error of measurement. In this paper, the author investigates the asymptotic accumulative standard error of equating (ASEE) for linear equating methods, including chained linear, Tucker, and…
Descriptors: Testing Programs, Testing, Error of Measurement, Equated Scores
Kuiper, Rebecca M.; Hoijtink, Herbert – Psychological Methods, 2010
This article discusses comparisons of means using exploratory and confirmatory approaches. Three methods are discussed: hypothesis testing, model selection based on information criteria, and Bayesian model selection. Throughout the article, an example is used to illustrate and evaluate the two approaches and the three methods. We demonstrate that…
Descriptors: Models, Testing, Hypothesis Testing, Probability
Andrich, David; Styles, Irene – Journal of Applied Measurement, 2011
There is a substantial literature on attempts to obtain information on the proficiency of respondents from distractors in multiple choice items. Information in a distractor implies that a person who chooses that distractor has greater proficiency than if the person chose another distractor with no information. A further implication is that the…
Descriptors: Multiple Choice Tests, Testing, Item Response Theory
Kane, Michael – Journal of Educational Measurement, 2011
Errors don't exist in our data, but they serve a vital function. Reality is complicated, but our models need to be simple in order to be manageable. We assume that attributes are invariant over some conditions of observation, and once we do that we need some way of accounting for the variability in observed scores over these conditions of…
Descriptors: Error of Measurement, Scores, Test Interpretation, Testing
Haelermans, Carla; Ghysels, Joris; Prince, Fernao – British Journal of Educational Technology, 2015
This paper describes a dataset with data from three individually randomized educational technology experiments on differentiation, formative testing and feedback during one school year for a group of 8th grade students in the Netherlands, using administrative data and the online motivation questionnaire of Boekaerts. The dataset consists of pre-…
Descriptors: Foreign Countries, Educational Technology, Middle School Students, Grade 8
Higgins, Jennifer; Patterson, Margaret Becker; Bozman, Martha; Katz, Michael – Journal of Technology, Learning, and Assessment, 2010
This study examined the feasibility of administering GED Tests using a computer based testing system with embedded accessibility tools and the impact on test scores and test-taker experience when GED Tests are transitioned from paper to computer. Nineteen test centers across five states successfully installed the computer based testing program,…
Descriptors: Testing Programs, Testing, Computer Uses in Education, Mathematics Tests
Phelps, Amy L.; Spangler, William E. – American Journal of Business Education, 2013
A standardized exam for program-level assessment can take the form of 1) a customized exam developed in-house by faculty and linked explicitly to program-level learning goals; or 2) a standardized exam developed externally by assessment experts and linked to a set of somewhat broader and more generalizable learning goals. This article discusses…
Descriptors: College Outcomes Assessment, Standardized Tests, Business Administration Education, Undergraduate Students
Deacon, S. Helene; Leung, Dilys – Applied Psycholinguistics, 2013
This study tested the diverging predictions of recent theories of children's learning of spelling regularities. We asked younger (Grades 1 and 2) and older (Grades 3 and 4) elementary school-aged children to choose the correct endings for words that varied in their morphological structure. We tested the impacts of semantic frequency by…
Descriptors: Spelling, Semantics, Psycholinguistics, Prediction
Cheeseman, Jill; McDonough, Andrea – International Journal for Mathematics Teaching and Learning, 2013
This article reports an innovative use of photographs in a pencil-and-paper test which was developed to assess young children's understanding of mass measurement. Two hundred and ninety-five tests were administered by thirteen teachers of Years 1 and 2 children in 3 urban and rural schools. Many of these children of 6-8 years of age were able to…
Descriptors: Performance Based Assessment, Young Children, Measurement, Teaching Methods

Peer reviewed
Direct link
