Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 7 |
| Since 2017 (last 10 years) | 16 |
| Since 2007 (last 20 years) | 44 |
Descriptor
| Testing | 75 |
| Classification | 74 |
| Models | 13 |
| Concept Formation | 11 |
| Accuracy | 10 |
| Statistical Analysis | 10 |
| Test Construction | 10 |
| Comparative Analysis | 9 |
| Correlation | 9 |
| Foreign Countries | 9 |
| Scores | 9 |
| More ▼ | |
Source
Author
| Klausmeier, Herbert J. | 5 |
| Chen, Ping | 2 |
| Ding, Shuliang | 2 |
| Fields, Lanny | 2 |
| Gelman, Susan A. | 2 |
| Lee, Won-Chan | 2 |
| Mulligan, Neil W. | 2 |
| Peterson, Daniel J. | 2 |
| Smith, Edward L. | 2 |
| Song, Lihong | 2 |
| Wang, Wenyi | 2 |
| More ▼ | |
Publication Type
| Reports - Research | 75 |
| Journal Articles | 58 |
| Speeches/Meeting Papers | 5 |
| Information Analyses | 2 |
| Guides - Non-Classroom | 1 |
| Reports - General | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Higher Education | 4 |
| Elementary Secondary Education | 3 |
| Postsecondary Education | 3 |
| Secondary Education | 3 |
| Elementary Education | 2 |
| Middle Schools | 2 |
| Early Childhood Education | 1 |
| Grade 3 | 1 |
| Grade 4 | 1 |
| Grade 6 | 1 |
| Grade 8 | 1 |
| More ▼ | |
Audience
| Researchers | 2 |
Location
| New York | 3 |
| Australia | 2 |
| California | 2 |
| Arizona | 1 |
| Brazil | 1 |
| Canada | 1 |
| China | 1 |
| Florida | 1 |
| France | 1 |
| Georgia | 1 |
| Iran | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Kang, Yewon; Ha, Hyorim; Lee, Hee Seung – Educational Psychology Review, 2023
Natural category learning is important in science education. One strategy that has been empirically supported for enhancing category learning is testing, which facilitates not only the learning of previously studied information (backward testing effect) but also the learning of newly studied information (forward testing effect). However, in…
Descriptors: Science Education, Science Tests, Testing, Classification
Amanda A. Wolkowitz; Russell Smith – Practical Assessment, Research & Evaluation, 2024
A decision consistency (DC) index is an estimate of the consistency of a classification decision on an exam. More specifically, DC estimates the percentage of examinees that would have the same classification decision on an exam if they were to retake the same or a parallel form of the exam again without memory of taking the exam the first time.…
Descriptors: Testing, Test Reliability, Replication (Evaluation), Decision Making
Park, Seohee; Kim, Kyung Yong; Lee, Won-Chan – Journal of Educational Measurement, 2023
Multiple measures, such as multiple content domains or multiple types of performance, are used in various testing programs to classify examinees for screening or selection. Despite the popular usages of multiple measures, there is little research on classification consistency and accuracy of multiple measures. Accordingly, this study introduces an…
Descriptors: Testing, Computation, Classification, Accuracy
V. N. Vimal Rao; Jeffrey K. Bye; Sashank Varma – Cognitive Research: Principles and Implications, 2024
The 0.05 boundary within Null Hypothesis Statistical Testing (NHST) "has made a lot of people very angry and been widely regarded as a bad move" (to quote Douglas Adams). Here, we move past meta-scientific arguments and ask an empirical question: What is the psychological standing of the 0.05 boundary for statistical significance? We…
Descriptors: Psychological Patterns, Statistical Analysis, Testing, Statistical Significance
Dalia Khairy; Nouf Alharbi; Mohamed A. Amasha; Marwa F. Areed; Salem Alkhalaf; Rania A. Abougalala – Education and Information Technologies, 2024
Student outcomes are of great importance in higher education institutions. Accreditation bodies focus on them as an indicator to measure the performance and effectiveness of the institution. Forecasting students' academic performance is crucial for every educational establishment seeking to enhance performance and perseverance of its students and…
Descriptors: Prediction, Tests, Scores, Information Retrieval
Eunsook Kim; Nathaniel von der Embse – Journal of Experimental Education, 2024
Using data from multiple informants has long been considered best practice in education. However, multiple informants often disagree on similar constructs, complicating decision-making. Polynomial regression and response-surface analysis (PRA) is often used to test the congruence effect between multiple informants on an outcome. However, PRA…
Descriptors: Congruence (Psychology), Information Sources, Best Practices, Regression (Statistics)
Rios, Joseph A. – Applied Measurement in Education, 2022
Testing programs are confronted with the decision of whether to report individual scores for examinees that have engaged in rapid guessing (RG). As noted by the "Standards for Educational and Psychological Testing," this decision should be based on a documented criterion that determines score exclusion. To this end, a number of heuristic…
Descriptors: Testing, Guessing (Tests), Academic Ability, Scores
Wang, Wenyi; Song, Lihong; Chen, Ping; Ding, Shuliang – Journal of Educational Measurement, 2019
Most of the existing classification accuracy indices of attribute patterns lose effectiveness when the response data is absent in diagnostic testing. To handle this issue, this article proposes new indices to predict the correct classification rate of a diagnostic test before administering the test under the deterministic noise input…
Descriptors: Cognitive Tests, Classification, Accuracy, Diagnostic Tests
Sejung Yang – Language Documentation & Conservation, 2020
Testing is increasingly recognized as a vital part of language revitalization. I demonstrate here that assessment of linguistic knowledge should also be part of the planning process that precedes the creation of a revitalization program. I take as an example Jejueo, the language of Korea's Jeju Island. Whereas previously published work…
Descriptors: Testing, Language Tests, Vocabulary Skills, Language Patterns
Grabovsky, Irina; Wainer, Howard – Journal of Educational and Behavioral Statistics, 2017
In this article, we extend the methodology of the Cut-Score Operating Function that we introduced previously and apply it to a testing scenario with multiple independent components and different testing policies. We derive analytically the overall classification error rate for a test battery under the policy when several retakes are allowed for…
Descriptors: Cutting Scores, Weighted Scores, Classification, Testing
Huang, Hung-Yu – Journal of Educational Measurement, 2017
Cognitive diagnosis models (CDMs) have been developed to evaluate the mastery status of individuals with respect to a set of defined attributes or skills that are measured through testing. When individuals are repeatedly administered a cognitive diagnosis test, a new class of multilevel CDMs is required to assess the changes in their attributes…
Descriptors: Testing, Cognitive Measurement, Test Items, Classification
Miciak, Jeremy; Taylor, W. Pat; Stuebing, Karla K.; Fletcher, Jack M. – Journal of Psychoeducational Assessment, 2018
We investigated the classification accuracy of learning disability (LD) identification methods premised on the identification of an intraindividual pattern of processing strengths and weaknesses (PSW) method using multiple indicators for all latent constructs. Known LD status was derived from latent scores; values at the observed level identified…
Descriptors: Accuracy, Learning Disabilities, Classification, Identification
Park, Ryoungsun; Kim, Jiseon; Chung, Hyewon; Dodd, Barbara G. – Educational and Psychological Measurement, 2017
The current study proposes novel methods to predict multistage testing (MST) performance without conducting simulations. This method, called MST test information, is based on analytic derivation of standard errors of ability estimates across theta levels. We compared standard errors derived analytically to the simulation results to demonstrate the…
Descriptors: Testing, Performance, Prediction, Error of Measurement
Schmidgall, Jonathan E.; Getman, Edward P.; Zu, Jiyun – Language Testing, 2018
In this study, we define the term "screener test," elaborate key considerations in test design, and describe how to incorporate the concepts of practicality and argument-based validation to drive an evaluation of screener tests for language assessment. A screener test is defined as a brief assessment designed to identify an examinee as a…
Descriptors: Test Validity, Test Use, Test Construction, Language Tests
Sturz, Bradley R.; Bell, Z. Kade; Bodily, Kent D. – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2018
During spatial reorientation, the use of local geometric cues (e.g., corner angles) and global geometric cues (e.g., principal axis) is differentially influenced by enclosure size. Local geometric cues exert more influence in large enclosures compared to small enclosures, whereas the use of global geometric cues is not influenced by changes in…
Descriptors: Spatial Ability, Comparative Analysis, Testing, Classification

Peer reviewed
Direct link
