Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 10 |
| Since 2007 (last 20 years) | 61 |
Descriptor
Source
Author
| Spray, Judith A. | 6 |
| Kalohn, John C. | 4 |
| Alonzo, Julie | 2 |
| Anderson, Daniel | 2 |
| Buckendahl, Chad W. | 2 |
| Dietel, Ronald | 2 |
| Herman, Joan L. | 2 |
| Huang, Chi-Yu | 2 |
| Jiao, Hong | 2 |
| Lin, Chuan-Ju | 2 |
| Thompson, Nathan A. | 2 |
| More ▼ | |
Publication Type
| Reports - Evaluative | 111 |
| Journal Articles | 79 |
| Speeches/Meeting Papers | 17 |
| Information Analyses | 5 |
| Guides - Non-Classroom | 2 |
| Numerical/Quantitative Data | 2 |
| Tests/Questionnaires | 2 |
| Opinion Papers | 1 |
| Reports - Research | 1 |
Education Level
| Elementary Secondary Education | 8 |
| Higher Education | 5 |
| Elementary Education | 4 |
| Grade 4 | 4 |
| Grade 5 | 4 |
| Grade 8 | 4 |
| Secondary Education | 4 |
| Grade 3 | 3 |
| Grade 6 | 3 |
| Grade 7 | 3 |
| Grade 2 | 2 |
| More ▼ | |
Audience
| Practitioners | 1 |
Location
| Australia | 3 |
| Nebraska | 2 |
| Taiwan | 2 |
| California | 1 |
| Canada | 1 |
| Denmark | 1 |
| Egypt | 1 |
| Georgia | 1 |
| Missouri | 1 |
| Netherlands | 1 |
| New Jersey | 1 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Coggeshall, Whitney Smiley – Educational Measurement: Issues and Practice, 2021
The continuous testing framework, where both successful and unsuccessful examinees have to demonstrate continued proficiency at frequent prespecified intervals, is a framework that is used in noncognitive assessment and is gaining in popularity in cognitive assessment. Despite the rigorous advantages of this framework, this paper demonstrates that…
Descriptors: Classification, Accuracy, Testing, Failure
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2022
Takers of educational tests often receive proficiency levels instead of or in addition to scaled scores. For example, proficiency levels are reported for the Advanced Placement (AP®) and U.S. Medical Licensing examinations. Technical difficulties and other unforeseen events occasionally lead to missing item scores and hence to incomplete data on…
Descriptors: Computation, Data Analysis, Educational Testing, Accuracy
Ormerod, Christopher; Lottridge, Susan; Harris, Amy E.; Patel, Milan; van Wamelen, Paul; Kodeswaran, Balaji; Woolf, Sharon; Young, Mackenzie – International Journal of Artificial Intelligence in Education, 2023
We introduce a short answer scoring engine made up of an ensemble of deep neural networks and a Latent Semantic Analysis-based model to score short constructed responses for a large suite of questions from a national assessment program. We evaluate the performance of the engine and show that the engine achieves above-human-level performance on a…
Descriptors: Computer Assisted Testing, Scoring, Artificial Intelligence, Semantics
Phelps, Richard P. – Online Submission, 2020
This review critiques the highly-praised and influential 2001 study, "Getting Tough? The Impact of High School Graduation Exams," which concluded that "minimum competency," or high school "graduation exams," had no effect on student achievement. The review compares the test classifications of "Getting…
Descriptors: High School Students, Exit Examinations, Academic Achievement, Minimum Competencies
Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020
Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high stake consequences for the public. Denying graduation or forcing…
Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics
Missaoui, Siwar; Maalel, Ahmed – Education and Information Technologies, 2021
A student's profile defines the best way a student chooses to learn. It comprises information on student's characteristics such as background knowledge, learning style preference, goals, personality etc. The foremost challenge that the students experience in learning system is that they are unable to bring back relevant information based on their…
Descriptors: Profiles, Models, Computer Games, Cognitive Style
Elliott, Julian G.; Resing, Wilma C. M.; Beckmann, Jens F. – Educational Review, 2018
This paper updates a review of dynamic assessment in education by the first author, published in this journal in 2003. It notes that the original review failed to examine the important conceptual distinction between dynamic testing (DT) and dynamic assessment (DA). While both approaches seek to link assessment and intervention, the former is of…
Descriptors: Alternative Assessment, Educational Assessment, Testing, Intervention
Hoogland, Kees; Tout, Dave – ZDM: The International Journal on Mathematics Education, 2018
In recent decades, technology has influenced various aspects of assessment in mathematics education: (1) supporting the assessment of higher-order thinking skills in mathematics, (2) representing authentic problems from the world around us to use and apply mathematical knowledge and skills, and (3) making the delivery of tests and the analysis of…
Descriptors: Computer Assisted Testing, At Risk Persons, Mathematics Education, Thinking Skills
Hamre, Bjørn; Axelsson, Thom; Ludvigsen, Kari – Paedagogica Historica: International Journal of the History of Education, 2019
This article explores the role of psychiatry in the sorting of schoolchildren in Denmark, Norway, and Sweden from 1920 to 1950. Whereas the role and rise of educational psychology and IQ-testing in the differentiation processes in schooling have been examined through earlier research, the role of psychiatry in the interprofessional collaboration…
Descriptors: Psychiatry, Psychiatric Hospitals, Educational Psychology, Intelligence Tests
Kibler, Amanda K.; Valdés, Guadalupe – Modern Language Journal, 2016
Through examination of one recently manufactured term for language learners (Long-term English Learners) and review of a century of "MLJ" articles, we examine varying "socioinstitutional" conceptualizations of second/foreign/heritage language learners as shaped by educational institutions and related stakeholders over time,…
Descriptors: Second Language Learning, Second Language Instruction, Definitions, Teaching Methods
Sireci, Stephen G.; Faulkner-Bond, Molly – Review of Research in Education, 2015
Across the globe, educational tests are being used at a rapidly increasing rate. More recently, educational tests are being used to inform educational policy and for holding educators accountable for student learning. One reason educational assessments are used for these important purposes is that they are considered to provide reliable and…
Descriptors: English Language Learners, Accountability, Educational Testing, Student Evaluation
Lin, Chuan-Ju – Educational and Psychological Measurement, 2011
This study compares four item selection criteria for a two-category computerized classification testing: (1) Fisher information (FI), (2) Kullback-Leibler information (KLI), (3) weighted log-odds ratio (WLOR), and (4) mutual information (MI), with respect to the efficiency and accuracy of classification decision using the sequential probability…
Descriptors: Computer Assisted Testing, Adaptive Testing, Selection, Test Items
Eggen, Theo J. H. M. – Educational Research and Evaluation, 2011
If classification in a limited number of categories is the purpose of testing, computerized adaptive tests (CATs) with algorithms based on sequential statistical testing perform better than estimation-based CATs (e.g., Eggen & Straetmans, 2000). In these computerized classification tests (CCTs), the Sequential Probability Ratio Test (SPRT) (Wald,…
Descriptors: Test Length, Adaptive Testing, Classification, Item Analysis
Wang, Wen-Chung; Liu, Chen-Wei – Educational and Psychological Measurement, 2011
The generalized graded unfolding model (GGUM) has been recently developed to describe item responses to Likert items (agree-disagree) in attitude measurement. In this study, the authors (a) developed two item selection methods in computerized classification testing under the GGUM, the current estimate/ability confidence interval method and the cut…
Descriptors: Computer Assisted Testing, Adaptive Testing, Classification, Item Response Theory
Thompson, Nathan A. – Practical Assessment, Research & Evaluation, 2011
Computerized classification testing (CCT) is an approach to designing tests with intelligent algorithms, similar to adaptive testing, but specifically designed for the purpose of classifying examinees into categories such as "pass" and "fail." Like adaptive testing for point estimation of ability, the key component is the…
Descriptors: Adaptive Testing, Computer Assisted Testing, Classification, Probability

Peer reviewed
Direct link
