Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 2 |
| Since 2007 (last 20 years) | 15 |
Descriptor
| Comparative Analysis | 29 |
| Models | 29 |
| Test Construction | 29 |
| Foreign Countries | 8 |
| Computer Assisted Testing | 6 |
| Scores | 6 |
| Test Items | 6 |
| Adaptive Testing | 5 |
| Evaluation Methods | 5 |
| Item Response Theory | 5 |
| Measurement Techniques | 4 |
| More ▼ | |
Source
Author
Publication Type
| Journal Articles | 17 |
| Reports - Research | 16 |
| Reports - Evaluative | 7 |
| Speeches/Meeting Papers | 5 |
| Collected Works - Proceedings | 1 |
| Dissertations/Theses -… | 1 |
| Information Analyses | 1 |
| Reports - Descriptive | 1 |
Education Level
| Postsecondary Education | 5 |
| Elementary Secondary Education | 4 |
| Higher Education | 4 |
| Adult Education | 2 |
| Elementary Education | 2 |
| Secondary Education | 2 |
| Grade 4 | 1 |
| Intermediate Grades | 1 |
| Junior High Schools | 1 |
| Middle Schools | 1 |
Audience
| Researchers | 1 |
Location
| China | 2 |
| Arkansas | 1 |
| Brazil | 1 |
| Egypt | 1 |
| Ghana | 1 |
| Indonesia | 1 |
| Japan | 1 |
| Netherlands | 1 |
| Netherlands (Amsterdam) | 1 |
| Singapore | 1 |
| South Korea | 1 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
| Graduate Management Admission… | 1 |
| Major Field Achievement Test… | 1 |
| National Assessment of… | 1 |
What Works Clearinghouse Rating
The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues
Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022
How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…
Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making
Hamdi, Syukrul; Kartowagiran, Badrun; Haryanto – International Journal of Instruction, 2018
The purpose of this study was to develop a Mathematics test instrument testlet model for a classroom assessment at elementary school. Testlet Model is a group of multiple choice question acquiring similar information with different grade of responses model. This research was conducted in East Lombok, Indonesia. The design used was research…
Descriptors: Test Items, Models, Elementary School Mathematics, Mathematics Instruction
Green, Jeffrey J.; Stone, Courtenay Clifford; Zegeye, Abera – Journal of Education for Business, 2014
Colleges and universities are being asked by numerous sources to provide assurance of learning assessments of their students and programs. Colleges of business have responded by using a plethora of assessment tools, including the Major Field Test in Business. In this article, the authors show that the use of the Major Field Test in Business for…
Descriptors: Business Administration Education, Student Evaluation, Accreditation (Institutions), Comparative Analysis
Han, Kyung T.; Rudner, Lawrence M. – Graduate Management Admission Council, 2014
This study uses mixed integer quadratic programming (MIQP) to construct multiple highly equivalent item pools simultaneously, and compares the results from mixed integer programming (MIP). Three different MIP/MIQP models were implemented and evaluated using real CAT item pool data with 23 different content areas and a goal of equal information…
Descriptors: Item Banks, Programming, Computer Assisted Testing, Adaptive Testing
The Effect of Using Hendy's 4Cs Model on Teaching and Learning Science in Middle School in Mid-Egypt
Hendy, Mohamed H. – Online Submission, 2016
Educational research and practice have proven that there are many benefits for applying learning theories' recommendations through teaching and learning of different subjects in all school levels. Based on interrelationships among learning theories of contextualism, connectivism, constructivism, and cognitivism, the researcher proposed an…
Descriptors: Science Instruction, Learning Theories, Models, Instructional Effectiveness
Wang, Yan; Mu, Guanglun Michael; Wang, Zhiqing; Deng, Meng; Cheng, Li; Wang, Hongxia – International Journal of Disability, Development and Education, 2015
Classroom support plays a salient role in successful inclusive education, hence it has been widely debated in the literature. Much extant work has only focused on a particular aspect of classroom support. A comprehensive, systematic discussion of classroom support is sporadic in the literature. Relevant research concerning the Chinese context is…
Descriptors: Multidimensional Scaling, Inclusion, Classroom Techniques, Classroom Environment
Fernandes Malaquias, Rodrigo; de Oliveira Malaquias, Fernanda Francielle – Turkish Online Journal of Distance Education, 2014
The objective of this study was to validate a scale for assessment of academic projects. As a complement, we examined its predictive ability by comparing the scores of advised/corrected projects based on the model and the final scores awarded to the work by an examining panel (approximately 10 months after the project design). Results of…
Descriptors: Predictive Measurement, Predictive Validity, Predictor Variables, Test Construction
Finkelman, Matthew D.; Kim, Wonsuk; Roussos, Louis; Verschoor, Angela – Applied Psychological Measurement, 2010
Automated test assembly (ATA) has been an area of prolific psychometric research. Although ATA methodology is well developed for unidimensional models, its application alongside cognitive diagnosis models (CDMs) is a burgeoning topic. Two suggested procedures for combining ATA and CDMs are to maximize the cognitive diagnostic index and to use a…
Descriptors: Automation, Test Construction, Programming, Models
Chen, Tzu-An – ProQuest LLC, 2010
This simulation study compared the performance of two multilevel measurement testlet (MMMT) models: Beretvas and Walker's (2008) two-level MMMT model and Jiao, Wang, and Kamata's (2005) three-level model. Several conditions were manipulated (including testlet length, sample size, and the pattern of the testlet effects) to assess the impact on the…
Descriptors: Simulation, Item Response Theory, Comparative Analysis, Models
Fulmer, Gavin W.; Liang, Ling L. – Journal of Science Education and Technology, 2013
This study tested a student survey to detect differences in instruction between teachers in a modeling-based science program and comparison group teachers. The Instructional Activities Survey measured teachers' frequency of modeling, inquiry, and lecture instruction. Factor analysis and Rasch modeling identified three subscales, Modeling and…
Descriptors: Comparative Analysis, Factor Analysis, Science Instruction, Effect Size
Sultan, Parves; Wong, Ho – Quality Assurance in Education: An International Perspective, 2010
Purpose: This paper aims to develop and empirically test the performance-based higher education service quality model. Design/methodology/approach: The study develops 67-item instrument for measuring performance-based service quality with a particular focus on the higher education sector. Scale reliability is confirmed using the Cronbach's alpha.…
Descriptors: Foreign Countries, Higher Education, Quality Control, Program Effectiveness
Arendasy, Martin E.; Sommer, Markus; Gittler, Georg – Intelligence, 2010
Marked gender differences in three-dimensional mental rotation have been broadly reported in the literature in the last few decades. Various theoretical models and accounts were used to explain the observed differences. Within the framework of linking item design features of mental rotation tasks to cognitive component processes associated with…
Descriptors: Cues, Females, Models, Protocol Analysis
Mueller Gathercole, Virginia C.; Thomas, Enlli Mon; Hughes, Emma – International Journal of Bilingual Education and Bilingualism, 2008
The purpose of this paper is to propose an applied model for the assessment of bilingual children's language abilities with standardised tests. We discuss the purposes of such tests, especially in relation to vocabulary knowledge, and potential applications of test results for each of those purposes. The specific case to be examined here is that…
Descriptors: Test Results, Language Tests, Monolingualism, Vocabulary Development
Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2008
Three strategies for linking two consecutive assessments are investigated and compared by analyzing reading data for the National Assessment of Educational Progress (NAEP) using the general diagnostic model. These strategies are compared in terms of marginal and joint expectations of skills, joint probabilities of skill patterns, and item…
Descriptors: National Competency Tests, Probability, Reading Achievement, Test Items
Garrison, Wayne M.; White, Karl R. – 1979
Rasch and classical test analysis methods were compared with respect to their similarities and differences in the identification of noninformative items and implausible person records. Using computer simulated data with known parameters, each model was evaluated in terms of its effectiveness in: (1) identifying noninformative or "bad"…
Descriptors: Comparative Analysis, Item Analysis, Models, Monte Carlo Methods
Previous Page | Next Page »
Pages: 1 | 2
Peer reviewed
Direct link
