Showing 1 to 15 of 29 results
Peer reviewed
Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022
How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…
Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making
Peer reviewed
Hamdi, Syukrul; Kartowagiran, Badrun; Haryanto – International Journal of Instruction, 2018
The purpose of this study was to develop a testlet-model Mathematics test instrument for classroom assessment at elementary school. The testlet model is a group of multiple-choice questions that acquire similar information with different grades of response. This research was conducted in East Lombok, Indonesia. The design used was research…
Descriptors: Test Items, Models, Elementary School Mathematics, Mathematics Instruction
Peer reviewed
Green, Jeffrey J.; Stone, Courtenay Clifford; Zegeye, Abera – Journal of Education for Business, 2014
Colleges and universities are being asked by numerous sources to provide assurance of learning assessments of their students and programs. Colleges of business have responded by using a plethora of assessment tools, including the Major Field Test in Business. In this article, the authors show that the use of the Major Field Test in Business for…
Descriptors: Business Administration Education, Student Evaluation, Accreditation (Institutions), Comparative Analysis
Han, Kyung T.; Rudner, Lawrence M. – Graduate Management Admission Council, 2014
This study uses mixed integer quadratic programming (MIQP) to construct multiple highly equivalent item pools simultaneously, and compares the results from mixed integer programming (MIP). Three different MIP/MIQP models were implemented and evaluated using real CAT item pool data with 23 different content areas and a goal of equal information…
Descriptors: Item Banks, Programming, Computer Assisted Testing, Adaptive Testing
Hendy, Mohamed H. – Online Submission, 2016
Educational research and practice have proven that there are many benefits to applying the recommendations of learning theories in the teaching and learning of different subjects at all school levels. Based on interrelationships among learning theories of contextualism, connectivism, constructivism, and cognitivism, the researcher proposed an…
Descriptors: Science Instruction, Learning Theories, Models, Instructional Effectiveness
Peer reviewed
Wang, Yan; Mu, Guanglun Michael; Wang, Zhiqing; Deng, Meng; Cheng, Li; Wang, Hongxia – International Journal of Disability, Development and Education, 2015
Classroom support plays a salient role in successful inclusive education, hence it has been widely debated in the literature. Much extant work has only focused on a particular aspect of classroom support. A comprehensive, systematic discussion of classroom support is sporadic in the literature. Relevant research concerning the Chinese context is…
Descriptors: Multidimensional Scaling, Inclusion, Classroom Techniques, Classroom Environment
Peer reviewed
Fernandes Malaquias, Rodrigo; de Oliveira Malaquias, Fernanda Francielle – Turkish Online Journal of Distance Education, 2014
The objective of this study was to validate a scale for assessment of academic projects. As a complement, we examined its predictive ability by comparing the scores of advised/corrected projects based on the model and the final scores awarded to the work by an examining panel (approximately 10 months after the project design). Results of…
Descriptors: Predictive Measurement, Predictive Validity, Predictor Variables, Test Construction
Peer reviewed
Finkelman, Matthew D.; Kim, Wonsuk; Roussos, Louis; Verschoor, Angela – Applied Psychological Measurement, 2010
Automated test assembly (ATA) has been an area of prolific psychometric research. Although ATA methodology is well developed for unidimensional models, its application alongside cognitive diagnosis models (CDMs) is a burgeoning topic. Two suggested procedures for combining ATA and CDMs are to maximize the cognitive diagnostic index and to use a…
Descriptors: Automation, Test Construction, Programming, Models
Chen, Tzu-An – ProQuest LLC, 2010
This simulation study compared the performance of two multilevel measurement testlet (MMMT) models: Beretvas and Walker's (2008) two-level MMMT model and Jiao, Wang, and Kamata's (2005) three-level model. Several conditions were manipulated (including testlet length, sample size, and the pattern of the testlet effects) to assess the impact on the…
Descriptors: Simulation, Item Response Theory, Comparative Analysis, Models
Peer reviewed
Fulmer, Gavin W.; Liang, Ling L. – Journal of Science Education and Technology, 2013
This study tested a student survey to detect differences in instruction between teachers in a modeling-based science program and comparison group teachers. The Instructional Activities Survey measured teachers' frequency of modeling, inquiry, and lecture instruction. Factor analysis and Rasch modeling identified three subscales, Modeling and…
Descriptors: Comparative Analysis, Factor Analysis, Science Instruction, Effect Size
Peer reviewed
Sultan, Parves; Wong, Ho – Quality Assurance in Education: An International Perspective, 2010
Purpose: This paper aims to develop and empirically test the performance-based higher education service quality model. Design/methodology/approach: The study develops a 67-item instrument for measuring performance-based service quality with a particular focus on the higher education sector. Scale reliability is confirmed using Cronbach's alpha.…
Descriptors: Foreign Countries, Higher Education, Quality Control, Program Effectiveness
Peer reviewed
Arendasy, Martin E.; Sommer, Markus; Gittler, Georg – Intelligence, 2010
Marked gender differences in three-dimensional mental rotation have been broadly reported in the literature in the last few decades. Various theoretical models and accounts were used to explain the observed differences. Within the framework of linking item design features of mental rotation tasks to cognitive component processes associated with…
Descriptors: Cues, Females, Models, Protocol Analysis
Peer reviewed
Mueller Gathercole, Virginia C.; Thomas, Enlli Mon; Hughes, Emma – International Journal of Bilingual Education and Bilingualism, 2008
The purpose of this paper is to propose an applied model for the assessment of bilingual children's language abilities with standardised tests. We discuss the purposes of such tests, especially in relation to vocabulary knowledge, and potential applications of test results for each of those purposes. The specific case to be examined here is that…
Descriptors: Test Results, Language Tests, Monolingualism, Vocabulary Development
Peer reviewed
Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2008
Three strategies for linking two consecutive assessments are investigated and compared by analyzing reading data for the National Assessment of Educational Progress (NAEP) using the general diagnostic model. These strategies are compared in terms of marginal and joint expectations of skills, joint probabilities of skill patterns, and item…
Descriptors: National Competency Tests, Probability, Reading Achievement, Test Items
Garrison, Wayne M.; White, Karl R. – 1979
Rasch and classical test analysis methods were compared with respect to their similarities and differences in the identification of noninformative items and implausible person records. Using computer simulated data with known parameters, each model was evaluated in terms of its effectiveness in: (1) identifying noninformative or "bad"…
Descriptors: Comparative Analysis, Item Analysis, Models, Monte Carlo Methods