ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	1
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	15

Descriptor

Comparative Analysis	29
Models	29
Test Construction	29
Foreign Countries	8
Computer Assisted Testing	6
Scores	6
Test Items	6
Adaptive Testing	5
Evaluation Methods	5
Item Response Theory	5
Measurement Techniques	4
Psychometrics	4
Test Validity	4
Cognitive Tests	3
Criterion Referenced Tests	3
Decision Making	3
Educational Testing	3
Elementary Secondary Education	3
Factor Analysis	3
Higher Education	3
Interviews	3
Item Analysis	3
Questionnaires	3
Test Reliability	3
Achievement Tests	2
More ▼

Publication Type

Journal Articles	17
Reports - Research	16
Reports - Evaluative	7
Speeches/Meeting Papers	5
Collected Works - Proceedings	1
Dissertations/Theses -…	1
Information Analyses	1
Reports - Descriptive	1

Education Level

Postsecondary Education	5
Elementary Secondary Education	4
Higher Education	4
Adult Education	2
Elementary Education	2
Secondary Education	2
Grade 4	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1

Audience

Researchers

Location

China	2
Arkansas	1
Brazil	1
Egypt	1
Ghana	1
Indonesia	1
Japan	1
Netherlands	1
Netherlands (Amsterdam)	1
Singapore	1
South Korea	1
Taiwan	1
Turkey	1
Virginia	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Graduate Management Admission…	1
Major Field Achievement Test…	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 29 results Save | Export

The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues

Peer reviewed
PDF on ERIC

Download full text

Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022

How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…

Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making

Developing a Testlet Model for Mathematics at Elementary Level

Peer reviewed
PDF on ERIC

Download full text

Hamdi, Syukrul; Kartowagiran, Badrun; Haryanto – International Journal of Instruction, 2018

The purpose of this study was to develop a Mathematics test instrument testlet model for a classroom assessment at elementary school. Testlet Model is a group of multiple choice question acquiring similar information with different grade of responses model. This research was conducted in East Lombok, Indonesia. The design used was research…

Descriptors: Test Items, Models, Elementary School Mathematics, Mathematics Instruction

The Major Field Test in Business: A Solution to the Problem of Assurance of Learning Assessment?

Peer reviewed

Direct link

Green, Jeffrey J.; Stone, Courtenay Clifford; Zegeye, Abera – Journal of Education for Business, 2014

Colleges and universities are being asked by numerous sources to provide assurance of learning assessments of their students and programs. Colleges of business have responded by using a plethora of assessment tools, including the Major Field Test in Business. In this article, the authors show that the use of the Major Field Test in Business for…

Descriptors: Business Administration Education, Student Evaluation, Accreditation (Institutions), Comparative Analysis

Item Pool Construction Using Mixed Integer Quadratic Programming (MIQP). GMAC® Research Report RR-14-01

Download full text

Han, Kyung T.; Rudner, Lawrence M. – Graduate Management Admission Council, 2014

This study uses mixed integer quadratic programming (MIQP) to construct multiple highly equivalent item pools simultaneously, and compares the results from mixed integer programming (MIP). Three different MIP/MIQP models were implemented and evaluated using real CAT item pool data with 23 different content areas and a goal of equal information…

Descriptors: Item Banks, Programming, Computer Assisted Testing, Adaptive Testing

The Effect of Using Hendy's 4Cs Model on Teaching and Learning Science in Middle School in Mid-Egypt

Download full text

Hendy, Mohamed H. – Online Submission, 2016

Educational research and practice have proven that there are many benefits for applying learning theories' recommendations through teaching and learning of different subjects in all school levels. Based on interrelationships among learning theories of contextualism, connectivism, constructivism, and cognitivism, the researcher proposed an…

Descriptors: Science Instruction, Learning Theories, Models, Instructional Effectiveness

Multidimensional Classroom Support to Inclusive Education Teachers in Beijing, China

Peer reviewed

Direct link

Wang, Yan; Mu, Guanglun Michael; Wang, Zhiqing; Deng, Meng; Cheng, Li; Wang, Hongxia – International Journal of Disability, Development and Education, 2015

Classroom support plays a salient role in successful inclusive education, hence it has been widely debated in the literature. Much extant work has only focused on a particular aspect of classroom support. A comprehensive, systematic discussion of classroom support is sporadic in the literature. Relevant research concerning the Chinese context is…

Descriptors: Multidimensional Scaling, Inclusion, Classroom Techniques, Classroom Environment

Project Evaluation: Validation of a Scale and Analysis of Its Predictive Capacity

Peer reviewed
PDF on ERIC

Download full text

Fernandes Malaquias, Rodrigo; de Oliveira Malaquias, Fernanda Francielle – Turkish Online Journal of Distance Education, 2014

The objective of this study was to validate a scale for assessment of academic projects. As a complement, we examined its predictive ability by comparing the scores of advised/corrected projects based on the model and the final scores awarded to the work by an examining panel (approximately 10 months after the project design). Results of…

Descriptors: Predictive Measurement, Predictive Validity, Predictor Variables, Test Construction

A Binary Programming Approach to Automated Test Assembly for Cognitive Diagnosis Models

Peer reviewed

Direct link

Finkelman, Matthew D.; Kim, Wonsuk; Roussos, Louis; Verschoor, Angela – Applied Psychological Measurement, 2010

Automated test assembly (ATA) has been an area of prolific psychometric research. Although ATA methodology is well developed for unidimensional models, its application alongside cognitive diagnosis models (CDMs) is a burgeoning topic. Two suggested procedures for combining ATA and CDMs are to maximize the cognitive diagnostic index and to use a…

Descriptors: Automation, Test Construction, Programming, Models

Random or Fixed Testlet Effects: A Comparison of Two Multilevel Testlet Models

Direct link

Chen, Tzu-An – ProQuest LLC, 2010

This simulation study compared the performance of two multilevel measurement testlet (MMMT) models: Beretvas and Walker's (2008) two-level MMMT model and Jiao, Wang, and Kamata's (2005) three-level model. Several conditions were manipulated (including testlet length, sample size, and the pattern of the testlet effects) to assess the impact on the…

Descriptors: Simulation, Item Response Theory, Comparative Analysis, Models

Measuring Model-Based High School Science Instruction: Development and Application of a Student Survey

Peer reviewed

Direct link

Fulmer, Gavin W.; Liang, Ling L. – Journal of Science Education and Technology, 2013

This study tested a student survey to detect differences in instruction between teachers in a modeling-based science program and comparison group teachers. The Instructional Activities Survey measured teachers' frequency of modeling, inquiry, and lecture instruction. Factor analysis and Rasch modeling identified three subscales, Modeling and…

Descriptors: Comparative Analysis, Factor Analysis, Science Instruction, Effect Size

Performance-Based Service Quality Model: An Empirical Study on Japanese Universities

Peer reviewed

Direct link

Sultan, Parves; Wong, Ho – Quality Assurance in Education: An International Perspective, 2010

Purpose: This paper aims to develop and empirically test the performance-based higher education service quality model. Design/methodology/approach: The study develops 67-item instrument for measuring performance-based service quality with a particular focus on the higher education sector. Scale reliability is confirmed using the Cronbach's alpha.…

Descriptors: Foreign Countries, Higher Education, Quality Control, Program Effectiveness

Combining Automatic Item Generation and Experimental Designs to Investigate the Contribution of Cognitive Components to the Gender Difference in Mental Rotation

Peer reviewed

Direct link

Arendasy, Martin E.; Sommer, Markus; Gittler, Georg – Intelligence, 2010

Marked gender differences in three-dimensional mental rotation have been broadly reported in the literature in the last few decades. Various theoretical models and accounts were used to explain the observed differences. Within the framework of linking item design features of mental rotation tasks to cognitive component processes associated with…

Descriptors: Cues, Females, Models, Protocol Analysis

Designing a Normed Receptive Vocabulary Test for Bilingual Populations: A Model from Welsh

Peer reviewed

Direct link

Mueller Gathercole, Virginia C.; Thomas, Enlli Mon; Hughes, Emma – International Journal of Bilingual Education and Bilingualism, 2008

The purpose of this paper is to propose an applied model for the assessment of bilingual children's language abilities with standardised tests. We discuss the purposes of such tests, especially in relation to vocabulary knowledge, and potential applications of test results for each of those purposes. The specific case to be examined here is that…

Descriptors: Test Results, Language Tests, Monolingualism, Vocabulary Development

Linking for the General Diagnostic Model. Research Report. ETS RR-08-08

Peer reviewed
PDF on ERIC

Download full text

Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2008

Three strategies for linking two consecutive assessments are investigated and compared by analyzing reading data for the National Assessment of Educational Progress (NAEP) using the general diagnostic model. These strategies are compared in terms of marginal and joint expectations of skills, joint probabilities of skill patterns, and item…

Descriptors: National Competency Tests, Probability, Reading Achievement, Test Items

A Simulation Study on the Utility of Rasch and Classical Test Analysis Procedures.

Garrison, Wayne M.; White, Karl R. – 1979

Rasch and classical test analysis methods were compared with respect to their similarities and differences in the identification of noninformative items and implausible person records. Using computer simulated data with known parameters, each model was evaluated in terms of its effectiveness in: (1) identifying noninformative or "bad"…

Descriptors: Comparative Analysis, Item Analysis, Models, Monte Carlo Methods

Previous Page | Next Page »

Pages: 1 | 2

Applied Psychological…	1
Association for Educational…	1
ETS Research Report Series	1
Educational and Psychological…	1
Evaluation in Education:…	1
Graduate Management Admission…	1
Intelligence	1
International Educational…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Applied Testing…	1
Journal of Education for…	1
Journal of Educational…	1
Journal of Science Education…	1
Language Learning	1
Online Submission	1
ProQuest LLC	1
Quality Assurance in…	1
Science Education	1
Social Indicators Research	1
Turkish Online Journal of…	1
More ▼

Aiga, Hirotsugu	1
Arendasy, Martin E.	1
Carlson, Gaylen R.	1
Chen, Tzu-An	1
Cheng, Li	1
Clark, John L. D.	1
Comijs, Hannie	1
Deng, Meng	1
Edirisooriya, Gunapala	1
Fernandes Malaquias, Rodrigo	1
Finkelman, Matthew D.	1
Frick, Theodore W.	1
Fulmer, Gavin W.	1
Garrison, Wayne M.	1
Gittler, Georg	1
Green, Jeffrey J.	1
Haladyna, Thomas M.	1
Hamdi, Syukrul	1
Han, Kyung T.	1
Haryanto	1
Hendy, Mohamed H.	1
Hughes, Emma	1
Kartowagiran, Badrun	1
Kim, Wonsuk	1
More ▼