Showing 1 to 15 of 53 results
Peer reviewed
Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022
While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…
Descriptors: Scoring, Testing, Test Items, Test Format
Han, Lu – ProQuest LLC, 2022
This dissertation study explored the feasibility of using authenticated spoken texts to test L2 Chinese listening comprehension. The spoken texts used in the study were created using an "authenticating" technique, in which scripted spoken Chinese texts were infused with characteristics of real-world, unscripted spoken Chinese. In the…
Descriptors: Second Language Learning, Second Language Instruction, Listening Comprehension Tests, Chinese
Cresswell, John; Schwantner, Ursula; Waters, Charlotte – OECD Publishing, 2015
This report reviews the major international and regional large-scale educational assessments, including international surveys, school-based surveys and household-based surveys. The report compares and contrasts the cognitive and contextual data collection instruments and implementation methods used by the different assessments in order to identify…
Descriptors: International Assessment, Educational Assessment, Data Collection, Comparative Analysis
Peer reviewed
Prasad, Joshua J.; Showler, Morgan B.; Schmitt, Neal; Ryan, Ann Marie; Nye, Christopher D. – International Journal of Testing, 2017
The present research compares the operation of situational judgement and biodata measures between Chinese and U.S. respondents. We describe the development and past research on both measures, followed by hypothesized differences across the two groups of respondents. We base hypotheses on the nature of the Chinese and U.S. educational systems and…
Descriptors: Measures (Individuals), Hypothesis Testing, Cross Cultural Studies, Comparative Analysis
Australian Council for Educational Research, 2015
Monitoring Trends in Educational Growth (MTEG) offers a flexible, collaborative approach to developing and implementing an assessment of learning outcomes that yields high-quality, nationally relevant data. MTEG is a service that involves ACER staff working closely with each country to develop an assessment program that meets the country's…
Descriptors: Educational Development, Educational Trends, Progress Monitoring, Educational Quality
Wagemaker, Hans, Ed. – International Association for the Evaluation of Educational Achievement, 2020
Although International Association for the Evaluation of Educational Achievement-pioneered international large-scale assessment (ILSA) of education is now a well-established science, non-practitioners and many users often substantially misunderstand how large-scale assessments are conducted, what questions and challenges they are designed to…
Descriptors: International Assessment, Achievement Tests, Educational Assessment, Comparative Analysis
Peer reviewed
Chen, Haiwen H.; von Davier, Matthias; Yamamoto, Kentaro; Kong, Nan – ETS Research Report Series, 2015
One major issue with large-scale assessments is that the respondents might give no responses to many items, resulting in less accurate estimations of both assessed abilities and item parameters. This report studies how the types of items affect the item-level nonresponse rates and how different methods of treating item-level nonresponses have an…
Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students
Zhang, Bin – ProQuest LLC, 2012
Social scientists are usually more interested in consumers' dichotomous choices, such as whether to purchase a product or adopt a technology. However, to date, almost no model can help solve the problem of comparing multi-network effects with a dichotomous dependent variable. Furthermore, the study of multi-network…
Descriptors: Social Networks, Network Analysis, Comparative Analysis, Population Groups
Peer reviewed
Detterman, Douglas K. – Intelligence, 2011
Watson's Jeopardy victory raises the question of the similarity of artificial intelligence and human intelligence. Those of us who study human intelligence issue a challenge to the artificial intelligence community. We will construct a unique battery of tests for any computer that would provide an actual IQ score for the computer. This is the same…
Descriptors: Artificial Intelligence, Intelligence, Human Body, Comparative Analysis
Peer reviewed
Shawer, Saad F. – Journal of Further and Higher Education, 2013
This quantitative investigation examined the influence of low and high self-efficacy on candidate teacher academic performance in a foreign language teaching methodology course through testing the speculation that high self-efficacy levels would improve pedagogical-content knowledge (PCK). Positivism guided the research design at the levels of…
Descriptors: Teacher Education, Preservice Teachers, Self Efficacy, Academic Achievement
Peer reviewed
Chavez, Oscar; Papick, Ira; Ross, Daniel J.; Grouws, Douglas A. – Mathematics Education Research Journal, 2011
This article describes the process of development of assessment instruments for a three-year longitudinal comparative study that focused on evaluating American high school students' mathematics learning from two distinct approaches to content organization: curriculum built around a sequence of three full-year courses (Algebra 1, Geometry, and…
Descriptors: Mathematics Curriculum, Mathematics Education, Interrater Reliability, Scoring Rubrics
Peer reviewed
Christ, Theodore J.; Riley-Tillman, T. Chris; Chafouleas, Sandra; Jaffery, Rosemary – School Psychology Review, 2011
The method of Direct Behavior Rating (DBR) incorporates aspects of both systematic direct observation and behavior rating scales to provide an efficient means to collect time series data. This study extended the development and evaluation of DBR Single-Item Scales (DBR-SIS) as a behavior assessment tool. Eighty-eight undergraduate students used…
Descriptors: Video Technology, Behavior Problems, Student Behavior, Observation
Kowal, Julie; Hassel, Emily Ayscue – Public Impact, 2010
For too long, performance measurement systems in education have failed to document and recognize real differences among educators. But a recent national push to use performance evaluations for critical personnel decisions has highlighted the shortcomings of the current systems and increased the urgency to dramatically improve them. As state and…
Descriptors: Teaching (Occupation), Teacher Evaluation, Performance Based Assessment, Comparative Analysis
Kaliski, Pamela; Huff, Kristen; Barry, Carol – College Board, 2011
For educational achievement tests that employ multiple-choice (MC) items and aim to reliably classify students into performance categories, it is critical to design MC items that are capable of discriminating student performance according to the stated achievement levels. This is accomplished, in part, by clearly understanding how item design…
Descriptors: Alignment (Education), Academic Achievement, Expertise, Evaluative Thinking
Peer reviewed
Pyle, Katie; Jones, Emily; Williams, Chris; Morrison, Jo – Educational Research, 2009
Background: All national curriculum tests in England are pre-tested as part of the development process. Differences in pupil performance between pre-test and live test are consistently found. This difference has been termed the pre-test effect. Understanding the pre-test effect is essential in the test development and selection processes and in…
Descriptors: Foreign Countries, Pretesting, Context Effect, National Curriculum