Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Tian, Yan – Research-publishing.net, 2017
Translation is one of the items tested in many national English proficiency tests for non-English majors in China because translation competence is regarded as one of the productive language skills which could be used to assess learners' language proficiency. However, the feedback on translation exercises and self-tests are usually provided by…
Descriptors: Translation, English (Second Language), Second Language Learning, Second Language Instruction
Stanley, Leanne M.; Edwards, Michael C. – Educational and Psychological Measurement, 2016
The purpose of this article is to highlight the distinction between the reliability of test scores and the fit of psychometric measurement models, reminding readers why it is important to consider both when evaluating whether test scores are valid for a proposed interpretation and/or use. It is often the case that an investigator judges both the…
Descriptors: Test Reliability, Goodness of Fit, Scores, Patients
Jerrim, John – Assessment in Education: Principles, Policy & Practice, 2016
The Programme for International Assessment (PISA) is an important cross-national study of 15-year olds academic achievement. Although it has traditionally been conducted using paper-and-pencil tests, the vast majority of countries will use computer-based assessment from 2015. In this paper, we consider how cross-country comparisons of children's…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Banerjee, Jayanti; Papageorgiou, Spiros – International Journal of Listening, 2016
The research reported in this article investigates differential item functioning (DIF) in a listening comprehension test. The study explores the relationship between test-taker age and the items' language domains across multiple test forms. The data comprise test-taker responses (N = 2,861) to a total of 133 unique items, 46 items of which were…
Descriptors: Correlation, High Stakes Tests, Test Items, Listening Comprehension Tests
Anderson, Steven W.; Libarkin, Julie C. – Journal of Geoscience Education, 2016
Nationwide pre- and posttesting of introductory courses with the Geoscience Concept Inventory (GCI) shows little gain for many of its questions. Analysis of more than 3,500 tests shows that 22 of the 73 GCI questions had gains of <0.03, and nearly half of these focused on basic physics and chemistry. We also discovered through an assessment of…
Descriptors: Earth Science, Concept Teaching, Scientific Concepts, Pretests Posttests
Finch, Holmes; Edwards, Julianne M. – Educational and Psychological Measurement, 2016
Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…
Descriptors: Item Response Theory, Computation, Nonparametric Statistics, Bayesian Statistics
Mahmud, Jumailiyah; Sutikno, Muzayanah; Naga, Dali S. – Educational Research and Reviews, 2016
The aim of this study is to determine variance difference between maximum likelihood and expected A posteriori estimation methods viewed from number of test items of aptitude test. The variance presents an accuracy generated by both maximum likelihood and Bayes estimation methods. The test consists of three subtests, each with 40 multiple-choice…
Descriptors: Maximum Likelihood Statistics, Computation, Item Response Theory, Test Items
Chubbuck, Kay; Curley, W. Edward; King, Teresa C. – ETS Research Report Series, 2016
This study gathered quantitative and qualitative evidence concerning gender differences in performance by using critical reading material on the "SAT"® test with sports and science content. The fundamental research questions guiding the study were: If sports and science are to be included in a skills test, what kinds of material are…
Descriptors: College Entrance Examinations, Gender Differences, Critical Reading, Reading Tests
Golino, Hudson F.; Gomes, Cristiano M. A. – International Journal of Research & Method in Education, 2016
This paper presents a non-parametric imputation technique, named random forest, from the machine learning field. The random forest procedure has two main tuning parameters: the number of trees grown in the prediction and the number of predictors used. Fifty experimental conditions were created in the imputation procedure, with different…
Descriptors: Item Response Theory, Regression (Statistics), Difficulty Level, Goodness of Fit
Ellis, Steven; Barber, Jill – Practitioner Research in Higher Education, 2016
In the Manchester Pharmacy School, we first adopted summative on-line examinations in 2005. Since then, we have increased the range of question types to include short answers, short essays and questions incorporating chemical structures and we achieve time savings of up to 90% in the marking process. Online assessments allow two novel forms of…
Descriptors: Foreign Countries, Pharmacy, Higher Education, Feedback (Response)
Ali, Syed Haris; Carr, Patrick A.; Ruit, Kenneth G. – Journal of the Scholarship of Teaching and Learning, 2016
Plausible distractors are important for accurate measurement of knowledge via multiple-choice questions (MCQs). This study demonstrates the impact of higher distractor functioning on validity and reliability of scores obtained on MCQs. Freeresponse (FR) and MCQ versions of a neurohistology practice exam were given to four cohorts of Year 1 medical…
Descriptors: Scores, Multiple Choice Tests, Test Reliability, Test Validity
Latifi, Syed; Bulut, Okan; Gierl, Mark; Christie, Thomas; Jeeva, Shehzad – SAGE Open, 2016
The purpose of this study is to evaluate two methodological perspectives of test fairness using a national Secondary School Certificate (SSC) examinations. SSC is a suit of multi-subject national qualification tests at Grade 10 level in South Asian countries, such as Bangladesh, India, and Pakistan. Because it is a high-stakes test, the fairness…
Descriptors: Foreign Countries, National Competency Tests, Language Tests, Mathematics Tests
Stugart, Melissa – ProQuest LLC, 2016
Our nation is in the midst of one of the largest education reforms in decades centered on the adoption of the Common Core State Standards (CCSS) and aligned assessments. In an era of rising accountability measures and declining literacy proficiency, it is vital to ensure that educational resources, such as benchmark assessments, are appropriately…
Descriptors: Common Core State Standards, Benchmarking, Educational Assessment, Test Items
Lessne, Deborah; Cidade, Melissa – National Center for Education Statistics, 2016
This report outlines the development, methodology, and results of the split-half administration of the 2015 School Crime Supplement (SCS) to the National Crime Victimization Survey (NCVS). The National Crime Victimization Survey (NCVS) is sponsored by the U.S. Department of Justice, Bureau of Justice Statistics (BJS). The National Center for…
Descriptors: National Surveys, Victims of Crime, Bullying, Schools
Kiliçkaya, Ferit – Online Submission, 2016
This study reports initial findings from a small-scale qualitative study aimed at gaining insights into English language teachers' assessment practices in Turkey by examining the formal exam papers. Based on the technique of content analysis, formal exam papers were analyzed in terms of assessment items, language skills tested as well as the…
Descriptors: English (Second Language), Qualitative Research, Content Analysis, Test Items

Peer reviewed
Direct link
