Publication Date
| In 2026 | 34 |
| Since 2025 | 2433 |
| Since 2022 (last 5 years) | 12948 |
| Since 2017 (last 10 years) | 34073 |
| Since 2007 (last 20 years) | 68564 |
Descriptor
| Foreign Countries | 30631 |
| Test Validity | 21786 |
| Scores | 18282 |
| Academic Achievement | 16944 |
| Test Construction | 16779 |
| Test Reliability | 15055 |
| Achievement Tests | 14883 |
| Standardized Tests | 14734 |
| Comparative Analysis | 14432 |
| Elementary Secondary Education | 13052 |
| Language Tests | 12558 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 5034 |
| Teachers | 3394 |
| Researchers | 2630 |
| Policymakers | 1232 |
| Administrators | 979 |
| Students | 687 |
| Parents | 325 |
| Counselors | 216 |
| Community | 162 |
| Support Staff | 50 |
| Media Staff | 34 |
| More ▼ | |
Location
| Turkey | 2831 |
| Australia | 2433 |
| Canada | 2273 |
| California | 1857 |
| United States | 1729 |
| Texas | 1618 |
| China | 1583 |
| United Kingdom | 1316 |
| Florida | 1312 |
| United Kingdom (England) | 1205 |
| Germany | 1125 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 121 |
| Meets WWC Standards with or without Reservations | 189 |
| Does not meet standards | 174 |
Lee, Elizabeth – Studies in Applied Linguistics & TESOL, 2020
Ensuring that test-score use brings about socially positive consequences for test-takers is an important aspect of test validation. While many studies use an inductive approach to evaluate test consequences, few studies have implemented Appraisal analysis. To that end, this case study investigated the test consequences of an English reading…
Descriptors: English (Second Language), Second Language Learning, Language Tests, Reading Tests
Wang, Lu; Steedle, Jeffrey – ACT, Inc., 2020
In recent ACT mode comparability studies, students testing on laptop or desktop computers earned slightly higher scores on average than students who tested on paper, especially on the ACT® reading and English tests (Li et al., 2017). Equating procedures adjust for such "mode effects" to make ACT scores comparable regardless of testing…
Descriptors: Test Format, Reading Tests, Language Tests, English
Elturki, Eman – English Teaching Forum, 2020
Accrediting agencies for English language programs, such as the Commission on English Language Program Accreditation (CEA), require a plan in writing for monitoring and reviewing assessment practices. Nonetheless, web-search queries such as "assessing assessment," "how to assess assessment," "assessing assessment…
Descriptors: College Second Language Programs, English (Second Language), Student Evaluation, Test Reliability
ShayesteFar, Parvaneh – Educational Assessment, Evaluation and Accountability, 2020
Research on test change often documents high-stakes English test impact on English language learning, whereas evidence for simultaneous impact on affective predictors of learning is still missing. We tested a theoretical model positing that changing high-stake English tests (English Language Requirements for University Entrance, in this study)…
Descriptors: Language Tests, High Stakes Tests, English Language Learners, Student Attitudes
Turhan, Nihan Sölpük – International Journal of Progressive Education, 2020
Measurement tools that are used in education are important factors that affect course success and motivation of students. This study aims to determine the opinions of high school students on different question types. As the subgoals of the research, the study aims to determine the reasons for multiple choice test preference and its effect on…
Descriptors: Test Items, Preferences, High School Students, Learning Motivation
Priyatni, Endah Tri; Martutik – SAGE Open, 2020
The ability to think critically and creatively is essential for students to help them thrive in the 21st century. Creative and critical thinking can be measured through problem solving because the assessment contains tasks that require students to find problems, analyze and evaluate problems, and work out the solutions. Therefore, this study was…
Descriptors: Problem Solving, Reading Tests, Test Construction, Test Validity
Gasteiger, Hedwig; Bruns, Julia; Benz, Christiane; Brunner, Esther; Sprenger, Priska – ZDM: The International Journal on Mathematics Education, 2020
Measurement instruments of early childhood teachers' mathematical pedagogical content knowledge (MPCK) have to consider the special characteristics of early childhood teaching. Early childhood teaching includes some planned activities but in contrast to learning in school, it is often motivated and generated by situations which unfold…
Descriptors: Mathematics Instruction, Pedagogical Content Knowledge, Multiple Choice Tests, Kindergarten
Lim, Lyndon – Journal of Psychoeducational Assessment, 2020
This article outlines the development and validation of the Computer-Delivered Test (CDT) Acceptance Questionnaire (CTAQ). The CTAQ was designed to be a practical measure of CDT acceptance of Singapore secondary and high school students (Grades 7-12) toward taking tests within an e-assessment system. The stages of test (questionnaire item)…
Descriptors: Student Attitudes, High School Students, Secondary School Students, Computer Assisted Testing
Senan, Nabil Ahmed Mareai; Sulphey, M. M. – Education & Training, 2022
Purpose: Globally, serious doubts are now expressed about the quality of accounting education, and employers are concerned about the lack of employability among graduates. There is a lack of a validated tool to measure employability in the Saudi Arabia context. Such a tool is required to assess the level of employability so that required…
Descriptors: Test Construction, Test Validity, Employment Potential, Questionnaires
D'Souza, Derrick E.; Bement, Danuse; Cory, Kenneth – Decision Sciences Journal of Innovative Education, 2022
Practitioner surveys suggest that despite well-intentioned efforts, undergraduate business programs could better equip students with "soft skills." This research study focuses on the soft skills associated with cross-functional integration (CFI), where skill gaps are believed to exist but have not been confirmed. As the first study to…
Descriptors: Business Schools, Undergraduate Study, Business Administration Education, Soft Skills
Vuorre, Matti; Metcalfe, Janet – Metacognition and Learning, 2022
This article investigates the concern that assessment of metacognitive resolution (or relative accuracy--often evaluated by gamma correlations or signal detection theoretic measures such as d[subscript a]) is vulnerable to an artifact due to guessing that differentially impacts low as compared to high performers on tasks that involve…
Descriptors: Metacognition, Accuracy, Memory, Multiple Choice Tests
Polat, Murat; Turhan, Nihan S.; Toraman, Cetin – Pegem Journal of Education and Instruction, 2022
Testing English writing skills could be multi-dimensional; thus, the study aimed to compare students' writing scores calculated according to Classical Test Theory (CTT) and Multi-Facet Rasch Model (MFRM). The research was carried out in 2019 with 100 university students studying at a foreign language preparatory class and four experienced…
Descriptors: Comparative Analysis, Test Theory, Item Response Theory, Student Evaluation
Yan, Zi; Pastore, Serafina – Journal of Psychoeducational Assessment, 2022
A significant challenge in studying formative assessment is the lack of suitable instruments for assessing teachers' formative assessment practices. This paper reports the development of the Teacher Formative Assessment Practice Scale (TFAPS) and its psychometric properties based on two samples of primary and secondary school teachers: one from…
Descriptors: Formative Evaluation, Educational Strategies, Foreign Countries, Elementary School Teachers
Williamson, Joanna – Research Matters, 2022
Providing evidence that can inform awarding is an important application of Comparative Judgement (CJ) methods in high-stakes qualifications. The process of marking scripts is not changed, but CJ methods can assist in the maintenance of standards from one series to another by informing decisions about where to place grade boundaries or cut scores.…
Descriptors: Standards, Grading, Decision Making, Comparative Analysis
Gill, Tim – Research Matters, 2022
In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…
Descriptors: Comparative Analysis, Decision Making, Scripts, Standards

Peer reviewed
Direct link
