Publication Date
| In 2026 | 0 |
| Since 2025 | 389 |
| Since 2022 (last 5 years) | 1887 |
| Since 2017 (last 10 years) | 4031 |
| Since 2007 (last 20 years) | 6737 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 644 |
| Teachers | 455 |
| Researchers | 440 |
| Administrators | 126 |
| Policymakers | 68 |
| Students | 68 |
| Counselors | 26 |
| Parents | 24 |
| Community | 10 |
| Support Staff | 5 |
| Media Staff | 3 |
| More ▼ | |
Location
| Turkey | 603 |
| Australia | 339 |
| Canada | 254 |
| China | 180 |
| Indonesia | 147 |
| United States | 143 |
| United Kingdom | 130 |
| Germany | 116 |
| Taiwan | 111 |
| California | 109 |
| Spain | 107 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 3 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 2 |
Debeer, Dries; Ali, Usama S.; van Rijn, Peter W. – Journal of Educational Measurement, 2017
Test assembly is the process of selecting items from an item pool to form one or more new test forms. Often new test forms are constructed to be parallel with an existing (or an ideal) test. Within the context of item response theory, the test information function (TIF) or the test characteristic curve (TCC) are commonly used as statistical…
Descriptors: Test Format, Test Construction, Statistical Analysis, Comparative Analysis
Newton, Paul E. – Educational Measurement: Issues and Practice, 2017
The dominant narrative for assessment design seems to reflect a strong, albeit largely implicit undercurrent of purpose purism, which idealizes the principle that assessment design should be driven by a single assessment purpose. With a particular focus on achievement assessments, the present article questions the tenability of purpose purism,…
Descriptors: Evaluation Methods, Test Construction, Instructional Design, Decision Making
Peterson, Christina Hamme; Peterson, N. Andrew; Powell, Kristen Gilmore – Measurement and Evaluation in Counseling and Development, 2017
Cognitive interviewing (CI) is a method to identify sources of confusion in assessment items and to assess validity evidence on the basis of content and response processes. We introduce readers to CI and describe a process for conducting such interviews and analyzing the results. Recommendations for best practice are provided.
Descriptors: Test Items, Test Construction, Interviews, Test Validity
Miller, Faith G.; Riley-Tillman, T. Chris; Chafouleas, Sandra M.; Schardt, Alyssa A. – Assessment for Effective Intervention, 2017
The purpose of this study was to investigate the impact of two different Direct Behavior Rating--Single Item Scale (DBR-SIS) formats on rating accuracy. A total of 119 undergraduate students participated in one of two study conditions, each utilizing a different DBR-SIS scale format: one that included percentage of time anchors on the DBR-SIS…
Descriptors: Behavior Rating Scales, Test Format, Accuracy, Undergraduate Students
Hilton, Charlotte Emma – International Journal of Social Research Methodology, 2017
The development of questionnaires, surveys and psychometric scales is an iterative research process that includes a number of carefully planned stages. Pretesting is a method of checking that questions work as intended and are understood by those individuals who are likely to respond to them. However, detailed reports of appropriate methods to…
Descriptors: Questionnaires, Pretesting, Interviews, Test Construction
Shepard, Lorrie A. – Assessment in Education: Principles, Policy & Practice, 2016
The AERA, APA, NCME Standards define validity as "the degree to which evidence and theory support the interpretations of test scores for proposed uses of tests". A century of disagreement about validity does not mean that there has not been substantial progress. This consensus definition brings together interpretations and use so that it…
Descriptors: Test Validity, Standards, Test Theory, Evidence
Gramipour, Masoud; Shariatmadari, Mehdi; Mahdi, Somayeh – Journal of Pedagogical Research, 2019
The purpose of this study was to design a comprehensive and native scale, and to investigate the validity and reliability of teachers' academic emotions scale including anxiety, happiness, anger, pride, hope and despair, exhaustion, shame and guilt through a nine-factors TAE model (second order hierarchy) and a two-factors TAE model (third-order…
Descriptors: Affective Measures, Teacher Characteristics, Emotional Experience, Test Construction
Sustekova, Erika; Kubiatko, Milan; Usak, Muhammet – EURASIA Journal of Mathematics, Science and Technology Education, 2019
Critical Thinking is a generally recognized educational ideal at all levels of the educational process. This study validated the critical thinking test on Slovak conditions. Data were collected from 50 respondents studying at university. Bachelor's and Master's students from all grades, aged 21 to 36 (x = 23.00; SD = 2.84) were represented. Model…
Descriptors: Critical Thinking, Tests, Test Validity, Test Reliability
Braun, Henry – British Journal of Educational Psychology, 2019
Background: There is unrealized potential in higher education for greater use of performance assessment, particularly in support of teaching and learning: Well-designed performance tasks can elicit evidence regarding what students know and can do with respect to complex learning objectives. At the same time, there is some pressure, at least in the…
Descriptors: Performance Based Assessment, Higher Education, Test Format, Standardized Tests
Isik, Serife; Üzbe Atalay, Nazife – Pegem Journal of Education and Instruction, 2019
The purpose of the current study is to develop the Adolescent Happiness Scale (AHS). A systematic approach was utilized for developing the scale. In this study, the data were collected from 1136 adolescents including 490 females and 646 males between 11-17 years of age. The psychometric properties of AHS were analyzed by means of item analysis,…
Descriptors: Adolescents, Psychological Patterns, Test Construction, Test Validity
Munoz, Albert; Mackay, Jonathon – Journal of University Teaching and Learning Practice, 2019
Online testing is a popular practice for tertiary educators, largely owing to efficiency in automation, scalability, and capability to add depth and breadth to subject offerings. As with all assessments, designs need to consider whether student cheating may be inadvertently made easier and more difficult to detect. Cheating can jeopardise the…
Descriptors: Cheating, Test Construction, Computer Assisted Testing, Classification
Niksadat, Negin; Rakhshanderou, Sakineh; Negarandeh, Reza; Ramezankhani, Ali; Vasheghani Farahani, Ali; Ghaffari, Mohtasham – American Journal of Health Education, 2019
Background: The existing literature supports the application of the principles of andragogy on patient education. But there is a lack of a suitable tool for assessing patient education's conformity with these principles. Purpose: This study was conducted to develop and evaluate the psychometric properties of a questionnaire that measures the…
Descriptors: Test Construction, Questionnaires, Psychometrics, Andragogy
Esendemir, Ozan; Bindak, Recep – International Journal of Educational Methodology, 2019
"Mathematical knowledge for teaching" is a concept indicating the requirement for a specific kind of knowledge required to teach mathematics. Mathematical knowledge for teaching necessitates a more complex structure than what is required to carry out mathematical tasks and the knowledge to do that. The purpose of this study is to realize…
Descriptors: Knowledge Base for Teaching, Mathematics Instruction, Knowledge Level, Geometry
Crawford, Angela R.; Johnson, Evelyn S.; Moylan, Laura A.; Zheng, Yuzhu – Assessment for Effective Intervention, 2019
This study describes the development and initial psychometric evaluation of the Recognizing Effective Special Education Teachers (RESET) observation instrument. The study uses generalizability theory to compare two versions of a rubric, one with general descriptors of performance levels and one with item-specific descriptors of performance levels,…
Descriptors: Teacher Evaluation, Special Education Teachers, Scoring Rubrics, Observation
Akkus, Adem – International Journal of Assessment Tools in Education, 2019
The aim of this study is to develop a science attitude scale (SAS). For that purpose, the literature review has been done for suggestions for creating scales and a new draft scale developed. The draft scale was analyzed by specialists and a pilot study is done after its approval by experts. The SAS is prepared with 21 items and among these, 11…
Descriptors: Attitude Measures, Student Attitudes, Test Construction, Scientific Attitudes

Peer reviewed
Direct link
