Publication Date
| In 2026 | 0 |
| Since 2025 | 52 |
| Since 2022 (last 5 years) | 194 |
| Since 2017 (last 10 years) | 494 |
| Since 2007 (last 20 years) | 742 |
Descriptor
| Test Items | 1186 |
| Test Reliability | 1186 |
| Test Validity | 684 |
| Test Construction | 565 |
| Foreign Countries | 348 |
| Difficulty Level | 279 |
| Item Analysis | 252 |
| Psychometrics | 233 |
| Item Response Theory | 219 |
| Factor Analysis | 183 |
| Multiple Choice Tests | 172 |
Author
| Schoen, Robert C. | 12 |
| LaVenia, Mark | 5 |
| Liu, Ou Lydia | 5 |
| Anderson, Daniel | 4 |
| Bauduin, Charity | 4 |
| DiLuzio, Geneva J. | 4 |
| Farina, Kristy | 4 |
| Haladyna, Thomas M. | 4 |
| Huck, Schuyler W. | 4 |
| Petscher, Yaacov | 4 |
| Stansfield, Charles W. | 4 |
Audience
| Practitioners | 39 |
| Researchers | 30 |
| Teachers | 24 |
| Administrators | 13 |
| Support Staff | 3 |
| Counselors | 2 |
| Students | 2 |
| Community | 1 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 68 |
| Indonesia | 37 |
| Germany | 20 |
| Canada | 17 |
| Florida | 17 |
| China | 16 |
| Australia | 15 |
| California | 12 |
| Iran | 11 |
| India | 10 |
| New York | 9 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Semiun, Thresia Trivict; Luruk, Fransiska Densiana – English Language Teaching Educational Journal, 2020
This study aimed at examining the quality of an English summative test of grade VII in a public school located in Kupang. Particularly, this study examined content validity and reliability, and conducted item analysis including item validity, item difficulty, item discrimination, and distracter effectiveness. This study was descriptive evaluative…
Descriptors: Summative Evaluation, Language Tests, English (Second Language), Content Validity
Laliyo, Lukman Abdul Rauf; Tangio, Julhim S.; Sumintono, Bambang; Jahja, Mohamad; Panigoro, Citra – Journal of Baltic Science Education, 2020
This research aimed to evaluate the students' conceptual understanding and to diagnose the students' preconceptions in elaborating the particle characteristics of matter by developing a diagnostic instrument and applying a Rasch model response pattern analysis approach. Data were acquired by 25 multiple-choice written test items distributed to 987…
Descriptors: Physics, Science Instruction, Scientific Concepts, Science Tests
Butz, Amanda R.; Branchaw, Janet L. – CBE - Life Sciences Education, 2020
Expanding the scope of previous undergraduate research assessment tools, the "Entering Research" Learning Assessment (ERLA) measures undergraduate and graduate research trainee learning gains in the seven areas of trainee development in the evidence-based "Entering Research" conceptual framework: Research Comprehension and…
Descriptors: Undergraduate Students, Graduate Students, College Students, Student Research
Smith, William Zachary; Dickenson, Tammiee S.; Rogers, Bradley David – AERA Online Paper Repository, 2017
Questionnaire refinement and a process for selecting items for elimination are important tools for survey developers. One of the major obstacles in questionnaire refinement and elimination in surveys lies in one's ability to adequately and appropriately reconstruct a survey. Often, surveys can be long and strenuous on the respondent,…
Descriptors: Surveys, Psychometrics, Test Construction, Test Reliability
Ben Seipel; Sarah E. Carlson; Virginia Clinton-Lisell; Mark L. Davison; Patrick C. Kennedy – Grantee Submission, 2022
Originally designed for students in Grades 3 through 5, MOCCA (formerly the Multiple-choice Online Causal Comprehension Assessment) identifies students who struggle with comprehension and helps uncover why they struggle. There are many reasons why students might not comprehend what they read. They may struggle with decoding, or reading words…
Descriptors: Multiple Choice Tests, Computer Assisted Testing, Diagnostic Tests, Reading Tests
Andersson, Björn; Xin, Tao – Educational and Psychological Measurement, 2018
In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…
Descriptors: Item Response Theory, Test Reliability, Test Items, Scores
Sharma, Ekta; Sharma, Sandeep – International Journal of Educational Management, 2018
Purpose: Today, innovation and creativity are the buzz words in the galore of not only business but also of education. The need to foster creativity and innovation has long been a priority in the educational and corporate spheres. The purpose of this paper is to propose the scale for the measurement of teacher's creativity nurturing behaviour.…
Descriptors: Test Construction, Creativity, Educational Innovation, Creative Teaching
Yun, Young Ho; Kim, Yaeji; Sim, Jin A.; Choi, Soo Hyuk; Lim, Cheolil; Kang, Joon-ho – Journal of School Health, 2018
Background: The objective of this study was to develop the School Health Score Card (SHSC) and validate its psychometric properties. Methods: The development of the SHSC questionnaire included 3 phases: item generation, construction of domains and items, and field testing with validation. To assess the instrument's reliability and validity, we…
Descriptors: School Health Services, Psychometrics, Test Construction, Test Validity
Atalmis, Erkan Hasan – International Journal of Assessment Tools in Education, 2018
Although multiple-choice items (MCIs) are widely used for classroom assessment, designing MCIs with a sufficient number of plausible distracters is very challenging for teachers. In this regard, previous empirical studies reveal that using three-option MCIs provides various advantages when compared to four-option MCIs due to less preparation and…
Descriptors: Multiple Choice Tests, Test Items, Difficulty Level, Test Reliability
Sridhanyarat, Kietnawin; Pathong, Supakarn; Suranakkharin, Todsapon; Ammaralikit, Amornrat – English Language Teaching, 2021
This study aimed at developing the Silpakorn Test of English Proficiency (STEP), in alignment with the Common European Framework of Reference for Languages (CEFR), and in accordance with the theoretical framework established by Alderson et al. (2006). Four major steps were involved in the test construction. First, English language lecturers who…
Descriptors: Language Tests, Language Proficiency, Second Language Learning, Second Language Instruction
Garcia, Allen G.; Lambert, Matthew C.; Epstein, Michael H.; Cullinan, Douglas – School Mental Health, 2019
The present study examined the measurement properties of the Emotional and Behavioral Screener (EBS), a universal screening instrument which identifies students presenting with emotional and behavioral problems. The primary research questions sought to examine the degree to which the EBS item responses fit the Rasch model through evaluating fit of…
Descriptors: Screening Tests, Identification, Behavior Rating Scales, Emotional Disturbances
Yuksel, Ibrahim; Savas, Muhammed Ali – Asian Journal of Education and Training, 2019
This research aimed to develop a valid and reliable test to determine the levels of drawing a shape-schema and making a table among prospective teachers in the Mathematics and Science Education, Turkish and Social Sciences Education, and Basic Education Departments. In this process, a comprehensive item pool has been prepared with the table of…
Descriptors: Preservice Teachers, Item Banks, Test Validity, Foreign Countries
Sahin-Topalcengiz, Emine; Yildirim, Bekir – Journal of Education in Science, Environment and Health, 2019
The purpose of this study was to adapt the Elementary Teachers Efficacy and Attitudes towards STEM Survey (ET-STEM scale; Friday Institute for Educational Innovation, 2012) into Turkish and test the validity and reliability of the instrument. ET-STEM was administered to 313 elementary teachers from different provinces of Turkey. Exploratory and…
Descriptors: Foreign Countries, Likert Scales, Test Construction, Test Validity
Suprapto, Edy; Saryanto; Sumiharsono, Rudy; Ramadhan, Syahrul – Journal of Turkish Science Education, 2020
This research aims to produce a feasible and valid assessment instrument of Higher Order Thinking Skill (HOTS) to measure students' Higher Order Thinking Skill in Physics learning. The type of this research was research and development, adapted from the development model of Borg and Gall. The researchers modified Borg and Gall's development model as…
Descriptors: Measures (Individuals), Thinking Skills, Critical Thinking, Problem Solving
Kane, Michael T. – Assessment in Education: Principles, Policy & Practice, 2017
In response to an argument by Baird, Andrich, Hopfenbeck and Stobart (2017), Michael Kane states that there needs to be a better fit between educational assessment and learning theory. In line with this goal, Kane will examine how psychometric constraints might be loosened by relaxing some psychometric "rules" in some assessment…
Descriptors: Educational Assessment, Psychometrics, Standards, Test Reliability

