Publication Date
| In 2026 | 0 |
| Since 2025 | 52 |
| Since 2022 (last 5 years) | 194 |
| Since 2017 (last 10 years) | 494 |
| Since 2007 (last 20 years) | 742 |
Descriptor
| Test Items | 1186 |
| Test Reliability | 1186 |
| Test Validity | 684 |
| Test Construction | 565 |
| Foreign Countries | 348 |
| Difficulty Level | 279 |
| Item Analysis | 252 |
| Psychometrics | 233 |
| Item Response Theory | 219 |
| Factor Analysis | 183 |
| Multiple Choice Tests | 172 |
| More ▼ | |
Source
Author
| Schoen, Robert C. | 12 |
| LaVenia, Mark | 5 |
| Liu, Ou Lydia | 5 |
| Anderson, Daniel | 4 |
| Bauduin, Charity | 4 |
| DiLuzio, Geneva J. | 4 |
| Farina, Kristy | 4 |
| Haladyna, Thomas M. | 4 |
| Huck, Schuyler W. | 4 |
| Petscher, Yaacov | 4 |
| Stansfield, Charles W. | 4 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 39 |
| Researchers | 30 |
| Teachers | 24 |
| Administrators | 13 |
| Support Staff | 3 |
| Counselors | 2 |
| Students | 2 |
| Community | 1 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 68 |
| Indonesia | 37 |
| Germany | 20 |
| Canada | 17 |
| Florida | 17 |
| China | 16 |
| Australia | 15 |
| California | 12 |
| Iran | 11 |
| India | 10 |
| New York | 9 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Cook, Ryan M.; Fye, Heather J.; Wind, Stefanie A. – Measurement and Evaluation in Counseling and Development, 2021
We examined the psychometric properties of the Counselor Burnout Inventory (CBI) with 560 early career, post-master's counselors. We tested the dimensional structure of the CBI, item ordering, and the function of the rating scale using item response theory. Implications of the findings for researchers, counselors, and counselor educators are…
Descriptors: Counselors, Burnout, Item Response Theory, Entry Workers
Kevin Ackermans; Marjoke Bakker; Pierre Gorissen; Anne-Marieke Loon; Marijke Kral; Gino Camp – Journal of Computer Assisted Learning, 2024
Background: A practical test that measures the information and communication technology (ICT) skills students need for effectively using ICT in primary education has yet to be developed (Oh et al., 2021). This paper reports on the development, validation, and reliability of a test measuring primary school students' ICT skills required for…
Descriptors: Test Construction, Test Validity, Measures (Individuals), Elementary School Students
Chen, Mo; Nah, Yong-Hwee; Waschl, Nicolette; Poon, Kenneth; Chen, Ping – Journal of Psychoeducational Assessment, 2022
Culturally bounded in nature, adaptive behavior is the degree to which a person meets the requirements of personal independence and social responsibilities. This study aimed to develop a computerized adaptive test (CAT) of a culturally appropriate adaptive behavior measure (i.e., the Activities and Participation Rating Scale [APRS]) in the…
Descriptors: Computer Assisted Testing, Cultural Relevance, Test Construction, Test Items
Liu, Vivienne Yi-Yu; Lim, Sok Mui – Asia Pacific Journal of Education, 2022
Although the Brief Resilience Scale (BRS) has been extensively adapted worldwide, work on the generalizability of the original English BRS to Asian populations remains limited. This research evaluated the psychometric properties of the English BRS through two studies with Singaporean undergraduate freshmen (Study 1 n = 839; Study 2 n = 1,068)…
Descriptors: Foreign Countries, Psychometrics, College Freshmen, Resilience (Psychology)
Leo, Francisco M.; Fernández-Río, Javier; Pulido, Juan J.; Rodríguez-González, Pablo; López-Gajardo, Miguel A. – Social Psychology of Education: An International Journal, 2023
The aim of this study was to develop and validate a psychometrically-sound instrument to assess students' perceptions about class cohesion. Two studies were conducted. In Study 1, four steps were established: (1) development of the Class Cohesion Questionnaire (CCQ); (2) item selection; (3) item compression; and (4) exploration of psychometric…
Descriptors: Classroom Environment, Group Unity, Elementary School Students, Secondary School Students
Salim Nabhan; Anita Habók – SAGE Open, 2025
As the integration of digital technologies continues to shape academic landscapes, assessing digital literacy in the context of academic writing becomes paramount. Several instruments and frameworks are available for measuring digital literacy and examining it from different perspectives; however, none are suitable for measuring the digital…
Descriptors: Digital Literacy, Academic Language, Writing (Composition), Measures (Individuals)
Gurdil Ege, Hatice; Demir, Ergul – Eurasian Journal of Educational Research, 2020
Purpose: The present study aims to evaluate how the reliabilities computed using a, Stratified a, Angoff-Feldt, and Feldt-Raju estimators may differ when sample size (500, 1000, and 2000) and item type ratio of dichotomous to polytomous items (2:1; 1:1, 1:2) included in the scale are varied. Research Methods: In this study, Cronbach's a,…
Descriptors: Test Format, Simulation, Test Reliability, Sample Size
Deborah Rivas-Drake; Jozet Channey; Gina McGovern; Bernardette J. Pinetta – AERA Open, 2024
This article delineates the development of a measure to assess teachers' reported engagement in practices that center on issues of racial equity as part of their SEL instruction. An iterative mixed-method approach included theoretical grounding, literature reviews, content expert evaluation, focus groups, cognitive interviews, and multiple survey…
Descriptors: Equal Education, Social Emotional Learning, Measures (Individuals), Test Construction
Al-zboon, Habis Saad; Alrekebat, Amjad Farhan – International Journal of Higher Education, 2021
This study aims at identifying the effect of multiple-choice test items' difficulty degree on the reliability coefficient and the standard error of measurement depending on the item response theory IRT. To achieve the objectives of the study, (WinGen3) software was used to generate the IRT parameters (difficulty, discrimination, guessing) for four…
Descriptors: Multiple Choice Tests, Test Items, Difficulty Level, Error of Measurement
Deniz, Kaan Zulfikar; Ilican, Emel – International Journal of Assessment Tools in Education, 2021
This study aims to compare the G and Phi coefficients as estimated by D studies for a measurement tool with the G and Phi coefficients obtained from real cases in which items of differing difficulty levels were added and also to determine the conditions under which the D studies estimated reliability coefficients closer to reality. The study group…
Descriptors: Generalizability Theory, Test Items, Difficulty Level, Test Reliability
Moore, C. Missy; Foxx, Sejal Parikh – Measurement and Evaluation in Counseling and Development, 2021
Using a multiphased and mixed method approach to instrument development, the ISS-C was composed of 16 items extracted into four factors, which demonstrated adequate model fit. The ISS-C scores showed good internal consistency, convergent validity, and temporal stability, while some aspects of discriminant validity remain suboptimal.
Descriptors: Affective Measures, Anxiety, Counselors, Test Validity
Campbell, Todd; Lee, Hyunju; Longhurst, Max; McKenna, Thomas J.; Coster, Daniel; Lundgren, Lisa – School Science and Mathematics, 2021
In the United States and internationally, there has been an increased emphasis on the practice turn or a focus on engaging students in more authentic representations of how science is practiced. In this article, we describe the development of a student questionnaire to investigate the extent to which students report being engaged in learning…
Descriptors: Science Education, Student Experience, Questionnaires, Test Construction
Glamocic, Džana Salibašic; Mešic, Vanes; Neumann, Knut; Sušac, Ana; Boone, William J.; Aviani, Ivica; Hasovic, Elvedin; Erceg, Nataša; Repnik, Robert; Grubelnik, Vladimir – Physical Review Physics Education Research, 2021
Item banks are generally considered the basis of a new generation of educational measurement. In combination with specialized software, they can facilitate the computerized assembling of multiple pre-equated test forms. However, for advantages of item banks to become fully realized it is important that the item banks store a relatively large…
Descriptors: Item Banks, Test Items, Item Response Theory, Item Sampling
Kathryn Lynn Black – ProQuest LLC, 2021
In the process of instrument development, developers follow protocols and conduct analyses to ensure psychometric properties are met. During development, developers determine an assessment format that best aligns to their desired construct, including direct or indirect assessment. Within indirect assessment, participants hold a non-active role in…
Descriptors: Measures (Individuals), Early Childhood Education, Factor Structure, Factor Analysis
Meyer, J. Patrick; Hu, Ann; Li, Sylvia – NWEA, 2023
The Content Proximity Project was designed to improve the content validity of the MAP® Growth™ assessments while retaining the ability for the test to adapt off-grade and meet students wherever they are in their learning. Two main features of the project were the development of an enhanced item selection algorithm, and a spring pilot study…
Descriptors: Achievement Tests, Mathematics Achievement, Content Validity, Mathematics Tests

Peer reviewed
Direct link
