Publication Date
| In 2026 | 0 |
| Since 2025 | 52 |
| Since 2022 (last 5 years) | 194 |
| Since 2017 (last 10 years) | 494 |
| Since 2007 (last 20 years) | 742 |
Descriptor
| Test Items | 1186 |
| Test Reliability | 1186 |
| Test Validity | 684 |
| Test Construction | 565 |
| Foreign Countries | 348 |
| Difficulty Level | 279 |
| Item Analysis | 252 |
| Psychometrics | 233 |
| Item Response Theory | 219 |
| Factor Analysis | 183 |
| Multiple Choice Tests | 172 |
| More ▼ | |
Source
Author
| Schoen, Robert C. | 12 |
| LaVenia, Mark | 5 |
| Liu, Ou Lydia | 5 |
| Anderson, Daniel | 4 |
| Bauduin, Charity | 4 |
| DiLuzio, Geneva J. | 4 |
| Farina, Kristy | 4 |
| Haladyna, Thomas M. | 4 |
| Huck, Schuyler W. | 4 |
| Petscher, Yaacov | 4 |
| Stansfield, Charles W. | 4 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 39 |
| Researchers | 30 |
| Teachers | 24 |
| Administrators | 13 |
| Support Staff | 3 |
| Counselors | 2 |
| Students | 2 |
| Community | 1 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 68 |
| Indonesia | 37 |
| Germany | 20 |
| Canada | 17 |
| Florida | 17 |
| China | 16 |
| Australia | 15 |
| California | 12 |
| Iran | 11 |
| India | 10 |
| New York | 9 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
Thompson, Denisse R.; Senk, Sharon L. – Investigations in Mathematics Learning, 2017
Validity evidence based on content is critical for making inferences about examinees' responses to test items. Traditionally, content validity has been established by specifying the content domain of an instrument, through reviews by subject-matter experts, through alignment studies, or by reporting measures of internal consistency, such as…
Descriptors: Item Response Theory, Content Validity, Secondary School Mathematics, Test Items
Polat, Murat – Novitas-ROYAL (Research on Youth and Language), 2020
Classroom practices, materials and teaching methods in language classes have changed a lot in the last decades and continue to evolve; however, the commonly used techniques to test students' foreign language skills have not changed much regardless of the recent awareness in Bloom's taxonomy. Testing units at schools rely mostly on multiple choice…
Descriptors: Multiple Choice Tests, Test Format, Test Items, Difficulty Level
Georgiou, Yiannis; Kyza, Eleni A. – Journal of Psychoeducational Assessment, 2018
The purpose of the present study was to adapt and validate the Need for Cognition Scale--Short Form (NfC-SF) in the Greek language. A multistep process was followed, including (a) the translation and adaptation of the questionnaire, (b) a reliability analysis of the instrument's items in combination with an exploratory factor analysis with 177…
Descriptors: Greek, Test Validity, Translation, Media Adaptation
Ning, Hoi Kwan – Measurement and Evaluation in Counseling and Development, 2018
The psychometric properties of the 2 versions of the Junior Metacognitive Awareness Inventory were examined with Singapore student samples. Other than 2 misfitting items and an underutilized response scale, Rasch analysis demonstrated that the instruments have good measurement precision, and no differential item functioning was detected across…
Descriptors: Foreign Countries, Metacognition, Measures (Individuals), Item Response Theory
Lang, W. Steve; Moore, LaSonya; Wilkerson, Judy R.; Parfitt, Christopher M.; Greene, Jackie; Kratt, Diane; Martelli, C. Dawn; LaPaglia, Kyle; Johnston, Vickie; Gilbert, Shelby; Zhang, Jason; Fields, Lynette – Online Submission, 2018
A team of researchers at two institutions revised and analyzed a battery of instruments to assess the Critical Dispositions (InTASC, 2013) required in the CAEP (2016a) accreditation standards for teacher education programs. This research presents initial findings for the revised version updating previous results from validity and reliability…
Descriptors: Measures (Individuals), Test Construction, Construct Validity, Teacher Characteristics
Eristi, Bahadir; Erdem, Cahit – Contemporary Educational Technology, 2017
This study aims to develop a reliable and valid scale to identify the levels of media users' media literacy skills. The scale development process was carried out in nine steps as recommended in the literature. Before the scale was administered, the items were reviewed by field experts and language experts and a pilot study was carried out.…
Descriptors: Foreign Countries, Media Literacy, Likert Scales, Test Construction
Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017
Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…
Descriptors: Test Bias, Test Reliability, Performance, Scores
Dray, Amy J.; Brown, Nathaniel J. S.; Diakow, Ronli; Lee, Yongsang; Wilson, Mark R. – Reading Psychology, 2019
This article describes the development of an assessment system for adolescent reading comprehension. It presents a research context--a reading intervention implemented in middle and high schools in an urban district--which became the impetus for the work. The article outlines the organizational principles of construct modeling that guided the…
Descriptors: Reading Comprehension, Adolescents, Test Construction, Intervention
Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018
Testing in educational system perform a number of functions, the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that, testing is an important element of education. To effectively utilize the tests in educational policies and quality assurance its validity and…
Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making
Bijsterbosch, Erik – Geographical Education, 2018
Geography teachers' school-based (internal) examinations in pre-vocational geography education in the Netherlands appear to be in line with the findings in the literature, namely that teachers' assessment practices tend to focus on the recall of knowledge. These practices are strongly influenced by national (external) examinations. This paper…
Descriptors: Foreign Countries, Instructional Effectiveness, National Competency Tests, Geography Instruction
Mikeska, Jamie N.; Kurzum, Christopher; Steinberg, Jonathan H.; Xu, Jun – ETS Research Report Series, 2018
The purpose of this report is to examine the performance of assessment items designed to measure elementary teachers' content knowledge for teaching (CKT) science as part of the ETS® Educator Series. The Elementary Education: CKT Science assessment is 1 component of licensure examination through the PRAXIS® assessments. The Elementary Education:…
Descriptors: Elementary School Teachers, Pedagogical Content Knowledge, Elementary School Science, Preservice Teachers
Bagriacik Yilmaz, Ayse; Karatas, Serçin – Interactive Learning Environments, 2018
The aim of this study was to develop a measurement instrument which is compatible with literature, of which validity and reliability are proved with the aim of determining interaction perceived by learners in online learning environments. Accordingly, literature review was made, and outline form of the scale was formed with item pool by taking 14…
Descriptors: Foreign Countries, College Students, Likert Scales, Computer Mediated Communication
Zembat, Rengin; Turasli, Nalan Kuru; Güven, Gülçin; Sezer, Türker; Aksin, Ezgi; Yilmaz, Elif; Bayindir, Dilan – Journal of Education and Training Studies, 2016
The aim of this study is to investigate the reliability and validity of the DeMoulin Self-Concept Developmental Scale for 36-72 month old children. In addition, it has been attempted to examine the effects of age and gender variables on the self-concept of children. The study is in survey method. The sample consists of 810 children who attend…
Descriptors: Test Validity, Test Reliability, Self Concept Measures, Age Differences
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick; Schmitt, Neal – ETS Research Report Series, 2016
In this report, systematic applications of statistical and psychometric methods are used to develop and evaluate scoring rules in terms of test reliability. Data collected from a situational judgment test are used to facilitate the comparison. For a well-developed item with appropriate keys (i.e., the correct answers), agreement among various…
Descriptors: Scoring, Test Reliability, Statistical Analysis, Psychometrics
Olpak, Yusuf Ziya; Kiliç Çakmak, Ebru – Online Learning, 2018
The aim of this study was to describe the validity and reliability of a Turkish language version of the CoI survey developed by Arbaugh et al. (2008). Data were obtained from 1150 students enrolled in online courses in various departments in three Turkish state universities. The data were randomly divided into two parts: the first part was…
Descriptors: Foreign Countries, Test Reliability, Test Validity, Student Surveys

Peer reviewed
Direct link
