Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 7 |
| Since 2017 (last 10 years) | 14 |
| Since 2007 (last 20 years) | 33 |
Descriptor
| Test Content | 45 |
| Test Items | 45 |
| Test Validity | 45 |
| Test Construction | 30 |
| Test Reliability | 19 |
| Foreign Countries | 10 |
| Item Analysis | 9 |
| Statistical Analysis | 9 |
| Achievement Tests | 7 |
| Difficulty Level | 7 |
| Student Evaluation | 7 |
| More ▼ | |
Source
Author
| Winnick, Joseph P. | 3 |
| Short, Francis X. | 2 |
| Ackerman, Debra J. | 1 |
| Ahmed, S. | 1 |
| Alghazali, Tawfeeq | 1 |
| Anne Traynor | 1 |
| Baele, Judith | 1 |
| Ballester, Rex C. | 1 |
| Barghaus, Katherine M. | 1 |
| Baxter, G. P. | 1 |
| Bello, Samira Abdullahi | 1 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 2 |
| Administrators | 1 |
| Teachers | 1 |
Location
| Belgium | 1 |
| China | 1 |
| Delaware | 1 |
| Illinois | 1 |
| Japan | 1 |
| Malaysia | 1 |
| Maryland | 1 |
| Massachusetts | 1 |
| Netherlands | 1 |
| Nigeria | 1 |
| North Carolina | 1 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Anne Traynor; Sara C. Christopherson – Applied Measurement in Education, 2024
Combining methods from earlier content validity and more contemporary content alignment studies may allow a more complete evaluation of the meaning of test scores than if either set of methods is used on its own. This article distinguishes item relevance indices in the content validity literature from test representativeness indices in the…
Descriptors: Test Validity, Test Items, Achievement Tests, Test Construction
Sarah K. Cowan; Michael Hout; Stuart Perrett – Sociological Methods & Research, 2024
Long-running surveys need a systematic way to reflect social change and to keep items relevant to respondents, especially when they ask about controversial subjects, or they threaten the items' validity. We propose a protocol for updating measures that preserves content and construct validity. First, substantive experts articulate the current and…
Descriptors: Surveys, Public Opinion, Social Attitudes, Pregnancy
Do-Hong Kim; Chuang Wang; Thi Nhu Ngoc Truong – Language Teaching Research, 2024
Researchers and practitioners in the field of second language acquisition have come to realize the importance of non-cognitive skills such as self-efficacy and self-regulation in students' learning of a second language. However, there has been limited systematic research on such measures in the second language context and the validity and…
Descriptors: Psychometrics, Test Content, Self Efficacy, English Language Learners
Sara T. Cushing – ETS Research Report Series, 2025
This report provides an in-depth comparison of TOEFL iBT® and the Duolingo English Test (DET) in terms of the degree to which both tests assess academic language proficiency in listening, reading, writing, and speaking. The analysis is based on publicly available documentation on both tests, including sample test questions available on the test…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Academic Language
Russell, Michael; Moncaleano, Sebastian – Practical Assessment, Research & Evaluation, 2020
Although both content alignment and standard-setting procedures rely on content-expert panel judgements, only the latter employs discussion among panel members. This study employed a modified form of the Webb methodology to examine content alignment for twelve tests administered as part of the Massachusetts Comprehensive Assessment System (MCAS).…
Descriptors: Test Content, Test Items, Discussion, Test Validity
Crystal Uminski – ProQuest LLC, 2023
The landscape of undergraduate biology education has been shaped by decades of reform efforts calling for instruction to integrate core concepts and scientific skills as a means of helping students become proficient in the discipline. Assessments can be used to make inferences about how these reform efforts have translated into changes in…
Descriptors: Undergraduate Students, Biology, Science Instruction, Science Tests
Parry, James R. – Online Submission, 2020
This paper presents research and provides a method to ensure that parallel assessments, that are generated from a large test-item database, maintain equitable difficulty and content coverage each time the assessment is presented. To maintain fairness and validity it is important that all instances of an assessment, that is intended to test the…
Descriptors: Culture Fair Tests, Difficulty Level, Test Items, Test Validity
Mohammed Ambusaidi – ProQuest LLC, 2022
There is an increased demand on nursing faculty to provide quality teaching and assessment. Nursing faculty are required to ensure accurate assessment of learning through testing and outcome measurement that are critical elements of the evaluation process. Likewise, nursing faculty should implement a logical evaluation system. However, the…
Descriptors: Nursing Education, College Faculty, Test Construction, Test Validity
Mohammed, Aisha; Dawood, Abdul Kareem Shareef; Alghazali, Tawfeeq; Kadhim, Qasim Khlaif; Sabti, Ahmed Abdulateef; Sabit, Shaker Holh – International Journal of Language Testing, 2023
Cognitive diagnostic models (CDMs) have received much interest within the field of language testing over the last decade due to their great potential to provide diagnostic feedback to all stakeholders and ultimately improve language teaching and learning. A large number of studies have demonstrated the application of CDMs on advanced large-scale…
Descriptors: Reading Comprehension, Reading Tests, Language Tests, English (Second Language)
Butz, Amanda R.; Branchaw, Janet L. – CBE - Life Sciences Education, 2020
Expanding the scope of previous undergraduate research assessment tools, the "Entering Research" Learning Assessment (ERLA) measures undergraduate and graduate research trainee learning gains in the seven areas of trainee development in the evidence-based "Entering Research" conceptual framework: Research Comprehension and…
Descriptors: Undergraduate Students, Graduate Students, College Students, Student Research
Zhang, Xinxin; Gierl, Mark – Journal of Educational Issues, 2016
The purpose of this study is to describe a methodology to recover the item model used to generate multiple-choice test items with a novel graph theory approach. Beginning with the generated test items and working backward to recover the original item model provides a model-based method for validating the content used to automatically generate test…
Descriptors: Test Items, Automation, Content Validity, Test Validity
Shanmugam, S. Kanageswari Suppiah; Veloo, Arsaythamby; Md-Ali, Ruzlan – Diaspora, Indigenous, and Minority Education, 2021
This study examined the validity of trilingual test as a test accommodation to assess the Indigenous pupils' mathematical performance in Malaysia. The study employed two tests; BM-only test with items written in Malay language (BM) and trilingual test, which had items written in BM and English, and oral audio recording in their native Temiar…
Descriptors: Multilingualism, Testing Accommodations, Grade 5, Elementary School Students
Walstad, William B.; Rebeck, Ken – Journal of Economic Education, 2017
The "Test of Financial Literacy" (TFL) was created to measure the financial knowledge of high school students. Its content is based on the standards and benchmarks stated in the "National Standards for Financial Literacy" (Council for Economic Education 2013). The test development process involved extensive item writing and…
Descriptors: Tests, Money Management, Literacy, High School Students
Zou, Min; Wu, Wenxin – English Language Teaching, 2015
Since its first pilot study was launched in 2003, China Accreditation Test for Translators and Interpreters (CATTI) has developed into the most authoritative translation and interpretation proficiency qualification accreditation test in China and played an important role in assessing and cultivating translators and interpreters. Based on the…
Descriptors: Foreign Countries, Translation, Test Validity, Test Reliability
Gierl, Mark J.; Lai, Hollis – Educational Measurement: Issues and Practice, 2016
Testing organization needs large numbers of high-quality items due to the proliferation of alternative test administration methods and modern test designs. But the current demand for items far exceeds the supply. Test items, as they are currently written, evoke a process that is both time-consuming and expensive because each item is written,…
Descriptors: Test Items, Test Construction, Psychometrics, Models

Peer reviewed
Direct link
