Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 5 |
| Since 2007 (last 20 years) | 7 |
Descriptor
| Difficulty Level | 10 |
| Interrater Reliability | 10 |
| Test Validity | 10 |
| Foreign Countries | 5 |
| Test Construction | 5 |
| Test Items | 5 |
| Test Reliability | 5 |
| English (Second Language) | 4 |
| Scores | 4 |
| Computer Assisted Testing | 3 |
| Language Tests | 3 |
| More ▼ | |
Source
Author
| Bennett, Randy Elliot | 1 |
| Buchheim, Anna | 1 |
| Chan, Yi-Chih | 1 |
| Crosson, Amy | 1 |
| Derek N. Canning | 1 |
| Doering, Stephan | 1 |
| Edward Paul Getman | 1 |
| Fischer-Kern, Melitta | 1 |
| Horz, Susanne | 1 |
| Hung, Yu-Chen | 1 |
| Joseph P. Vitta | 1 |
| More ▼ | |
Publication Type
| Journal Articles | 6 |
| Reports - Research | 5 |
| Dissertations/Theses -… | 2 |
| Reports - Evaluative | 2 |
| Tests/Questionnaires | 2 |
| Reports - Descriptive | 1 |
Education Level
| Elementary Education | 3 |
| Higher Education | 2 |
| Postsecondary Education | 2 |
| Grade 9 | 1 |
Audience
Location
| Japan | 2 |
| Louisiana | 1 |
| Sweden | 1 |
| Taiwan (Taipei) | 1 |
Laws, Policies, & Programs
| Pell Grant Program | 1 |
Assessments and Surveys
| Test of English as a Foreign… | 2 |
| ACT Assessment | 1 |
| Adult Attachment Interview | 1 |
| Test of English for… | 1 |
What Works Clearinghouse Rating
Derek N. Canning; Stuart McLean; Joseph P. Vitta – Vocabulary Learning and Instruction, 2022
The substantive component of construct validity requires a confrontation between empirical test results and content relevance. The Vocabulary Size Test (VST) has been extensively validated in terms of empirical results. Less is known, however, about expert judgments of content relevance. The VST was constructed and validated according to the…
Descriptors: Foreign Countries, Undergraduate Students, College Faculty, Vocabulary Skills
Hung, Yu-Chen; Chan, Yi-Chih – Deafness & Education International, 2020
Unlike their peers with typical hearing, reading and speech challenges observed among children with hearing loss may not only be caused by developmental issues but also hearing-related problems. Although conventional oral reading assessments are useful for identifying children at risk of reading difficulties, they do not help examiners identify…
Descriptors: Test Construction, Test Validity, Oral Reading, Reading Tests
Smolinsky, Lawrence; Marx, Brian D.; Olafsson, Gestur; Ma, Yanxia A. – Journal of Educational Computing Research, 2020
Computer-based testing is an expanding use of technology offering advantages to teachers and students. We studied Calculus II classes for science, technology, engineering, and mathematics majors using different testing modes. Three sections with 324 students employed: paper-and-pencil testing, computer-based testing, and both. Computer tests gave…
Descriptors: Test Format, Computer Assisted Testing, Paper (Material), Calculus
Tengberg, Michael – Language Assessment Quarterly, 2018
Reading comprehension is often treated as a multidimensional construct. In many reading tests, items are distributed over reading process categories to represent the subskills expected to constitute comprehension. This study explores (a) the extent to which specified subskills of reading comprehension tests are conceptually conceivable to…
Descriptors: Reading Tests, Reading Comprehension, Scores, Test Results
Edward Paul Getman – Online Submission, 2020
Despite calls for engaging assessments targeting young language learners (YLLs) between 8 and 13 years old, what makes assessment tasks engaging and how such task characteristics affect measurement quality have not been well studied empirically. Furthermore, there has been a dearth of validity research about technology-enhanced speaking tests for…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Learner Engagement
Taubner, Svenja; Horz, Susanne; Fischer-Kern, Melitta; Doering, Stephan; Buchheim, Anna; Zimmermann, Johannes – Psychological Assessment, 2013
The Reflective Functioning Scale (RFS) was developed to assess individual differences in the ability to mentalize attachment relationships. The RFS assesses mentalization from transcripts of the Adult Attachment Interview (AAI). A global score is given by trained coders on an 11-point scale ranging from antireflective to exceptionally reflective.…
Descriptors: Measures (Individuals), Attachment Behavior, Individual Differences, Adults
Lim, Gad S. – ProQuest LLC, 2009
Performance assessments have become the norm for evaluating language learners' writing abilities in international examinations of English proficiency. Two aspects of these assessments are usually systematically varied: test takers respond to different prompts, and their responses are read by different raters. This raises the possibility of undue…
Descriptors: Performance Based Assessment, Language Tests, Performance Tests, Test Validity
Bennett, Randy Elliot; Rock, Donald A. – 1993
Formulating-Hypotheses (F-H) items present a situation and ask the examinee to generate as many explanations for it as possible. This study examined the generalizability, validity, and examinee perceptions of a computer-delivered version of the task. Eight F-H questions were administered to 192 graduate students. Half of the items restricted…
Descriptors: Computer Assisted Testing, Difficulty Level, Generalizability Theory, Graduate Students
Strong, Gregory – Thought Currents in English Literature, 1995
This paper traces developments in educational psychology and measurement that led to the Test of English as a Foreign Language (TOEFL) and the test of English for International Communication (TOEIC) and the application of educational measurement terms such as validity and reliability to testing. Use of a table of specifications for planning…
Descriptors: Cloze Procedure, Difficulty Level, English (Second Language), Foreign Countries
Matsumura, Lindsay Clare; Slater, Sharon Cadman; Wolf, Mikyung Kim; Crosson, Amy; Levison, Allison; Peterson, Maureen; Resnick, Lauren; Junker, Brian – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2006
This study presents preliminary findings from research developing an instructional quality assessment (IQA) toolkit that could be used to monitor the influence of reform initiatives on students' learning environments and to guide professional development efforts within a school or district. This report focuses specifically on the portion of the…
Descriptors: Reading Comprehension, Reading Assignments, Reading Instruction, Instructional Effectiveness

Peer reviewed
Direct link
