Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 12 |
| Since 2017 (last 10 years) | 23 |
| Since 2007 (last 20 years) | 37 |
Descriptor
| Guidelines | 60 |
| Test Format | 60 |
| Test Construction | 19 |
| Language Tests | 18 |
| Second Language Learning | 18 |
| Test Items | 18 |
| Foreign Countries | 16 |
| Student Evaluation | 14 |
| Computer Assisted Testing | 13 |
| Test Validity | 13 |
| Comparative Analysis | 12 |
Author
| Hambleton, Ronald K. | 3 |
| Gyeonggeon Lee | 2 |
| Min Li | 2 |
| Xiaoming Zhai | 2 |
| Xiaoxiao Liu | 2 |
| Yizhu Gao | 2 |
| Agustinus Hardi Prasetyo | 1 |
| Al Habbash, Maha | 1 |
| Al Mohammedi, Najah | 1 |
| Al Othali, Safa | 1 |
| Algina, James | 1 |
Education Level
| Higher Education | 8 |
| Postsecondary Education | 7 |
| High Schools | 6 |
| Elementary Education | 5 |
| Secondary Education | 5 |
| Elementary Secondary Education | 4 |
| Grade 12 | 4 |
| Grade 4 | 4 |
| Grade 8 | 4 |
| Intermediate Grades | 4 |
| Junior High Schools | 4 |
Audience
| Practitioners | 11 |
| Teachers | 5 |
| Administrators | 4 |
| Community | 1 |
| Policymakers | 1 |
Location
| Europe | 4 |
| Georgia | 2 |
| Arizona | 1 |
| California | 1 |
| China | 1 |
| Connecticut | 1 |
| Czech Republic | 1 |
| European Union | 1 |
| France | 1 |
| Hungary | 1 |
| Indiana | 1 |
Laws, Policies, & Programs
| Job Training Partnership Act… | 1 |
| No Child Left Behind Act 2001 | 1 |
Choe, Edison M.; Han, Kyung T. – Journal of Educational Measurement, 2022
In operational testing, item response theory (IRT) models for dichotomous responses are popular for measuring a single latent construct [theta], such as cognitive ability in a content domain. Estimates of [theta], also called IRT scores or [theta hat], can be computed using estimators based on the likelihood function, such as maximum likelihood…
Descriptors: Scores, Item Response Theory, Test Items, Test Format
Giofrè, D.; Allen, K.; Toffalini, E.; Caviola, S. – Educational Psychology Review, 2022
This meta-analysis reviews 79 studies (N = 46,605) that examined the existence of gender difference on intelligence in school-aged children. To do so, we limited the literature search to works that assessed the construct of intelligence through the Wechsler Intelligence Scales for Children (WISC) batteries, evaluating eventual gender differences…
Descriptors: Gender Differences, Cognitive Processes, Children, Intelligence Tests
Huang, Hung-Yu – Educational and Psychological Measurement, 2023
The forced-choice (FC) item formats used for noncognitive tests typically develop a set of response options that measure different traits and instruct respondents to make judgments among these options in terms of their preference to control the response biases that are commonly observed in normative tests. Diagnostic classification models (DCMs)…
Descriptors: Test Items, Classification, Bayesian Statistics, Decision Making
Aryadoust, Vahid; Luo, Lan – Language Testing, 2023
This study reviewed conceptualizations and operationalizations of second language (L2) listening constructs. A total of 157 peer-reviewed papers published in 19 journals in applied linguistics were coded for (1) publication year, author, source title, location, language, and reliability and (2) listening subskills, cognitive processes, attributes,…
Descriptors: Test Format, Listening Comprehension Tests, Second Language Learning, Second Language Instruction
Kárász, Judit T.; Széll, Krisztián; Takács, Szabolcs – Quality Assurance in Education: An International Perspective, 2023
Purpose: Based on the general formula, which depends on the length and difficulty of the test, the number of respondents and the number of ability levels, this study aims to provide a closed formula for the adaptive tests with medium difficulty (probability of solution is p = 1/2) to determine the accuracy of the parameters for each item and in…
Descriptors: Test Length, Probability, Comparative Analysis, Difficulty Level
Yizhu Gao; Xiaoming Zhai; Min Li; Gyeonggeon Lee; Xiaoxiao Liu – Grantee Submission, 2025
The rapid evolution of generative artificial intelligence (GenAI) is transforming science education by facilitating innovative pedagogical paradigms while raising substantial concerns about scholarly integrity. One particularly pressing issue is the growing risk of student use of GenAI tools to outsource assessment tasks, potentially compromising…
Descriptors: Artificial Intelligence, Computer Software, Science Education, Integrity
Yizhu Gao; Xiaoming Zhai; Min Li; Gyeonggeon Lee; Xiaoxiao Liu – Journal of Research in Science Teaching, 2025
The rapid evolution of generative artificial intelligence (GenAI) is transforming science education by facilitating innovative pedagogical paradigms while raising substantial concerns about scholarly integrity. One particularly pressing issue is the growing risk of student use of GenAI tools to outsource assessment tasks, potentially compromising…
Descriptors: Artificial Intelligence, Computer Software, Science Education, Integrity
Ippel, Lianne; Magis, David – Educational and Psychological Measurement, 2020
In dichotomous item response theory (IRT) framework, the asymptotic standard error (ASE) is the most common statistic to evaluate the precision of various ability estimators. Easy-to-use ASE formulas are readily available; however, the accuracy of some of these formulas was recently questioned and new ASE formulas were derived from a general…
Descriptors: Item Response Theory, Error of Measurement, Accuracy, Standards
Zabala Delgado, Julia; Rouveyrol, Laurent – Language Learning in Higher Education, 2022
Verbal interaction has been the subject of a growing interest among language professionals in Europe since the CEFR was published in 2001; in linguistics, verbal interaction has long been studied. In the Bakhtinian approach, it is even considered "the fundamental reality of language". All types of interaction share the fact that they are…
Descriptors: Spanish, French, Second Language Learning, Second Language Instruction
Dimova, Slobodanka – Language Teaching Research Quarterly, 2022
Drawing on Glenn Fulcher's extensive work in performance-based language assessment of speaking, this paper explores the assessment of L2 speaking ability in local language testing contexts. For that purpose, I review Fulcher's influential work that highlights the relationship between the speaking construct, the task, the performance, and the…
Descriptors: Language Tests, Speech Communication, Performance Based Assessment, Second Language Learning
Burton, J. Dylan – Language Assessment Quarterly, 2023
The effects of question or task complexity on second language speaking have traditionally been investigated using complexity, accuracy, and fluency measures. Response processes in speaking tests, however, may manifest in other ways, such as through nonverbal behavior. Eye behavior, in the form of averted gaze or blinking frequency, has been found…
Descriptors: Oral Language, Speech Communication, Language Tests, Eye Movements
Joseph, Dane Christian – Journal of Effective Teaching in Higher Education, 2019
Multiple-choice testing is a staple within the U.S. higher education system. From classroom assessments to standardized entrance exams such as the GRE, GMAT, or LSAT, test developers utilize a variety of validated and heuristic driven item-writing guidelines. One such guideline that has been given recent attention is to randomize the position of…
Descriptors: Test Construction, Multiple Choice Tests, Guessing (Tests), Test Wiseness
Al Habbash, Maha; Alsheikh, Negmeldin; Liu, Xu; Al Mohammedi, Najah; Al Othali, Safa; Ismail, Sadiq Abdulwahed – International Journal of Instruction, 2021
This convergent mixed method study aimed at exploring the English context of the widely used Emirates Standardized Test (EmSAT) by juxtaposing it to its sequel, the International English Language Testing System (IELTS). For this purpose, the study used the Common European Framework of Reference (CEFR) international standards which is used as a…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Guidelines
Hyeonah Kang – Journal of Second Language Acquisition and Teaching, 2022
Using a lexical decision task, Wolter and Yamashita (2015) showed that collocations that exist only in L1 but not in L2 were not processed faster than collocations that only exist in L2 but not in L1 or a random combination of two words. This result seems to support the age/order of acquisition effects (Carroll & White, 1973) over Jiang's…
Descriptors: Language Processing, Phrase Structure, Language Usage, Decision Making
Yan, Xun; Kim, Ha Ram; Kim, Ji Young – Language Testing, 2021
Speech fluency has been extensively researched as a core construct for second language (L2) speaking assessment. Despite the broad consensus on its multifaceted nature, few researchers have empirically explored the dimensionality of this construct. Operationalizations of fluency vary across research and practice, using both holistic and…
Descriptors: Language Fluency, Language Tests, Accuracy, Speech Communication

