Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 11 |
Since 2006 (last 20 years) | 33 |
Descriptor
Comparative Analysis | 225 |
Testing Problems | 225 |
Test Validity | 49 |
Test Reliability | 44 |
Higher Education | 38 |
Scores | 34 |
Evaluation Methods | 31 |
Foreign Countries | 31 |
Test Interpretation | 31 |
Elementary Secondary Education | 30 |
Achievement Tests | 29 |
More ▼ |
Source
Author
Publication Type
Education Level
Elementary Secondary Education | 9 |
Higher Education | 9 |
Postsecondary Education | 7 |
Secondary Education | 3 |
Elementary Education | 1 |
Grade 4 | 1 |
Grade 8 | 1 |
Intermediate Grades | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Location
United Kingdom | 5 |
United Kingdom (England) | 5 |
Canada | 4 |
Israel | 4 |
United States | 4 |
Australia | 3 |
Netherlands | 3 |
United Kingdom (Wales) | 3 |
Germany | 2 |
Iran | 2 |
Sweden | 2 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 3 |
Elementary and Secondary… | 1 |
Social Security | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021
In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…
Descriptors: Testing, Distance Education, Comparative Analysis, Test Items
Sinharay, Sandip – Applied Measurement in Education, 2017
Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristics curves and found the "H[superscript T]" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…
Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis
Shijun, Chen – International Education Studies, 2022
The high-stakes College English Test (CET), developed, administered, and reformed over the last 20 years, has received great attention in the aspect of washback on teaching and learning from previous research. Very few studies explored its consequences in the workplace domain--being used as a screening lever. This research aimed to 1) compare…
Descriptors: Language Tests, Test Use, Second Language Learning, Second Language Instruction
Sinharay, Sandip – Journal of Educational Measurement, 2017
Person-fit assessment (PFA) is concerned with uncovering atypical test performance as reflected in the pattern of scores on individual items on a test. Existing person-fit statistics (PFSs) include both parametric and nonparametric statistics. Comparison of PFSs has been a popular research topic in PFA, but almost all comparisons have employed…
Descriptors: Goodness of Fit, Testing, Test Items, Scores
Mendoza, Arturo; Martínez, Joaquín – International Journal of Language Testing, 2023
Language placement tests (LPTs) are used to assess students' proficiency in a progressive manner in the target language. Based on their performance, students are assigned to stepped language courses. These tests are usually considered low stakes because they do not have significant consequences in students' lives, which is perhaps the reason why…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Second Language Instruction
Isbell, Dan; Winke, Paula – Language Testing, 2019
The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…
Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning
Phongsirikul, Marissa – rEFLections, 2018
The study aimed to investigate teachers' and students' perceptions towards traditional and alternative types of assessment within a classroom context of an English course provided for English-majoring students at tertiary level. A combination of traditional and alternative assessment tools was implemented in the study. The researcher developed…
Descriptors: Teacher Attitudes, Student Attitudes, Alternative Assessment, Second Language Learning
Forestier, Katherine; Adamson, Bob – Compare: A Journal of Comparative and International Education, 2017
Adopting a comparative perspective to address educational issues in different contexts was a hallmark of Jullien's work in the early-nineteenth century. Different emphases and approaches to comparative education methodology have emerged in recent times thanks to major developments in technology, but have these changes rendered Jullien's ideas…
Descriptors: Criticism, International Assessment, Comparative Education, Comparative Analysis
Sinharay, Sandip; Wan, Ping; Choi, Seung W.; Kim, Dong-In – Journal of Educational Measurement, 2015
With an increase in the number of online tests, the number of interruptions during testing due to unexpected technical issues seems to be on the rise. For example, interruptions occurred during several recent state tests. When interruptions occur, it is important to determine the extent of their impact on the examinees' scores. Researchers such as…
Descriptors: Computer Assisted Testing, Testing Problems, Scores, Statistical Analysis
Emadian, Farzaneh; Gholami, Javad; Sarkhosh, Mehdi – Journal of Teacher Education for Sustainability, 2018
The first and most crucial step towards developing a sustainable curriculum for instructors teaching English for Specific Academic Purposes (ESAP) is a needs analysis. Therefore, the main aim of conducting this study was to investigate the in-service needs of language instructors and content specialists teaching ESAP and to spot the differences…
Descriptors: English for Academic Purposes, Second Language Learning, Second Language Instruction, Inservice Teacher Education
Skinner, Rebecca R. – Congressional Research Service, 2018
Assessing the achievement of students in elementary and secondary schools and the nation's educational progress is fundamental to informing education policy approaches. Congressional interest in this area includes and extends beyond the annual assessments administered by states to comply with the educational accountability requirements of Title…
Descriptors: National Competency Tests, Achievement Tests, Mathematics Achievement, Mathematics Tests
Davis, Andrew – Ethics and Education, 2015
PISA claims that it can extend its reach from its current core subjects of Reading, Science, Maths and problem-solving. Yet given the requirement for high levels of reliability for PISA, especially in the light of its current high stakes character, proposed widening of its subject coverage cannot embrace some important aspects of the social and…
Descriptors: International Assessment, High Stakes Tests, Reliability, Academic Achievement
Makransky, Guido; Glas, Cees A. W. – International Journal of Testing, 2013
Cognitive ability tests are widely used in organizations around the world because they have high predictive validity in selection contexts. Although these tests typically measure several subdomains, testing is usually carried out for a single subdomain at a time. This can be ineffective when the subdomains assessed are highly correlated. This…
Descriptors: Foreign Countries, Cognitive Ability, Adaptive Testing, Feedback (Response)
Chen, Haiwen H.; von Davier, Matthias; Yamamoto, Kentaro; Kong, Nan – ETS Research Report Series, 2015
One major issue with large-scale assessments is that the respondents might give no responses to many items, resulting in less accurate estimations of both assessed abilities and item parameters. This report studies how the types of items affect the item-level nonresponse rates and how different methods of treating item-level nonresponses have an…
Descriptors: Achievement Tests, Foreign Countries, International Assessment, Secondary School Students
Pan, Yi-Ching – TEFLIN Journal: A publication on the teaching and learning of English, 2016
There has been an increased level of attention devoted to the consequences of test use in recent years; however, the majority of washback studies focused on teaching. In fact, little research has addressed learners' perspectives to analyze possible determinants of test results. To address this issue, this study first compared the pre-and-post…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Proficiency