Publication Date
| In 2026 | 0 |
| Since 2025 | 6 |
| Since 2022 (last 5 years) | 26 |
| Since 2017 (last 10 years) | 108 |
| Since 2007 (last 20 years) | 302 |
Descriptor
| Comparative Analysis | 792 |
| Test Reliability | 792 |
| Test Validity | 425 |
| Foreign Countries | 174 |
| Test Construction | 132 |
| Correlation | 119 |
| Statistical Analysis | 117 |
| Scores | 106 |
| Higher Education | 98 |
| Psychometrics | 91 |
| Test Items | 89 |
Author
| Reckase, Mark D. | 5 |
| Bashaw, W. L. | 3 |
| Bennett, Randy Elliot | 3 |
| Benson, Jeri | 3 |
| Crehan, Kevin D. | 3 |
| Ebel, Robert L. | 3 |
| Frisbie, David A. | 3 |
| Hakstian, A. Ralph | 3 |
| Henk, William A. | 3 |
| Weiss, David J. | 3 |
| Winke, Paula | 3 |
Audience
| Researchers | 18 |
| Practitioners | 17 |
| Teachers | 9 |
| Administrators | 4 |
| Counselors | 2 |
| Policymakers | 2 |
| Parents | 1 |
| Support Staff | 1 |
Location
| United States | 21 |
| Turkey | 20 |
| Australia | 16 |
| China | 11 |
| United Kingdom (England) | 11 |
| Germany | 9 |
| Hong Kong | 9 |
| Iran | 9 |
| Taiwan | 9 |
| United Kingdom | 9 |
| Canada | 8 |
Winke, Paula; Lee, Shinhye; Ahn, Jieun Irene; Choi, Ina; Cui, Yaqiong; Yoon, Hyung-Jo – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2018
This study investigated the cognitive validity of two child English language tests. Some teachers maintain that these types of tests may be cognitively invalid because native-English-speaking children would not do well on them (Winke, 2011). So the researchers had native speakers and learners of English aged 7 to 9 take sample versions of two…
Descriptors: Language Tests, English, English (Second Language), Second Language Learning
Al-Tal, Suhair; AL-Jawaldeh, Fuad; AL-Taj, Heyam; Maharmeh, Lina – International Education Studies, 2017
This study aimed at revealing the emotional intelligence levels of students with sensory disabilities in Amman, Jordan. The participants of the study were 200 students: 140 hearing-impaired students and 60 visually impaired students enrolled in the special education schools and centers for the academic year 2016-2017. The study adopted the…
Descriptors: Foreign Countries, Emotional Intelligence, Hearing Impairments, Visual Impairments
Rios, Joseph A.; Liu, Ou Lydia – American Journal of Distance Education, 2017
Online higher education institutions are presented with the concern of how to obtain valid results when administering student learning outcomes (SLO) assessments remotely. Traditionally, there has been a great reliance on unproctored Internet test administration (UIT) due to increased flexibility and reduced costs; however, a number of validity…
Descriptors: Online Courses, Testing, Test Wiseness, Academic Achievement
Uto, Masaki; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2016
As an assessment method based on a constructivist approach, peer assessment has become popular in recent years. However, in peer assessment, a problem remains that reliability depends on the rater characteristics. For this reason, some item response models that incorporate rater parameters have been proposed. Those models are expected to improve…
Descriptors: Item Response Theory, Peer Evaluation, Bayesian Statistics, Simulation
Murray, Keith B.; Zdravkovic, Srdan – Journal of Education for Business, 2016
Considerable debate continues regarding the efficacy of the website RateMyProfessors.com (RMP). To date, however, virtually no direct, experimental research has been reported which directly bears on questions relating to sampling adequacy or item adequacy in producing what favorable correlations have been reported. The authors compare the data…
Descriptors: Computer Assisted Testing, Computer Software Evaluation, Student Evaluation of Teacher Performance, Item Analysis
Wright, Paul M.; Irwin, Carol – Measurement in Physical Education and Exercise Science, 2018
National content standards in PE address responsibility; however, learning outcomes and teacher effectiveness in this area remain poorly defined. This study employed the Social and Emotional Learning framework and a teaching personal and social responsibility (TPSR) model fidelity instrument to address this gap. Our purpose was to examine the…
Descriptors: Observation, Teacher Effectiveness, Teacher Responsibility, Physical Education Teachers
Davis, Doris Bitler – Teaching of Psychology, 2017
Providing two or more versions of multiple-choice exams has long been a popular strategy for reducing the opportunity for students to engage in academic dishonesty. While the results of studies comparing exam scores under different question-order conditions have been inconclusive, the potential importance of contextual cues to aid student recall…
Descriptors: Test Construction, Multiple Choice Tests, Sequential Approach, Cues
Kloser, Matthew; Borko, Hilda; Martinez, Jose Felipe; Stecher, Brian; Luskin, Rebecca – Science Education, 2017
Assessments are powerful tools for informing teachers and students about where student thinking stands with relation to a learning goal. Yet, few studies provide qualitative analyses of assessment practice across a unit. This study uses a framework of nine dimensions of effective assessment practice in science classrooms to compare more and less…
Descriptors: Secondary School Science, Evidence, Portfolio Assessment, Middle School Teachers
Whittaker, Jessica E. V.; Williford, Amanda P.; Carter, Lauren M.; Vitiello, Virginia E.; Hatfield, Bridget E. – Early Education and Development, 2018
Research Findings: This study explored the quality of teacher-child interactions within the context of a newly developed standardized task, Teacher-Child Structured Play Task (TC-SPT). A sample of 146 teachers and 345 children participated. Children who displayed the highest disruptive behaviors within each classroom were selected to participate.…
Descriptors: Teacher Student Relationship, Interaction, Preschool Children, Preschool Teachers
Farwell, Tricia M.; Alligood, Leon; Fitzgerald, Sharon; Blake, Ken – Journalism and Mass Communication Educator, 2016
This article introduces an objective grammar and math assessment and evaluates the assessment's outcome and reliability when fielded among eighty-one students in media writing courses. In addition, the article proposes a rubric for grading straight news leads and compares the rubric's reliability with the reliability of rating straight news leads…
Descriptors: Journalism, Journalism Education, Introductory Courses, Reliability
Lowe, Patricia A.; Ang, Rebecca P. – Journal of Psychoeducational Assessment, 2016
Tests of measurement invariance were conducted across culture and gender on the Revised Children's Manifest Anxiety Scale-Second Edition (RCMAS-2) Short Form in a sample of 1,003 Singapore and U.S. adolescents. The results of multi-group confirmatory factor analyses across culture and gender supported at least partial measurement invariance. ANOVA…
Descriptors: Measurement Techniques, Cultural Differences, Gender Differences, Comparative Analysis
Hauser, Peter C.; Paludneviciene, Raylene; Riddle, Wanda; Kurz, Kim B.; Emmorey, Karen; Contreras, Jessica – Journal of Deaf Studies and Deaf Education, 2016
The American Sign Language Comprehension Test (ASL-CT) is a 30-item multiple-choice test that measures ASL receptive skills and is administered through a website. This article describes the development and psychometric properties of the test based on a sample of 80 college students including deaf native signers, hearing native signers, deaf…
Descriptors: American Sign Language, Comprehension, Multiple Choice Tests, Receptive Language
Moshinsky, Avital; Ziegler, David; Gafni, Naomi – International Journal of Testing, 2017
Many medical schools have adopted multiple mini-interviews (MMI) as an advanced selection tool. MMIs are expensive and used to test only a few dozen candidates per day, making it infeasible to develop a different test version for each test administration. Therefore, some items are reused both within and across years. This study investigated the…
Descriptors: Interviews, Medical Schools, Test Validity, Test Reliability
Bostian, Brad – International Journal of Multidisciplinary Perspectives in Higher Education, 2017
Amid profound changes to student placement systems at universities and colleges, the placement of English language learners has remained largely the same. Generally speaking, international students, and in some places other English language learners, face single measure testing and required remediation. Single measure high stakes testing goes…
Descriptors: Student Placement, English (Second Language), Second Language Learning, College Students
Thompson, Gregory L.; Cox, Troy L.; Knapp, Nieves – Foreign Language Annals, 2016
While studies have been done to rate the validity and reliability of the Oral Proficiency Interview (OPI) and Oral Proficiency Interview-Computer (OPIc) independently, a limited amount of research has analyzed the interexam reliability of these tests, and studies have yet to be conducted comparing the results of Spanish language learners who take…
Descriptors: Comparative Analysis, Oral Language, Language Proficiency, Spanish

