Publication Date
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 114 |
| Since 2017 (last 10 years) | 375 |
| Since 2007 (last 20 years) | 1130 |
Descriptor
| Comparative Analysis | 1943 |
| Reliability | 880 |
| Test Reliability | 792 |
| Foreign Countries | 554 |
| Test Validity | 443 |
| Correlation | 350 |
| Validity | 332 |
| Interrater Reliability | 327 |
| Statistical Analysis | 321 |
| Scores | 280 |
| Measures (Individuals) | 236 |
Author
| Reckase, Mark D. | 6 |
| Attali, Yigal | 5 |
| Coniam, David | 5 |
| Brennan, Robert L. | 4 |
| Crehan, Kevin D. | 4 |
| Feldt, Leonard S. | 4 |
| Hakstian, A. Ralph | 4 |
| Jones, Ian | 4 |
| Kolen, Michael J. | 4 |
| Lunz, Mary E. | 4 |
| August, Diane | 3 |
Audience
| Researchers | 35 |
| Practitioners | 29 |
| Teachers | 15 |
| Administrators | 9 |
| Policymakers | 6 |
| Counselors | 2 |
| Media Staff | 2 |
| Parents | 1 |
| Support Staff | 1 |
Location
| Turkey | 59 |
| United States | 47 |
| Australia | 36 |
| China | 33 |
| Canada | 32 |
| United Kingdom (England) | 32 |
| United Kingdom | 28 |
| Germany | 25 |
| Netherlands | 24 |
| Taiwan | 22 |
| Hong Kong | 20 |
What Works Clearinghouse Rating
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Baghaei, Samira; Bagheri, Mohammad Sadegh; Yamini, Mortaza – Cogent Education, 2020
The main purpose of this quantitative-qualitative content analysis study was to compare IELTS and TOEFL listening and reading tests based on the representation of the learning objectives of Revised Bloom's taxonomy. To this end, 12 Academic IELTS listening and reading tests and 12 TOEFL iBT listening and reading tests were analyzed qualitatively…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Reading Tests
Sung, Kyung Hee; Noh, Eun Hee; Chon, Kyong Hee – Asia Pacific Education Review, 2017
With increased use of constructed response items in large scale assessments, the cost of scoring has been a major consideration (Noh et al. in KICE Report RRE 2012-6, 2012; Wainer and Thissen in "Applied Measurement in Education" 6:103-118, 1993). In response to the scoring cost issues, various forms of automated system for scoring…
Descriptors: Automation, Scoring, Social Studies, Test Items
Ricketts, Todd A.; Picou, Erin M.; Galster, Jason – Journal of Speech, Language, and Hearing Research, 2017
Purpose: The hearing aid microphone setting (omnidirectional or directional) can be selected manually or automatically. This study examined the percentage of time the microphone setting selected using each method was judged to provide the best signal-to-noise ratio (SNR) for the talkers of interest in school environments. Method: A total of 26…
Descriptors: Assistive Technology, Audio Equipment, Children, Adolescents
Lehan, Tara; Hussey, Heather; Mika, Eva – Journal of University Teaching and Learning Practice, 2016
Throughout the dissertation process, the chair and committee members provide feedback regarding quality to help the doctoral candidate to produce the highest-quality document and become an independent scholar. Nevertheless, results of previous research suggest that overall dissertation quality generally is poor. Because much of the feedback about…
Descriptors: Graduate Students, Doctoral Dissertations, Student Evaluation, Feedback (Response)
Fonna, Mutia; Mursalin – Malikussaleh Journal of Mathematics Learning, 2018
This quasi-experimental study examined the use of Wingeom software in geometry learning in the Department of Mathematics Education. The population in this study comprised all students of the…
Descriptors: Geometry, Mathematics Instruction, Computer Software, Teaching Methods
Filippello, Pina; Buzzai, Caterina; Sorrenti, Luana; Costa, Sebastiano; Abramo, Annarita; Wang, Kenneth T. – Applied Developmental Science, 2021
The aim of this manuscript is to examine the perceived parental perfectionism in undergraduate Italian students. Study 1 aimed at exploring the factorial structure, reliability, and construct validity of the Italian version of the Family Almost Perfect Scale (FAPS). The aim of Study 2 was to cross-validate the FAPS structure with a different…
Descriptors: Italian, Personality Measures, Undergraduate Students, Parent Attitudes
Seipp, Larry Michael – ProQuest LLC, 2021
Nonmusical factors affect the Virginia Band and Orchestra Directors Association (VBODA) concert performances and subsequent assessment results; namely, school size, ethnicity, and socioeconomic status. A comparison of ratings given by individual trained evaluators demonstrates interrater reliability. A comparison of final ratings given at…
Descriptors: Comparative Analysis, Predictor Variables, Socioeconomic Status, Ethnicity
Jeong, Heejeong – Language Testing in Asia, 2019
In writing assessment, finding a valid, reliable, and efficient scale is critical. Appropriate scales increase rater reliability and can also save time and money. This exploratory study compared the effects of a binary scale and an analytic scale across teacher raters and expert raters. The purpose of the study is to find out how different scale…
Descriptors: Writing Evaluation, English (Second Language), Second Language Learning, Second Language Instruction
Kieftenbeld, Vincent; Boyer, Michelle – Applied Measurement in Education, 2017
Automated scoring systems are typically evaluated by comparing the performance of a single automated rater item-by-item to human raters. This presents a challenge when the performance of multiple raters needs to be compared across multiple items. Rankings could depend on specifics of the ranking procedure; observed differences could be due to…
Descriptors: Automation, Scoring, Comparative Analysis, Test Items
Liu, Ren; Huggins-Manley, Anne Corinne; Bradshaw, Laine – Educational and Psychological Measurement, 2017
There is an increasing demand for assessments that can provide more fine-grained information about examinees. In response to the demand, diagnostic measurement provides students with feedback on their strengths and weaknesses on specific skills by classifying them into mastery or nonmastery attribute categories. These attributes often form a…
Descriptors: Matrices, Classification, Accuracy, Diagnostic Tests
Prihar, Ethan; Heffernan, Neil – International Educational Data Mining Society, 2021
Similar content has tremendous utility in classroom and online learning environments. For example, similar content can be used to combat cheating, track students' learning over time, and model students' latent knowledge. These different use cases for similar content all rely on different notions of similarity, which make it difficult to determine…
Descriptors: Computer Software, Middle School Teachers, Mathematics Teachers, College Students
Van Norman, Ethan R.; Parker, David C. – Assessment for Effective Intervention, 2018
Recent simulations suggest that trend line decision rules applied to curriculum-based measurement of reading progress monitoring data may lead to inaccurate interpretations unless data are collected for upward of 3 months. The authors of those studies did not manipulate goal line slope or account for a student's level of initial performance when…
Descriptors: Comparative Analysis, Curriculum Based Assessment, Reading Tests, Progress Monitoring
Jaikaew, Pimpilai; Damrongpanit, Suntonrapot – Universal Journal of Educational Research, 2018
The research was designed to examine the effects of question setting using different conditions into 10 sets on the validity of structural equation modeling for factors affecting job morale. The data was collected from 690 personnel working in regional Statistical Offices around Thailand by using cluster random sampling. The tool used in…
Descriptors: Structural Equation Models, Questionnaires, Reliability, Multivariate Analysis
McKie, Greg L.; Islam, Hashim; Townsend, Logan K.; Howe, Greg J.; Hazell, Tom J. – Measurement in Physical Education and Exercise Science, 2018
This study examined the validity and reliability of a 30-second running sprint test using two non-motorized treadmills compared to the established Wingate Anaerobic Test. Twenty-four participants completed three sessions in a randomized order on a: (1) manual mode treadmill (Woodway); (2) specialized interval training treadmill (HiTrainer); and…
Descriptors: Exercise, Physical Activities, Correlation, Exercise Physiology
Kelleher, Leila K.; Beach, Tyson A. C.; Frost, David M.; Johnson, Andrew M.; Dickey, James P. – Measurement in Physical Education and Exercise Science, 2018
The scoring scheme for the functional movement screen implicitly assumes that the factor structure is consistent, stable, and congruent across different populations. To determine if this is the case, we compared principal components analyses of three samples: a healthy, general population (n = 100), a group of varsity athletes (n = 101), and a…
Descriptors: Factor Structure, Test Reliability, Screening Tests, Motion

