Publication Date
| In 2026 | 12 |
| Since 2025 | 958 |
| Since 2022 (last 5 years) | 4567 |
| Since 2017 (last 10 years) | 10500 |
| Since 2007 (last 20 years) | 21963 |
Descriptor
| Test Validity | 21786 |
| Validity | 13791 |
| Test Reliability | 10864 |
| Foreign Countries | 9887 |
| Test Construction | 6897 |
| Factor Analysis | 5761 |
| Measures (Individuals) | 5633 |
| Predictive Validity | 5022 |
| Psychometrics | 4820 |
| Reliability | 4635 |
| Correlation | 4376 |
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
Location
| Turkey | 1397 |
| Australia | 705 |
| Canada | 626 |
| China | 528 |
| United States | 439 |
| Indonesia | 389 |
| United Kingdom | 363 |
| Germany | 340 |
| California | 338 |
| Netherlands | 336 |
| Spain | 311 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Wielicki, Tom – International Association for Development of the Information Society, 2016
This paper reports on a longitudinal study of the integrity of testing in the online format used by e-learning platforms. Specifically, the study examines whether online testing, which implies an open-book format, compromises the integrity of assessment by encouraging cheating among students. Statistical experiment designed for this study…
Descriptors: Integrity, Online Courses, Statistical Surveys, Longitudinal Studies
Marini, Jessica P.; Shaw, Emily J.; Young, Linda – College Board, 2016
During the transition period between the use of exclusively old SAT® scores and the use of exclusively new SAT scores, college admission offices will be receiving both types of scores from students. Making an admission decision based on new SAT scores can be challenging at first because institutions have methods, procedures, and models based on…
Descriptors: College Entrance Examinations, Scores, College Admission, Decision Making
Goldhammer, Frank; Martens, Thomas; Christoph, Gabriela; Lüdtke, Oliver – OECD Publishing, 2016
In this study, we investigated how empirical indicators of test-taking engagement can be defined, empirically validated, and used to describe group differences in the context of the Programme for the International Assessment of Adult Competencies (PIAAC). The approach was to distinguish between disengaged and engaged response behavior by means of…
Descriptors: International Assessment, Adults, Response Style (Tests), Reaction Time
Zazkis, Rina; Zazkis, Dov – Research in Mathematics Education, 2014
Script writing by learners has been used as a valuable pedagogical strategy and a research tool in several contexts. We adopted this strategy in the context of a mathematics course for prospective teachers. Participants were presented with opposing viewpoints with respect to a mathematical claim, and were asked to write a dialogue in which the…
Descriptors: Numbers, Mathematics, Teacher Education, Mathematics Education
Karim, Shahzad; Haq, Naushaba – International Journal of Evaluation and Research in Education, 2014
The present study focused on assessing the IELTS speaking test. The assessment discussed both the strengths and the weaknesses of the IELTS speaking module. The researchers also suggested some possible measures for improving the IELTS speaking test and increasing its validity and reliability. The researchers analysed and assessed IELTS…
Descriptors: English (Second Language), Second Language Learning, Language Tests, Speech Tests
Cormier, Damien C.; McGrew, Kevin S.; Ysseldyke, James E. – Journal of Psychoeducational Assessment, 2014
The increasing diversity of the U.S. population has resulted in increased concerns about the psychological assessment of students from culturally and linguistically diverse backgrounds. To date, little empirical research supports recommendations in test selection and interpretation, such as those presented in the Culture-Language Interpretative…
Descriptors: Cognitive Tests, Scores, Validity, Classification
Penketh, Claire – International Journal of Art & Design Education, 2014
Putting disability studies to work in art education suggests a form of action or industry, a creative opportunity for something to be done, recognising the relationship between theory and practice. Drawing on discourse analysis, this article offers an initial theoretical discussion of some of the ways in which disability is revealed and created…
Descriptors: Art Education, Disabilities, Discourse Analysis, Journal Articles
Canivez, Gary L. – School Psychology Quarterly, 2014
The Wechsler Intelligence Scale for Children--Fourth Edition (WISC-IV) is one of the most frequently used intelligence tests in clinical assessments of children with learning difficulties. Construct validity studies of the WISC-IV have generally supported the higher order structure with four correlated first-order factors and one higher-order…
Descriptors: Intelligence Tests, Construct Validity, Children, Learning Problems
Derby, Dustin C.; Smith, Thomas J. – Measurement and Evaluation in Counseling and Development, 2014
The purpose of the current study was to assess the gender invariance of an a priori four-factor solution of behavioral consequences of drinking. Results evidenced strong partial measurement invariance, with marginal structural invariance, which signals that the underlying constructs possessed the same theoretical structure for both men and women.
Descriptors: Drinking, Gender Differences, Measurement, Factor Structure
Eshach, Haim – Physical Review Special Topics - Physics Education Research, 2014
This article describes the development and field test of the Sound Concept Inventory Instrument (SCII), designed to measure middle school students' concepts of sound. The instrument was designed based on students' known difficulties in understanding sound and on the history of science related to sound, and it focuses on two main aspects of sound: sound…
Descriptors: Measures (Individuals), Auditory Discrimination, Test Reliability, Test Validity
Wood, Timothy J. – Advances in Health Sciences Education, 2014
Medical education relies heavily on assessment formats that require raters to assess the competence and skills of learners. Unfortunately, there are often inconsistencies and variability in the scores raters assign. To ensure the scores from these assessment tools have validity, it is important to understand the underlying cognitive processes that…
Descriptors: Medical Education, Interrater Reliability, Cognitive Processes, Validity
Bowles, Terry; Hattie, John; Dinham, Stephen; Scull, Janet; Clinton, Janet – Australian Educational Researcher, 2014
Teacher education in universities continues to diversify in the twenty-first century. Just as course offerings, course delivery, staffing and the teaching/research mix vary extensively from university to university, so does the procedure for pre-service teacher selection. Various factors bear on selection procedures and practices; however, few…
Descriptors: Teacher Education, Preservice Teachers, Selective Admission, Scores
Luckay, Melanie B.; Collier-Reed, Brandon I. – International Journal of Technology and Design Education, 2014
In this article, an instrument for assessing upper secondary school students' levels of technological literacy is presented. The items making up the instrument emerged from a previous study that employed a phenomenographic research approach to explore students' conceptions of technology in terms of their understanding of the "nature…
Descriptors: Secondary School Students, Technological Literacy, Measures (Individuals), Student Attitudes
Attali, Yigal – Educational and Psychological Measurement, 2014
This article presents a comparative judgment approach for holistically scored constructed response tasks. In this approach, the grader rank-orders (rather than rates) the quality of a small set of responses. A prior automated evaluation of responses guides both set formation and scaling of rankings. Sets are formed to have similar prior scores and…
Descriptors: Responses, Item Response Theory, Scores, Rating Scales
Pope, J. Paige; Hall, Craig R. – Measurement in Physical Education and Exercise Science, 2014
This study was designed to examine select psychometric properties of the Coach Identity Prominence Scale (CIPS), including the reliability, factorial validity, convergent validity, discriminant validity, and predictive validity. Coaches (N = 338) who averaged 37 (SD = 12.27) years of age, had a mean of 13 (SD = 9.90) years of coaching experience,…
Descriptors: Athletic Coaches, Self Concept Measures, Test Validity, Test Reliability

