Publication Date
| In 2026 | 0 |
| Since 2025 | 5 |
| Since 2022 (last 5 years) | 23 |
| Since 2017 (last 10 years) | 563 |
| Since 2007 (last 20 years) | 1786 |
Descriptor
| Statistical Analysis | 2533 |
| Reliability | 1278 |
| Test Reliability | 1074 |
| Foreign Countries | 940 |
| Correlation | 633 |
| Test Validity | 630 |
| Factor Analysis | 559 |
| Validity | 508 |
| Questionnaires | 479 |
| Measures (Individuals) | 411 |
| Test Construction | 338 |
| More ▼ | |
Source
Author
| Alonzo, Julie | 12 |
| Price, Gary G. | 12 |
| Tindal, Gerald | 10 |
| Lai, Cheng-Fei | 9 |
| Brennan, Robert L. | 8 |
| Raykov, Tenko | 8 |
| Feldt, Leonard S. | 7 |
| Livingston, Samuel A. | 7 |
| Park, Bitnara Jasmine | 7 |
| Irvin, P. Shawn | 6 |
| Anderson, Daniel | 5 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 34 |
| Practitioners | 21 |
| Teachers | 10 |
| Students | 8 |
| Administrators | 5 |
| Counselors | 2 |
| Parents | 1 |
| Policymakers | 1 |
Location
| Turkey | 204 |
| Nigeria | 57 |
| Jordan | 38 |
| Australia | 35 |
| Iran | 35 |
| Taiwan | 35 |
| Canada | 31 |
| China | 30 |
| Germany | 29 |
| California | 28 |
| United Kingdom | 25 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Sung, Yao-Ting; Chang, Kuo-En; Chang, Tzyy-Hua; Yu, Wen-Cheng – Journal of Adolescence, 2010
Self- and peer assessments are becoming more popular in classrooms, but there are few data on the reliability and validity of such assessments performed by school children. Because these factors are greatly affected by the number of raters, we conducted two studies to determine the rating behaviours of teenagers in self- and peer assessments, and…
Descriptors: Generalizability Theory, Peer Evaluation, Validity, Reliability
Wang, Ping – English Language Teaching, 2009
This paper makes a study of the rater reliability in scoring composition in the test of English as a foreign language (EFL) and focuses on the inter-rater reliability as well as several interactions between raters and the other facets involved (that is examinees, rating criteria and rating methods). Results showed that raters were fairly…
Descriptors: Interrater Reliability, Scoring, Writing (Composition), English (Second Language)
Wanstrom, Linda – Multivariate Behavioral Research, 2009
Second-order latent growth curve models (S. C. Duncan & Duncan, 1996; McArdle, 1988) can be used to study group differences in change in latent constructs. We give exact formulas for the covariance matrix of the parameter estimates and an algebraic expression for the estimation of slope differences. Formulas for calculations of the required sample…
Descriptors: Sample Size, Effect Size, Mathematical Formulas, Computation
Waller, Niels G. – Applied Psychological Measurement, 2008
Reliability is a property of test scores from individuals who have been sampled from a well-defined population. Reliability indices, such as coefficient and related formulas for internal consistency reliability (KR-20, Hoyt's reliability), yield lower bound reliability estimates when (a) subjects have been sampled from a single population and when…
Descriptors: Test Items, Reliability, Scores, Psychometrics
Wormald, Amy Marie – Online Submission, 2011
Recent literature notes a significant increase in students being diagnosed with Autism Spectrum Disorder (ASD). Yet, no uniform testing protocol exists. It is vital for students and educators that a unified testing process be created and established. This study investigated which ASD testing instruments were currently used in Southern California…
Descriptors: Private Schools, Autism, Testing, Rating Scales
Osisioma, Irene U.; Onyia, Chidiebere R. – International Education Studies, 2009
The present study seeks to explore middle school students' perception of the kind of science instruction going on in their classrooms and its relevance to their daily lives outside the classroom. Data were collected using a five point Likert type survey instrument that was administered to 262 middle school (Grades 6, 7 & 8) students in six…
Descriptors: Middle School Students, Student Attitudes, Surveys, Science Instruction
Shani, Amir – ProQuest LLC, 2009
From time immemorial human beings have utilized animals for various needs and purposes, which led societies to debate the justification for using animals and to reflect on the way in which animals are treated. These concerns have also resulted in various contemporary studies aimed to reveal interest groups'--as well as the general publics'--views…
Descriptors: Animals, Qualitative Research, Leisure Time, Attitudes
Bing, Mark N.; Stewart, Susan M.; Davison, H. Kristl – Educational and Psychological Measurement, 2009
Handheld calculators have been used on the job for more than 30 years, yet the degree to which these devices can affect performance on employment tests of mathematical ability has not been thoroughly examined. This study used a within-subjects research design (N = 167) to investigate the effects of calculator use on test score reliability, test…
Descriptors: Calculators, Mathematics Tests, Occupational Tests, Test Reliability
Lissitz, Robert W.; Hou, Xiaodong; Slater, Sharon Cadman – Journal of Applied Testing Technology, 2012
This article investigates several questions regarding the impact of different item formats on measurement characteristics. Constructed response (CR) items and multiple choice (MC) items obviously differ in their formats and in the resources needed to score them. As such, they have been the subject of considerable discussion regarding the impact of…
Descriptors: Computer Assisted Testing, Scoring, Evaluation Problems, Psychometrics
Peer reviewedMehryar, A. H.; Shapurian, R. – British Journal of Educational Psychology, 1971
Descriptors: Reliability, Secondary School Students, Statistical Analysis, Test Reliability
Raju, T. J. M. S.; Suryanarayana, N. V. S. – Journal on School Educational Technology, 2011
This study focuses on the availability and use of Science Laboratories at the secondary education level in Visakhapatnam District of Andhra Pradesh, India. It is commented that most of the schools do not possess well equipped laboratories and even when equipment is available some science teachers are not utilizing the laboratory facilities.…
Descriptors: Science Laboratories, Secondary Schools, Secondary Education, Foreign Countries
Burmester, Kristen O'Rourke – ProQuest LLC, 2011
Classrooms are a primary site of evidence about learning. Yet classroom proceedings often occur behind closed doors and hence evidence of student learning is observable only to the classroom teacher. The informal and undocumented nature of this information means that it is rarely included in statistical models or quantifiable analyses. This…
Descriptors: Evidence, Student Evaluation, Educational Research, Validity
Teo, Timothy; Koh, Joyce Hwee Ling – International Journal of Education and Development using Information and Communication Technology, 2010
This study examines the computer self-efficacy among pre-service teachers (N = 708) at a teacher training institute in Singapore. Data were collected through self-reported ratings on a 7-point Likert-type scale. Exploratory factor analysis (EFA) was performed on an initial sample (N = 354) and the result revealed that pre-service teachers'…
Descriptors: Computer Literacy, Self Efficacy, Preservice Teachers, Structural Equation Models
Plankis, Brian J.; Marrero, Meghan E. – International Electronic Journal of Environmental Education, 2010
Recent research conducted on adults in the United States indicates low ocean literacy (Ocean Project, 2009b, 1999), but there is a dearth of peer-reviewed research on K-12 students' ocean literacy. This paper presents two research studies that examined the ocean and environmental literacy of 464 K-12 students in five states. Like the majority of…
Descriptors: Public Schools, Oceanography, Scientific Literacy, Elementary Secondary Education
Dhami, Mandeep K. – Journal of Experimental Psychology: Applied, 2008
Beyond reasonable doubt represents a probability value that acts as the criterion for conviction in criminal trials. I introduce the membership function (MF) method as a new tool for measuring quantitative interpretations of reasonable doubt. Experiment 1 demonstrated that three different methods (i.e., direct rating, decision theory based, and…
Descriptors: Probability, Criminal Law, Court Litigation, Decision Making

Direct link
