Publication Date
| In 2026 | 0 |
| Since 2025 | 59 |
| Since 2022 (last 5 years) | 416 |
| Since 2017 (last 10 years) | 919 |
| Since 2007 (last 20 years) | 1970 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Liu, Chunyan; Kolen, Michael J. – Journal of Educational Measurement, 2018
Smoothing techniques are designed to improve the accuracy of equating functions. The main purpose of this study is to compare seven model selection strategies for choosing the smoothing parameter (C) for polynomial loglinear presmoothing and one procedure for model selection in cubic spline postsmoothing for mixed-format pseudo tests under the…
Descriptors: Comparative Analysis, Accuracy, Models, Sample Size
Choi, Ikkyu; Hao, Jiangang; Deane, Paul; Zhang, Mo – ETS Research Report Series, 2021
"Biometrics" are physical or behavioral human characteristics that can be used to identify a person. It is widely known that keystroke or typing dynamics for short, fixed texts (e.g., passwords) could serve as a behavioral biometric. In this study, we investigate whether keystroke data from essay responses can lead to a reliable…
Descriptors: Accuracy, High Stakes Tests, Writing Tests, Benchmarking
Grund, Simon; Lüdtke, Oliver; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2021
Large-scale assessments (LSAs) use Mislevy's "plausible value" (PV) approach to relate student proficiency to noncognitive variables administered in a background questionnaire. This method requires background variables to be completely observed, a requirement that is seldom fulfilled. In this article, we evaluate and compare the…
Descriptors: Data Analysis, Error of Measurement, Research Problems, Statistical Inference
Saluja, Ronak; Cheng, Sierra; delos Santos, Keemo Althea; Chan, Kelvin K. W. – Research Synthesis Methods, 2019
Objective: Various statistical methods have been developed to estimate hazard ratios (HRs) from published Kaplan-Meier (KM) curves for the purpose of performing meta-analyses. The objective of this study was to determine the reliability, accuracy, and precision of four commonly used methods by Guyot, Williamson, Parmar, and Hoyle and Henley.…
Descriptors: Meta Analysis, Reliability, Accuracy, Randomized Controlled Trials
Carnazzo, Katherine; Dowdy, Erin; Furlong, Michael J.; Quirk, Matthew P. – Psychology in the Schools, 2019
Students with learning disabilities (LD) represent a vulnerable population and are at higher risk for social and emotional challenges compared to their peers without LD. A strengths-based orientation is recommended to encourage building resilience factors to counteract the negative effects of LD over the lifespan. To identify areas of strength and…
Descriptors: Mental Health, Learning Disabilities, Social Development, Emotional Development
Haberman, Shelby J. – ETS Research Report Series, 2020
Best linear prediction (BLP) and penalized best linear prediction (PBLP) are techniques for combining sources of information to produce task scores, section scores, and composite test scores. The report examines issues to consider in operational implementation of BLP and PBLP in testing programs administered by ETS [Educational Testing Service].
Descriptors: Prediction, Scores, Tests, Testing Programs
Erdem, Devrim – International Journal of Assessment Tools in Education, 2020
The purpose of this study was to develop a scale measuring attitudes toward women's working. In line with this main purpose, two studies were conducted to develop the tool and investigate its psychometric properties in two different samples. The study 1 started with generating item pool, conducting exploratory factor analysis to identify…
Descriptors: Young Adults, Employed Women, Test Construction, Error of Measurement
Moore, Joann L.; Li, Tianli; Lu, Yang – ACT, Inc., 2020
The Every Student Succeeds Act requires that English Learners (ELs) are included in annual state testing (grades 3-8 and once in high school) and included in each state's accountability system disaggregated by subgroup to ensure that they receive the support they need to learn English, participate fully in their education experience, and graduate…
Descriptors: College Entrance Examinations, Scores, English Language Learners, Accountability
Driller, Matthew; Brophy-Williams, Ned; Walker, Anthony – Measurement in Physical Education and Exercise Science, 2017
The purpose of the present study was to determine the reliability of a 5km run test on a motorized treadmill. Over three consecutive weeks, 12 well-trained runners completed three 5km time trials on a treadmill following a standardized warm-up. Runners were partially-blinded to their running speed and distance covered. Total time to complete the…
Descriptors: Athletics, Physical Activities, Athletes, Test Reliability
Takeda, Kazuya; Tanabe, Shigeo; Koyama, Soichiro; Nagai, Tomoko; Sakurai, Hiroaki; Kanada, Yoshikiyo; Shomoto, Koji – Measurement in Physical Education and Exercise Science, 2018
The aim of this study was to clarify the intra- and inter-rater reliability of the rate of force development in hip abductor muscle force measurements using a hand-held dynamometer. Thirty healthy adults were separately assessed by two independent raters on two separate days. Rate of force development was calculated from the slope of the…
Descriptors: Interrater Reliability, Human Body, Measurement Equipment, Handheld Devices
Hsiao, Yu-Yu; Kwok, Oi-Man; Lai, Mark H. C. – Educational and Psychological Measurement, 2018
Path models with observed composites based on multiple items (e.g., mean or sum score of the items) are commonly used to test interaction effects. Under this practice, researchers generally assume that the observed composites are measured without errors. In this study, we reviewed and evaluated two alternative methods within the structural…
Descriptors: Error of Measurement, Testing, Scores, Models
Tadesse, Tefera; Gillies, Robyn M.; Campbell, Chris – Education Sciences, 2018
This study tested the construct validity, factorial validity, and measurement invariance of the learning gains scale based on survey responses of a large sample (n = 536) of undergraduate students in two colleges at a university in Ethiopia. The analyses were performed through structural equation modeling technique using the stata 13 data analysis…
Descriptors: Construct Validity, Measurement, Achievement Gains, Undergraduate Students
Hong, Dae S.; Choi, Kyong Mi; Runnalls, Cristina; Hwang, Jihyun – North American Chapter of the International Group for the Psychology of Mathematics Education, 2018
This study compared area lessons from Korean textbooks and US standards-based textbooks to understand differences and similarities among these textbooks, as well as how these textbooks address known learning challenges in area measurement. Several well-known challenges have been identified in previous studies, such as covering, array structure,…
Descriptors: Geometric Concepts, Measurement, Mathematics Instruction, Elementary School Mathematics
Shakerin, Said – Physics Teacher, 2016
A simple mistake in properly setting up a measuring device caused millions of dollars to be spent in correcting the initial optical failure of the Hubble Space Telescope (HST). This short article is intended as a lesson for a physics laboratory and discussion of errors in measurement.
Descriptors: Laboratory Equipment, Science Laboratories, Science Instruction, Physics
Someki, Fumio; Ohnishi, Masafumi; Vejdemo-Johansson, Mikael; Nakamura, Kazuhiko – Journal of Psychoeducational Assessment, 2020
To examine reliability, validity, factor structure, and measurement invariance (i.e., configural, metric, and scalar invariance) of the Japanese Conners' Adult attention deficit hyperactivity disorder (ADHD) Rating Scales (CAARS), Japanese nonclinical adults (N = 786) completed the CAARS Self-Report (CAARS-S). Each participant was also rated by…
Descriptors: Attention Deficit Hyperactivity Disorder, Rating Scales, Foreign Countries, Test Reliability

Peer reviewed
Direct link
