Publication Date
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 114 |
| Since 2017 (last 10 years) | 375 |
| Since 2007 (last 20 years) | 1130 |
Descriptor
| Comparative Analysis | 1943 |
| Reliability | 880 |
| Test Reliability | 792 |
| Foreign Countries | 554 |
| Test Validity | 443 |
| Correlation | 350 |
| Validity | 332 |
| Interrater Reliability | 327 |
| Statistical Analysis | 321 |
| Scores | 280 |
| Measures (Individuals) | 236 |
| More ▼ | |
Source
Author
| Reckase, Mark D. | 6 |
| Attali, Yigal | 5 |
| Coniam, David | 5 |
| Brennan, Robert L. | 4 |
| Crehan, Kevin D. | 4 |
| Feldt, Leonard S. | 4 |
| Hakstian, A. Ralph | 4 |
| Jones, Ian | 4 |
| Kolen, Michael J. | 4 |
| Lunz, Mary E. | 4 |
| August, Diane | 3 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 35 |
| Practitioners | 29 |
| Teachers | 15 |
| Administrators | 9 |
| Policymakers | 6 |
| Counselors | 2 |
| Media Staff | 2 |
| Parents | 1 |
| Support Staff | 1 |
Location
| Turkey | 59 |
| United States | 47 |
| Australia | 36 |
| China | 33 |
| Canada | 32 |
| United Kingdom (England) | 32 |
| United Kingdom | 28 |
| Germany | 25 |
| Netherlands | 24 |
| Taiwan | 22 |
| Hong Kong | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Winke, Paula – Language Assessment Quarterly, 2011
In this study, I investigated the reliability of the U.S. Naturalization Test's civics component by asking 414 individuals to take a mock U.S. citizenship test comprising civics test questions. Using an incomplete block design of six forms with 16 nonoverlapping items and four anchor items on each form (the anchors connected the six subsets of…
Descriptors: Test Items, Citizenship, Civics, Test Validity
Hill, Heather C.; Charalambous, Charalambos Y.; Kraft, Matthew A. – Educational Researcher, 2012
In recent years, interest has grown in using classroom observation as a means to several ends, including teacher development, teacher evaluation, and impact evaluation of classroom-based interventions. Although education practitioners and researchers have developed numerous observational instruments for these purposes, many developers fail to…
Descriptors: Generalizability Theory, Observation, Classroom Observation Techniques, Evaluation
Balogh, Jennifer; Bernstein, Jared; Cheng, Jian; Van Moere, Alistair; Townshend, Brent; Suzuki, Masanori – Educational and Psychological Measurement, 2012
A two-part experiment is presented that validates a new measurement tool for scoring oral reading ability. Data collected by the U.S. government in a large-scale literacy assessment of adults were analyzed by a system called VersaReader that uses automatic speech recognition and speech processing technologies to score oral reading fluency. In the…
Descriptors: Reading Fluency, Measures (Individuals), Scoring, Reading Ability
Hirsch, Shanna Eisner; Kennedy, Michael J.; Haines, Shana J.; Thomas, Cathy Newman; Alves, Kat D. – Behavioral Disorders, 2015
Functional behavioral assessment (FBA) is an empirically supported intervention associated with decreasing problem behavior and increasing appropriate behavior. To date, few studies have examined multimedia approaches to FBA training. This paper provides the outcomes of a randomized controlled trial across three university sites and evaluates…
Descriptors: Functional Behavioral Assessment, Preservice Teachers, Knowledge Level, Randomized Controlled Trials
Christie, A.; Kamen, G.; Boucher, Jean P.; Inglis, J. Greig; Gabriel, David A. – Measurement in Physical Education and Exercise Science, 2010
The Hoffmann reflex is obtained through surface electromyographic recordings, and it is one of the most common neurophysiological techniques in exercise science. Measurement and evaluation of the peak-to-peak amplitude of the Hoffmann reflex has been guided by the observation that it is a variable response that requires multiple trials to obtain a…
Descriptors: Motor Reactions, Measurement Techniques, Comparative Analysis, Statistical Analysis
Almehrizi, Rashid S. – Applied Psychological Measurement, 2013
The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (a; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…
Descriptors: Raw Scores, Scaling, Reliability, Computation
Shirazi, Masoumeh Ahmadi; Shekarabi, Zeinab – Iranian Journal of Language Teaching Research, 2014
This study is an attempt to investigate the effect of direct and indirect feedback on the writing performance of Iranian learners of Japanese as a foreign language. During one academic semester, three indirect feedback types including underlining, coding and translation were used as well as direct type of feedback in order to see which one makes a…
Descriptors: Foreign Countries, Second Language Learning, Second Language Instruction, Japanese
Collier, Lizabeth C. – ProQuest LLC, 2014
This study investigates how university instructors from various disciplines at a large, comprehensive university in the United States evaluate different varieties of English from countries considered "outer circle" (OC) countries, formerly colonized countries where English has been transplanted and is now used unofficially and officially…
Descriptors: Universities, Global Approach, College English, Writing Evaluation
Ling, Guangming – ETS Research Report Series, 2012
To assess the value of individual students' subscores on the Major Field Test in Business (MFT Business), I examined the test's internal structure with factor analysis and structural equation model methods, and analyzed the subscore reliabilities using the augmented scores method. Analyses of the internal structure suggested that the MFT Business…
Descriptors: Factor Analysis, Construct Validity, Structural Equation Models, Correlation
DeChenne, Sue Ellen; Enochs, Larry G.; Needham, Mark – Journal of the Scholarship of Teaching and Learning, 2012
The graduate experience is a critical time for development of academic faculty, but often there is little preparation for teaching during the graduate career. Teaching self-efficacy, an instructor's belief in his or her ability to teach students in a specific context, can help to predict teaching behavior and student achievement, and can be used…
Descriptors: STEM Education, Teaching Assistants, Graduate Students, Self Efficacy
Price, Katherine W.; Meisinger, Elizabeth B.; Louwerse, Max M.; D'Mello, Sidney K. – Psychology in the Schools, 2012
Assessing silent reading fluency in classroom environments is challenging. This article reports on a method of assessing silent reading using underlining, an approach that solves many problems other silent reading fluency assessment measures face. This method computationally monitors readers' silent reading fluency by the speed they underline…
Descriptors: Evidence, Reading Comprehension, Silent Reading, Reading Fluency
Yao, Lihua – Psychometrika, 2012
Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…
Descriptors: Item Banks, Test Length, Simulation, Adaptive Testing
Yoshida, Hanako – Journal of Cognition and Development, 2012
A long history of research has considered the role of iconicity in language and the existence and role of nonarbitrary properties in language and the use of language. Previous studies with Japanese-speaking children, whose language defines a large grammatical class of words with clear sound symbolism, suggest that iconicity properties in Japanese…
Descriptors: Language Usage, Speech Communication, Verbs, Linguistics
Mary E. Little; D'Ann Rawlinson; Deborah C. Simmons; Minjung Kim; Oi-man Kwok; Shanna Hagan-Burke; Leslie E. Simmons; Melissa Fogarty; Eric Oslund; Michael D. Coyne – Learning Disabilities Research & Practice, 2012
This study compared the effects of Tier 2 reading interventions that operated in response-to-intervention contexts. Kindergarten children (N = 90) who were identified as at risk for reading difficulties were stratified by school and randomly assigned to receive (a) Early Reading Intervention (ERI; Pearson/Scott Foresman, 2004) modified in response…
Descriptors: Reading Achievement, Outcome Measures, Kindergarten, Response to Intervention
Thomason-Sassi, Jessica L.; Iwata, Brian A.; Neidert, Pamela L.; Roscoe, Eileen M. – Journal of Applied Behavior Analysis, 2011
Dependent variables in research on problem behavior typically are based on measures of response repetition, but these measures may be problematic when behavior poses high risk or when its occurrence terminates a session. We examined response latency as the index of behavior during assessment. In Experiment 1, we compared response rate and latency…
Descriptors: Behavior Problems, Reaction Time, Functional Behavioral Assessment, Experiments

Peer reviewed
Direct link
