Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022
Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…
Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores
Ssemakula, Mukasa E.; Liao, Gene Y.; Sawilowsky, Shlomo – American Journal of Engineering Education, 2018
There is a major trend in engineering education to provide students with realistic hands-on learning experiences. This paper reports on the results of work done to develop standardized test instruments to use for student learning outcomes assessment in an experiential hands-on manufacturing engineering and technology environment. The specific…
Descriptors: Test Construction, Psychometrics, Test Validity, Standardized Tests
Kane, Michael T. – Assessment in Education: Principles, Policy & Practice, 2017
In response to an argument by Baird, Andrich, Hopfenbeck and Stobart (2017), Michael Kane states that there needs to be a better fit between educational assessment and learning theory. In line with this goal, Kane will examine how psychometric constraints might be loosened by relaxing some psychometric "rules" in some assessment…
Descriptors: Educational Assessment, Psychometrics, Standards, Test Reliability
Liu, Xueman Lucy; de Villiers, Jill; Ning, Chunyan; Rolfhus, Eric; Hutchings, Teresa; Lee, Wendy; Jiang, Fan; Zhang, Yi Wen – Journal of Speech, Language, and Hearing Research, 2017
Purpose: With no existing gold standard for comparison, challenges arise for establishing the validity of a new standardized Mandarin language assessment normed in mainland China. Method: A new assessment, Diagnostic Receptive and Expressive Assessment of Mandarin (DREAM), was normed with a stratified sample of 969 children ages 2;6 (years;months)…
Descriptors: Mandarin Chinese, Correlation, Language Tests, Diagnostic Tests
Hawley, Leslie R.; Bovaird, James A.; Wu, ChaoRong – Applied Measurement in Education, 2017
Value-added assessment methods have been criticized by researchers and policy makers for a number of reasons. One issue includes the sensitivity of model results across different outcome measures. This study examined the utility of incorporating multivariate latent variable approaches within a traditional value-added framework. We evaluated the…
Descriptors: Value Added Models, Reliability, Multivariate Analysis, Scaling
Duong, Mylien T.; Badaly, Daryaneh; Liu, Freda F.; Schwartz, David; McCarty, Carolyn A. – Review of Educational Research, 2016
Research on generational differences in immigrant youths' academic achievement has yielded conflicting findings. This meta-analysis reconciles discrepant findings by testing meta-analytic moderators. Fifty-three studies provided 74 comparisons on academic outcomes. First- and second-generation youths did not significantly differ on academic…
Descriptors: Immigrants, Academic Achievement, Meta Analysis, Educational Research
Ezeudu, Florence O.; Chiaha, Gertrude-Theresa Uzoamaka; Anazor, Lynda Chioma; Eze, Justina Uzoamaka; Omeke, Faith Chinwe – Journal of Education and Practice, 2015
The purpose of this study was to conduct a SWOT analysis and compare the performance of male and female students in chemistry. Four research questions and four null hypotheses guided the study. Two boys', two girls', and two coeducational schools, involving 1319 males and 1831 females, were selected by a stratified, deliberate sampling technique. A…
Descriptors: Foreign Countries, Chemistry, Science Instruction, Strategic Planning
Amrein-Beardsley, Audrey; Geiger, Tray – Phi Delta Kappan, 2017
Houston's experience with the Educational Value-Added Assessment System® (EVAAS) raises questions that other districts should consider before buying the software and using it for high-stakes decisions. Researchers found that teachers in Houston, all of whom were under the EVAAS gun, but who taught relatively more racial minority students,…
Descriptors: Value Added Models, School Districts, Computer Software, Educational Technology
König, Johannes; Lammerding, Sandra; Nold, Günter; Rohde, Andreas; Strauß, Sarah; Tachtsoglou, Sarantis – Journal of Teacher Education, 2016
Despite an increasing research interest in subject-specific teacher knowledge, the scientific understanding regarding teachers' professional knowledge for teaching English as a foreign language (TEFL) is very limited. This study therefore applies standardized tests to directly assess content knowledge (CK), pedagogical content knowledge (PCK), and…
Descriptors: Knowledge Base for Teaching, English (Second Language), Second Language Learning, Second Language Instruction
Polikoff, Morgan S. – Educational Assessment, 2016
As state tests of student achievement are used for an increasingly wide array of high- and low-stakes purposes, evaluating their instructional sensitivity is essential. This article uses data from the Bill and Melinda Gates Foundation's Measures of Effective Teaching Project to examine the instructional sensitivity of 4 states' mathematics and English…
Descriptors: High Stakes Tests, Achievement Tests, Mathematics Tests, English
Südkamp, Anna; Pohl, Steffi; Weinert, Sabine – Frontline Learning Research, 2015
Including students with special educational needs in learning (SEN-L) is a challenge for large-scale assessments. In order to draw inferences with respect to students with SEN-L and to compare their scores to students in general education, one needs to assure that the measurement model is reliable and that the same construct is measured for…
Descriptors: Disabilities, Special Education, Inclusion, Competence
Foorman, Barbara R.; Petscher, Yaacov; Schatschneider, Chris – Florida Center for Reading Research, 2015
The grades K-2 Florida Center for Reading Research (FCRR) Reading Assessment (FRA) consists of computer-adaptive alphabetic and oral language screening tasks that provide a Probability of Literacy Success (PLS) linked to grade-level performance (i.e., the 40th percentile) on the word reading (in kindergarten) or reading comprehension (in grades…
Descriptors: Reading Instruction, Reading Tests, Kindergarten, Grade 1
Foorman, Barbara R.; Petscher, Yaacov; Schatschneider, Chris – Florida Center for Reading Research, 2015
The Florida Center for Reading Research (FCRR) Reading Assessment (FRA) consists of computer-adaptive reading comprehension and oral language screening tasks that provide measures to track growth over time, as well as a Probability of Literacy Success (PLS) linked to grade-level performance (i.e., the 50th percentile) on the reading comprehension…
Descriptors: Elementary School Students, Middle School Students, High School Students, Written Language
Stephens, Christopher Neil – ProQuest LLC, 2012
Augmentation procedures are designed to provide better estimates for a given test or subtest through the use of collateral information. The main purpose of this dissertation was to use Haberman's and Wainer's augmentation procedures on a large-scale, standardized achievement test to understand the relationship between reliability and…
Descriptors: Psychometrics, Error of Measurement, Scores, Reliability
Lai, Cheng-Fei; Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Tindal, Gerald – Behavioral Research and Teaching, 2012
In this technical report, we present the results of a reliability study of the second-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Descriptors: Reading Comprehension, Testing Programs, Statistical Analysis, Elementary School Students

