ERIC - Search Results

Publication Date

In 2026	0
Since 2025	59
Since 2022 (last 5 years)	416
Since 2017 (last 10 years)	919
Since 2007 (last 20 years)	1970

Descriptor

Error of Measurement	3316
Statistical Analysis	602
Scores	511
Item Response Theory	449
Correlation	434
Comparative Analysis	424
Foreign Countries	418
Test Reliability	412
Computation	407
Simulation	370
Reliability	357
Sample Size	355
Models	353
Evaluation Methods	350
Test Items	349
Measurement Techniques	318
Factor Analysis	311
Sampling	301
Statistical Bias	300
Research Methodology	288
Goodness of Fit	260
Psychometrics	260
Monte Carlo Methods	259
Regression (Statistics)	246
Mathematical Models	241

Author

Raykov, Tenko	23
Brennan, Robert L.	19
Kolen, Michael J.	19
Lord, Frederic M.	17
Thompson, Bruce	16
Zimmerman, Donald W.	16
Lee, Won-Chan	15
Livingston, Samuel A.	14
McCaffrey, Daniel F.	14
Yuan, Ke-Hai	14
van der Linden, Wim J.	14
Cai, Li	13
Moses, Tim	13
Beretvas, S. Natasha	12
Marsh, Herbert W.	12
Zwick, Rebecca	12
Algina, James	11
Ferron, John M.	11
Lee, Guemin	11
Lockwood, J. R.	11
Marcoulides, George A.	11
Reardon, Sean F.	11
DeMars, Christine E.	10
Henson, Robin K.	10

Education Level

Higher Education	271
Secondary Education	201
Postsecondary Education	197
Elementary Education	194
Elementary Secondary Education	126
Middle Schools	98
High Schools	82
Junior High Schools	78
Early Childhood Education	61
Grade 4	48
Intermediate Grades	44
Primary Education	42
Grade 8	40
Grade 3	39
Grade 5	39
Grade 7	33
Kindergarten	24
Adult Education	23
Grade 6	19
Grade 2	17
Preschool Education	16
Grade 1	15
Grade 10	12
Grade 9	12
Two Year Colleges	6

Audience

Researchers	93
Practitioners	23
Teachers	22
Policymakers	10
Administrators	5
Students	4
Counselors	2
Parents	2
Community	1

Location

United States	47
Germany	42
Australia	34
Canada	27
Turkey	27
California	22
United Kingdom (England)	20
Netherlands	18
China	17
New York	15
United Kingdom	15
North Carolina	14
Texas	14
Italy	12
South Korea	12
Florida	11
Indonesia	11
New Zealand	11
Pennsylvania	11
Spain	11
Japan	10
Taiwan	10
Iran	9
Norway	9
Portugal	9

Laws, Policies, & Programs

No Child Left Behind Act 2001	11
Race to the Top	6
Elementary and Secondary…	4
Aid to Families with…	1
Elementary and Secondary…	1
Every Student Succeeds Act…	1
Family Educational Rights and…	1
Guaranteed Student Loan…	1
Head Start	1
Individuals with Disabilities…	1
Job Training Partnership Act…	1
Strengthening Career and…	1

What Works Clearinghouse Rating

Does not meet standards

Showing 1,486 to 1,500 of 3,316 results

Assessing First- and Second-Order Equity for the Common-Item Nonequivalent Groups Design Using Multidimensional IRT

Andrews, Benjamin James – ProQuest LLC, 2011

The equity properties can be used to assess the quality of an equating. The degree to which expected scores conditional on ability are similar between test forms is referred to as first-order equity. Second-order equity is the degree to which conditional standard errors of measurement are similar between test forms after equating. The purpose of…

Descriptors: Test Format, Advanced Placement, Simulation, True Scores

The Public Understanding of Error in Educational Assessment

Peer reviewed

Gardner, John – Oxford Review of Education, 2013

Evidence from recent research suggests that in the UK the public perception of errors in national examinations is that they are simply mistakes; events that are preventable. This perception predominates over the more sophisticated technical view that errors arise from many sources and create an inevitable variability in assessment outcomes. The…

Descriptors: Educational Assessment, Public Opinion, Error of Measurement, Foreign Countries

Single- versus Double-Scoring of Trend Responses in Trend Score Equating with Constructed-Response Tests. Research Report. ETS RR-10-12

Tan, Xuan; Ricker, Kathryn L.; Puhan, Gautam – Educational Testing Service, 2010

This study examines the differences in equating outcomes between two trend score equating designs resulting from two different scoring strategies for trend scoring when operational constructed-response (CR) items are double-scored--the single group (SG) design, where each trend CR item is double-scored, and the nonequivalent groups with anchor…

Descriptors: Equated Scores, Scoring, Responses, Test Items

Generalizability Theory: Measuring the Dependability of Selected Methods for Scoring Classroom Assessments

Lengh, Carolyn J. – ProQuest LLC, 2010

This study compares the dependability of four classroom assessment scoring methods. Generalizability theory (G) and alternative decision (D) are used to measure the results of students' classroom assessment scores and compare the results of the four scoring methods on variability of rater by person variance and the level of G and D coefficients…

Descriptors: Generalizability Theory, Scoring, Social Studies, Tests

Using State-Space Model with Regime Switching to Represent the Dynamics of Facial Electromyography (EMG) Data

Peer reviewed

Yang, Manshu; Chow, Sy-Miin – Psychometrika, 2010

Facial electromyography (EMG) is a useful physiological measure for detecting subtle affective changes in real time. A time series of EMG data contains bursts of electrical activity that increase in magnitude when the pertinent facial muscles are activated. Whereas previous methods for detecting EMG activation are often based on deterministic or…

Descriptors: Test Bias, Error of Measurement, Human Body, Diagnostic Tests

Alternative Matching Scores to Control Type I Error of the Mantel-Haenszel Procedure for DIF in Dichotomously Scored Items Conforming to 3PL IRT and Nonparametric 4PBCB Models

Peer reviewed

Monahan, Patrick O.; Ankenmann, Robert D. – Applied Psychological Measurement, 2010

When the matching score is either less than perfectly reliable or not a sufficient statistic for determining latent proficiency in data conforming to item response theory (IRT) models, Type I error (TIE) inflation may occur for the Mantel-Haenszel (MH) procedure or any differential item functioning (DIF) procedure that matches on summed-item…

Descriptors: Error of Measurement, Item Response Theory, Test Bias, Scores

Effects of Ventilation on Segmental Bioimpedance Spectroscopy Measures Using Generalizability Theory

Peer reviewed

Turner, A. Allan; Lozano-Nieto, Albert; Bouffard, Marcel – Measurement in Physical Education and Exercise Science, 2010

The purpose of this study was to examine the effect of three ventilation conditions (i.e., normal, regimented, and no-ventilation) on the reproducibility of bioimpedance scores in humans for the forearm and trunk segments. One hundred able-bodied North American men and women, from 18 to 71 years of age, volunteered as participants. The…

Descriptors: Ventilation, Generalizability Theory, Spectroscopy, Scores

Reliability of Decision-Making Frameworks for Response to Intervention for Reading

Peer reviewed

Burns, Matthew K.; Scholin, Sarah E.; Kosciolek, Stacey; Livingston, Judy – Journal of Psychoeducational Assessment, 2010

The current study examines the consistency of two response-to-intervention (RTI) decision-making models. Weekly progress monitoring data for 30 students participating in a Tier II intervention were collected for 30 weeks. The data were examined by comparing them to an aimline with a yearly goal and by computing a dual discrepancy (DD) using…

Descriptors: Reading Achievement, Reading Tests, Data Collection, Responses

Structural Equation Models of Latent Interactions: An Appropriate Standardized Solution and Its Scale-Free Properties

Peer reviewed

Wen, Zhonglin; Marsh, Herbert W.; Hau, Kit-Tai – Structural Equation Modeling: A Multidisciplinary Journal, 2010

Standardized parameter estimates are routinely used to summarize the results of multiple regression models of manifest variables and structural equation models of latent variables, because they facilitate interpretation. Although the typical standardization of interaction terms is not appropriate for multiple regression models, straightforward…

Descriptors: Structural Equation Models, Multiple Regression Analysis, Interaction, Computation

The Effect of Image Quality Training on Reading Comprehension of EFL Students Using the Keyword Method

Peer reviewed

Wang, Lihui; Lawson, Michael J.; Curtis, David D. – Language Teaching Research, 2015

Imagery training has been shown to improve reading comprehension. Recent research has also shown that the quality of visual mental imagery used is important for reading comprehension. A review of literature shows that there has been relatively little detailed research on the quality of imagery used by learners, especially in the case of students…

Descriptors: Educational Quality, Teaching Methods, English (Second Language), Second Language Learning

Improving Creativity Performance Assessment: A Rater Effect Examination with Many Facet Rasch Model

Peer reviewed

Hung, Su-Pin; Chen, Po-Hsi; Chen, Hsueh-Chih – Creativity Research Journal, 2012

Product assessment is widely applied in creative studies, typically as an important dependent measure. Within this context, this study had 2 purposes. First, the focus of this research was on methods for investigating possible rater effects, an issue that has not received a great deal of attention in past creativity studies. Second, the…

Descriptors: Item Response Theory, Creativity, Interrater Reliability, Undergraduate Students

A Survey Data Quality Strategy: The Institutional Research Perspective. IR Applications, Volume 34

Liu, Qin – Association for Institutional Research, 2012

This discussion constructs a survey data quality strategy for institutional researchers in higher education in light of total survey error theory. It starts with describing the characteristics of institutional research and identifying the gaps in literature regarding survey data quality issues in institutional research and then introduces the…

Descriptors: Institutional Research, Higher Education, Quality Control, Researchers

Efficient Estimation of the Standardized Value

Peer reviewed

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2009

We derive an estimator of the standardized value which, under the standard assumptions of normality and homoscedasticity, is more efficient than the established (asymptotically efficient) estimator and discuss its gains for small samples. (Contains 1 table and 3 figures.)

Descriptors: Efficiency, Computation, Statistics, Sample Size

On the Use, the Misuse, and the Very Limited Usefulness of Cronbach's Alpha

Peer reviewed

Sijtsma, Klaas – Psychometrika, 2009

This discussion paper argues that both the use of Cronbach's alpha as a reliability estimate and as a measure of internal consistency suffer from major problems. First, alpha always has a value, which cannot be equal to the test score's reliability given the inter-item covariance matrix and the usual assumptions about measurement error. Second, in…

Descriptors: Measurement, Error of Measurement, Scores, Computation

Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

Peer reviewed

Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011

This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores

Source

Educational and Psychological…	259
Journal of Educational…	117
ProQuest LLC	95
Applied Psychological…	85
Journal of Educational and…	85
Psychometrika	82
Structural Equation Modeling:…	76
Grantee Submission	71
Journal of Experimental…	70
ETS Research Report Series	59
Multivariate Behavioral…	54
Applied Measurement in…	50
Sociological Methods &…	47
Journal of Psychoeducational…	38
Psychological Methods	33
Society for Research on…	33
Educational Measurement:…	32
Research Synthesis Methods	32
Online Submission	29
Practical Assessment,…	27
International Journal of…	26
Journal of Educational…	26
National Center for Education…	25
Psychology in the Schools	25
International Journal of…	23

Publication Type

Journal Articles	2363
Reports - Research	1909
Reports - Evaluative	704
Reports - Descriptive	344
Speeches/Meeting Papers	329
Dissertations/Theses -…	95
Numerical/Quantitative Data	86
Opinion Papers	77
Information Analyses	72
Tests/Questionnaires	47
Guides - Non-Classroom	27
Guides - Classroom - Teacher	12
Book/Product Reviews	10
Reports - General	9
ERIC Publications	8
ERIC Digests in Full Text	7
Guides - General	7
Books	6
Guides - Classroom - Learner	4
Collected Works - General	3
Legal/Legislative/Regulatory…	3
Historical Materials	2
Collected Works - Proceedings	1
Collected Works - Serial	1
Collected Works - Serials	1

Assessments and Surveys

Program for International…	45
National Assessment of…	40
SAT (College Admission Test)	24
Trends in International…	24
ACT Assessment	20
Wechsler Intelligence Scale…	20
Early Childhood Longitudinal…	19
Wechsler Adult Intelligence…	12
Iowa Tests of Basic Skills	10
Schools and Staffing Survey…	10
Test of English as a Foreign…	9
Child Behavior Checklist	7
Graduate Record Examinations	7
National Longitudinal Survey…	7
Progress in International…	7
Beck Depression Inventory	6
Advanced Placement…	5
Armed Services Vocational…	5
Cognitive Abilities Test	5
Longitudinal Surveys of…	5
National Household Education…	5
Rosenberg Self Esteem Scale	5
Dynamic Indicators of Basic…	4
Law School Admission Test	4
Motivated Strategies for…	4