Publication Date
| In 2026 | 0 |
| Since 2025 | 59 |
| Since 2022 (last 5 years) | 416 |
| Since 2017 (last 10 years) | 919 |
| Since 2007 (last 20 years) | 1970 |
Audience
| Researchers | 93 |
| Practitioners | 23 |
| Teachers | 22 |
| Policymakers | 10 |
| Administrators | 5 |
| Students | 4 |
| Counselors | 2 |
| Parents | 2 |
| Community | 1 |
Location
| United States | 47 |
| Germany | 42 |
| Australia | 34 |
| Canada | 27 |
| Turkey | 27 |
| California | 22 |
| United Kingdom (England) | 20 |
| Netherlands | 18 |
| China | 17 |
| New York | 15 |
| United Kingdom | 15 |
What Works Clearinghouse Rating
| Does not meet standards | 1 |
Press, S. James; Tanur, Judith M. – Evaluation Review, 1991 (peer reviewed)
Relevance of the intersection of sociology, statistics, and public policy to the study of quality control in three family assistance programs--food stamps, Aid to Families with Dependent Children (AFDC), and Medicaid--is reviewed using a study by the National Academy of Sciences of methods for improving quality control systems. (SLD)
Descriptors: Error of Measurement, Estimation (Mathematics), Federal Aid, Federal Programs
Abedi, Jamal – Teachers College Record, 2006
Assessments in English that are constructed for native English speakers may not provide valid inferences about the achievement of English language learners (ELLs). The linguistic complexity of the test items that are not related to the content of the assessment may increase the measurement error, thus reducing the reliability of the assessment.…
Descriptors: Second Language Learning, Test Items, Psychometrics, Inferences
Lilley, M.; Barker, T.; Britton, C. – Computers and Education, 2004
This paper presents ongoing research at the University of Hertfordshire on the use of computer-adaptive tests (CATs) in Higher Education. A software prototype based on Item Response Theory has been developed and is described here. This application was designed to estimate the level of proficiency in English for those students whose first language…
Descriptors: Foreign Countries, Adaptive Testing, Computer Assisted Testing, Computer Software Evaluation
Kane, Thomas J.; Staiger, Douglas O. – Brookings Papers on Education Policy, 2002
By the spring of 2000, forty states had begun using student test scores to rate school performance. Twenty states have gone a step further and are attaching explicit monetary rewards or sanctions to a school's test performance. In this paper, the authors focus on accountability programs in which states measure the effectiveness of individual…
Descriptors: Elementary Schools, Accountability, Scores, Risk
Solano-Flores, Guillermo; Li, Min – Educational Measurement: Issues and Practice, 2006
We contend that generalizability (G) theory allows the design of psychometric approaches to testing English-language learners (ELLs) that are consistent with current thinking in linguistics. We used G theory to estimate the amount of measurement error due to code (language or dialect). Fourth- and fifth-grade ELLs, native speakers of…
Descriptors: Foreign Countries, Grade 4, Grade 5, English (Second Language)
Gorsuch, Greta – CALICO Journal, 2004
In this study, retrospective interviews were used to investigate reliability (and thus validity) threats to a computerized ESL listening comprehension test administered at a university in the US. The participants in the investigation, six international graduate students, were asked to respond to semi- and open-ended questions during individual…
Descriptors: Graduate Students, Listening Comprehension, Investigations, Listening Comprehension Tests
Lee, Sik-Yum; Xia, Ye-Mao – Psychometrika, 2006
By means of more than a dozen user friendly packages, structural equation models (SEMs) are widely used in behavioral, education, social, and psychological research. As the underlying theory and methods in these packages are vulnerable to outliers and distributions with longer-than-normal tails, a fundamental problem in the field is the…
Descriptors: Maximum Likelihood Statistics, Statistical Distributions, Structural Equation Models, Robustness (Statistics)
New Mexico Public Education Department, 2007
The purpose of the NMSBA technical report is to provide users and other interested parties with a general overview of and technical characteristics of the 2007 NMSBA. The 2007 technical report contains the following information: (1) Test development; (2) Scoring procedures; (3) Summary of student performance; (4) Statistical analyses of item and…
Descriptors: Interrater Reliability, Standard Setting, Measures (Individuals), Scoring
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources
Takalkar, Pradnya; And Others – 1993
This study compared 4,594 student responses from three different surveys of incoming students at the University of South Florida (USF) with data from Florida's State University System (SUS) admissions files to determine what proportion of error occurs in the survey responses. Specifically, the study investigated the amount of measurement error in…
Descriptors: College Admission, College Applicants, College Bound Students, Comparative Analysis
Spencer, Bruce D. – 1986
The National Assessment of Educational Progress (NAEP) currently tests seventeen-year-old students enrolled in public and private secondary schools, but it does not test "out-of-school" seventeen-year-olds who have either graduated or dropped out. Estimating that one of five seventeen-year-olds is out of school, the interpretability of…
Descriptors: Adolescents, Cohort Analysis, Dropouts, Educational Assessment
Angoff, William H.; Cowell, William R. – 1985
Linear and equipercentile equating conversions were developed for two forms of the Graduate Record Examinations (GRE) quantitative test and the verbal-plus-quantitative test. From a very large sample of students taking the GRE in October 1981, subpopulations were selected with respect to race, sex, field of study, and level of performance (defined…
Descriptors: Aptitude Tests, College Entrance Examinations, Equated Scores, Error of Measurement
Rodgers, Willard L.; Bachman, Jerald G. – 1986
This paper explores various procedures of panel data in the estimation of causal models. The reported analyses are from the Monitoring the Future study, a nationwide questionnaire survey of 16,000 to 17,000 high school seniors conducted annually since 1975. First, the parameters of causal models are estimated in which the dependent variables are…
Descriptors: Attitude Measures, Attribution Theory, Comparative Analysis, Drug Use
Cook, Linda L.; Petersen, Nancy S. – 1986
This paper examines how various equating methods are affected by: (1) sampling error; (2) sample characteristics; and (3) characteristics of anchor test items. It reviews empirical studies that investigated the invariance of equating transformations, and it discusses empirical and simulation studies that focus on how the properties of anchor tests…
Descriptors: Educational Research, Equated Scores, Error of Measurement, Evaluation Methods
Hummel, Thomas J.; Johnston, Charles B. – 1986
This study investigated seven methods for analyzing multivariate group differences. Bonferroni t statistics, multivariate analysis of variance (MANOVA) followed by analysis of variance (ANOVA), and five other methods were studied using Monte Carlo methods. Methods were compared with respect to (1) experimentwise error rate; (2) power; (3) number…
Descriptors: Analysis of Variance, Comparative Analysis, Correlation, Differences

