Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 18 |
Descriptor
| Data Analysis | 10 |
| Data Collection | 7 |
| Equated Scores | 6 |
| Item Response Theory | 6 |
| Statistical Analysis | 6 |
| Scores | 5 |
| Comparative Analysis | 4 |
| Data | 4 |
| Computation | 3 |
| English (Second Language) | 3 |
| Methods | 3 |
Source
| Educational Testing Service | 18 |
Author
| Moses, Tim | 3 |
| Nguyen, Bach Mai Dolly | 2 |
| Sinharay, Sandip | 2 |
| Xu, Xueli | 2 |
| von Davier, Matthias | 2 |
| Barton, Paul E. | 1 |
| Carstensen, Claus H. | 1 |
| Deng, Weiling | 1 |
| Dorans, Neil | 1 |
| Dorans, Neil J. | 1 |
| Eignor, Daniel R. | 1 |
Publication Type
| Reports - Research | 7 |
| Numerical/Quantitative Data | 5 |
| Reports - Descriptive | 5 |
| Reports - Evaluative | 4 |
| Guides - Classroom - Learner | 1 |
| Information Analyses | 1 |
| Tests/Questionnaires | 1 |
Education Level
| Elementary Secondary Education | 4 |
| Higher Education | 3 |
| Postsecondary Education | 2 |
| Secondary Education | 2 |
| Grade 10 | 1 |
| Grade 9 | 1 |
| High Schools | 1 |
Audience
| Policymakers | 1 |
Location
| California | 1 |
| Germany | 1 |
| Guam | 1 |
| Hawaii | 1 |
| Washington | 1 |
Assessments and Surveys
| Program for International… | 2 |
| Test of English as a Foreign… | 1 |
Nguyen, Bach Mai Dolly; Nguyen, Mike Hoa; Teranishi, Robert T.; Hune, Shirley – Educational Testing Service, 2015
Utilizing disaggregated data from the Office of the Superintendent of Public Instruction (OSPI) and the Educational Research Data Center (ERDC), this report offers a deeper and more nuanced perspective on the educational realities of Asian American and Pacific Islander (AAPI) students and reinforces the need for disaggregated data to unmask the…
Descriptors: Asian American Students, Pacific Islanders, Student Needs, Educational Opportunities
Teranishi, Robert; Lok, Libby; Nguyen, Bach Mai Dolly – Educational Testing Service, 2013
In 2013, the National Commission on Asian American and Pacific Islander Research in Education (CARE) and the White House Initiative on Asian Americans and Pacific Islanders (WHIAAPI)--with support from ETS and Asian Americans and Pacific Islanders in Philanthropy (AAPIP)--began an Asian American and Pacific Islander (AAPI) data quality campaign.…
Descriptors: College Students, Asian American Students, Pacific Islanders, Consciousness Raising
Dorans, Neil J.; Moses, Tim P.; Eignor, Daniel R. – Educational Testing Service, 2010
Score equating is essential for any testing program that continually produces new editions of a test and for which the expectation is that scores from these editions have the same meaning over time. Particularly in testing programs that help make high-stakes decisions, it is extremely important that test equating be done carefully and accurately.…
Descriptors: Equated Scores, Methods, Data Collection, Data Processing
Livingston, Samuel A. – Educational Testing Service, 2014
This booklet grew out of a half-day class on equating that author Samuel Livingston teaches for new statistical staff at Educational Testing Service (ETS). The class is a nonmathematical introduction to the topic, emphasizing conceptual understanding and practical applications. The class consists of illustrated lectures, interspersed with…
Descriptors: Equated Scores, Scoring, Self Evaluation (Individuals), Scores
Moses, Tim; Liu, Jinghua – Educational Testing Service, 2011
In equating research and practice, equating functions that are smooth are typically assumed to be more accurate than equating functions with irregularities. This assumption presumes that population test score distributions are relatively smooth. In this study, two examples were used to reconsider common beliefs about smoothing and equating. The…
Descriptors: Equated Scores, Data Analysis, Scores, Methods
Moses, Tim; Miao, Jing; Dorans, Neil – Educational Testing Service, 2010
This study compared the accuracies of four differential item functioning (DIF) estimation methods, where each method makes use of only one of the following: raw data, logistic regression, loglinear models, or kernel smoothing. The major focus was on the estimation strategies' potential for estimating score-level, conditional DIF. A secondary focus…
Descriptors: Test Bias, Statistical Analysis, Computation, Scores
Puhan, Gautam – Educational Testing Service, 2011
The study evaluated the effectiveness of log-linear presmoothing (Holland & Thayer, 1987) on the accuracy of small sample chained equipercentile equatings under two conditions (i.e., using small samples that differed randomly in ability from the target population versus using small samples that were distinctly different from the…
Descriptors: Equated Scores, Data Analysis, Accuracy, Sample Size
Moses, Tim; Deng, Weiling; Zhang, Yu-Li – Educational Testing Service, 2010
In the equating literature, a recurring concern is that equating functions that utilize a single anchor to account for examinee groups' nonequivalence are biased when the groups are extremely different and/or when the anchor only weakly measures what the tests measure. Several proposals have been made to address this equating bias by incorporating…
Descriptors: Equated Scores, Data Collection, Statistical Analysis, Differences
Sinharay, Sandip – Educational Testing Service, 2010
Recently, there has been an increasing level of interest in subscores for their potential diagnostic value. Haberman (2008) suggested a method based on classical test theory to determine whether subscores have added value over total scores. This paper provides a literature review and reports when subscores were found to have added value for…
Descriptors: Scores, Correlation, Reliability, Item Response Theory
Underwood, Jody S.; Zapata-Rivera, Diego; VanWinkle, Waverely – Educational Testing Service, 2010
District-level policymakers are challenged to use evidence of student achievement to make policy decisions, such as professional development and other school improvement plans. They currently receive reports of student achievement data that are complex, difficult to read, and even harder to interpret. Using the research literature on policymakers'…
Descriptors: Evidence, Data, Educational Assessment, Academic Achievement
Rose, Norman; von Davier, Matthias; Xu, Xueli – Educational Testing Service, 2010
Large-scale educational surveys are low-stakes assessments of educational outcomes conducted using nationally representative samples. In these surveys, students do not receive individual scores, and the outcome of the assessment is inconsequential for respondents. The low-stakes nature of these surveys, as well as variations in average performance…
Descriptors: Item Response Theory, Educational Assessment, Data Analysis, Case Studies
Rijmen, Frank – Educational Testing Service, 2010
As is the case for any statistical model, a multidimensional latent growth model comes with certain requirements with respect to the data collection design. In order to measure growth, repeated measurements of the same set of individuals are required. Furthermore, the data collection design should be specified such that no individual is given the…
Descriptors: Tests, Statistical Analysis, Models, Measurement
Barton, Paul E. – Educational Testing Service, 2009
This Policy Information Perspective provides a status report on efforts to measure the high school graduation rate more accurately and use it more constructively. The first section of the report discusses obtaining reliable national survey data on the graduation status of young adults and of the population as a whole. The second section makes the…
Descriptors: Graduation Rate, Young Adults, Accountability, Longitudinal Studies
von Davier, Matthias; Xu, Xueli; Carstensen, Claus H. – Educational Testing Service, 2009
A general diagnostic model was used to specify and compare two multidimensional item-response-theory (MIRT) models for longitudinal data: (a) a model that handles repeated measurements as multiple, correlated variables over time (Andersen, 1985) and (b) a model that assumes one common variable over time and additional orthogonal variables that…
Descriptors: Models, Item Response Theory, Longitudinal Studies, Measurement
Educational Testing Service, 2008
The Test of English as a Foreign Language™, better known as TOEFL®, is designed to measure the English-language proficiency of people whose native language is not English. TOEFL scores are accepted by more than 6,000 colleges, universities, and licensing agencies in 130 countries. The test is also used by governments, and scholarship and…
Descriptors: English (Second Language), Language Proficiency, Language Tests, Computer Assisted Testing

