Kim, Sooyeon; Robin, Frederic – ETS Research Report Series, 2017
In this study, we examined the potential impact of item misfit on the reported scores of an admission test from the subpopulation invariance perspective. The target population of the test consisted of 3 major subgroups from different geographic regions. We used the logistic regression function to estimate item parameters of the operational items…
Descriptors: Scores, Test Items, Test Bias, International Assessment
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick; Schmitt, Neal – ETS Research Report Series, 2016
In this report, systematic applications of statistical and psychometric methods are used to develop and evaluate scoring rules in terms of test reliability. Data collected from a situational judgment test are used to facilitate the comparison. For a well-developed item with appropriate keys (i.e., the correct answers), agreement among various…
Descriptors: Scoring, Test Reliability, Statistical Analysis, Psychometrics
Frankel, Lois; Brownstein, Beth; Soiffer, Neil; Hansen, Eric – ETS Research Report Series, 2016
The work described in this report is the first phase of a project to provide easy-to-use tools for authoring and rendering secondary-school algebra-level math expressions in synthesized speech that is useful for students with blindness or low vision. This report describes the initial development, software implementation, and evaluation of the…
Descriptors: Algebra, Automation, Secondary School Mathematics, Artificial Speech
Frankel, Lois; Brownstein, Beth; Soiffer, Neil – ETS Research Report Series, 2017
This report describes the pilot conducted in the final phase of a project, Expanding Audio Access to Mathematics Expressions by Students With Visual Impairments via MathML, to provide easy-to-use tools for authoring and rendering secondary-school algebra-level math expressions in synthesized speech that is useful for students with blindness or low…
Descriptors: Visual Impairments, Research Reports, Audiovisual Aids, Assistive Technology
Qian, Jiahe; Jiang, Yanming; von Davier, Alina A. – ETS Research Report Series, 2013
Several factors could cause variability in item response theory (IRT) linking and equating procedures, such as the variability across examinee samples and/or test items, seasonality, regional differences, native language diversity, gender, and other demographic variables. Hence, the following question arises: Is it possible to select optimal…
Descriptors: Item Response Theory, Test Items, Sampling, True Scores
Chen, Jing; Sheehan, Kathleen M. – ETS Research Report Series, 2015
The TOEFL® family of assessments includes the TOEFL® Primary™, TOEFL Junior®, and TOEFL iBT® tests. The linguistic complexity of stimulus passages in the reading sections of the TOEFL family of assessments is expected to differ across the test levels. This study evaluates the linguistic…
Descriptors: Language Tests, Second Language Learning, English (Second Language), Reading Comprehension
Ali, Usama S.; Chang, Hua-Hua – ETS Research Report Series, 2014
Adaptive testing is advantageous in that it provides more efficient ability estimates with fewer items than linear testing does. Item-driven adaptive pretesting may also offer similar advantages, and verification of such a hypothesis about item calibration was the main objective of this study. A suitability index (SI) was introduced to adaptively…
Descriptors: Adaptive Testing, Simulation, Pretests Posttests, Test Items
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013
The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…
Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation
Barkaoui, Khaled – ETS Research Report Series, 2015
This study aimed to describe the writing activities that test takers engage in when responding to the writing tasks in the TOEFL iBT® test and to examine the effects of task type and test-taker English language proficiency (ELP) and keyboarding skills on the frequency and distribution of these activities. Each of 22 test…
Descriptors: Second Language Learning, Language Tests, English (Second Language), Writing Instruction
Ling, Guangming – ETS Research Report Series, 2012
To assess the value of individual students' subscores on the Major Field Test in Business (MFT Business), I examined the test's internal structure with factor analysis and structural equation model methods, and analyzed the subscore reliabilities using the augmented scores method. Analyses of the internal structure suggested that the MFT Business…
Descriptors: Factor Analysis, Construct Validity, Structural Equation Models, Correlation
Burrus, Jeremy; Jackson, Teresa; Holtzman, Steven; Roberts, Richard D.; Mandigo, Terri – ETS Research Report Series, 2013
The current paper reports the results of 2 quasiexperimental studies conducted to examine the efficacy of a new time management intervention designed for high school students. In both studies, there was no difference between the treatment and control groups in improvement in self-reported time management skills as a result of the intervention.…
Descriptors: Time Management, Intervention, High School Students, Quasiexperimental Design
Paek, Insu – ETS Research Report Series, 2009
Three well-known statistical testing procedures in the maximum likelihood approach are the Wald, likelihood ratio (LR), and score tests. Although these procedures are well known, their application in the logistic regression method for investigating differential item functioning (DIF) has not yet been carried out rigorously. Employing a variety of…
Descriptors: Test Bias, Statistical Analysis, Regression (Statistics), Maximum Likelihood Statistics
Lee, Yi-Hsuan; von Davier, Alina A. – ETS Research Report Series, 2008
The kernel equating method (von Davier, Holland, & Thayer, 2004) is based on a flexible family of equipercentile-like equating functions that use a Gaussian kernel to continuize the discrete score distributions. While the classical equipercentile, or percentile-rank, equating method carries out the continuization step by linear interpolation,…
Descriptors: Equated Scores, Comparative Analysis, Methods, Accuracy
Young, John W.; Morgan, Rick; Rybinski, Paul; Steinberg, Jonathan; Wang, Yuan – ETS Research Report Series, 2013
The TOEFL Junior® Standard Test is an assessment that measures the degree to which middle school-aged students learning English as a second language have attained proficiency in the academic and social English skills representative of English-medium instructional environments. The assessment measures skills in three areas: listening…
Descriptors: Item Response Theory, Test Items, Language Tests, Second Language Learning
Puhan, Gautam; Sinharay, Sandip; Haberman, Shelby; Larkin, Kevin – ETS Research Report Series, 2008
Will reporting subscores provide any information beyond the total score? Is there a method that can be used to provide more trustworthy subscores than observed subscores? These 2 questions are addressed in this study. To answer the 2nd question, 2 subscore estimation methods (i.e., subscore estimated from the observed total score or…
Descriptors: Comparative Analysis, Scores, Tests, Certification

