ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	2
Since 2007 (last 20 years)	7

Descriptor

Probability	15
Statistical Analysis	15
Computation	5
Regression (Statistics)	5
Models	4
Test Items	4
Grade 8	3
Item Response Theory	3
National Competency Tests	3
Sampling	3
Scores	3
College Entrance Examinations	2
Comparative Analysis	2
Computer Assisted Testing	2
Correlation	2
Educational Assessment	2
Equated Scores	2
Error of Measurement	2
Essays	2
Gender Differences	2
Graduate Study	2
Inferences	2
Maximum Likelihood Statistics	2
Monte Carlo Methods	2
Reading Tests	2
More ▼

Source

ETS Research Report Series

Publication Type

Journal Articles	15
Reports - Research	15
Numerical/Quantitative Data	1
Tests/Questionnaires	1

Education Level

Higher Education	4
Postsecondary Education	4
Elementary Education	3
Grade 8	3
Junior High Schools	3
Middle Schools	3
Secondary Education	3
Grade 10	1
High Schools	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	3
Graduate Record Examinations	2
Praxis Series	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing all 15 results Save | Export

Variance Estimation with Complex Data and Finite Population Correction--A Paradigm for Comparing Jackknife and Formula-Based Methods for Variance Estimation. Research Report. ETS RR-20-11

Peer reviewed
PDF on ERIC

Download full text

Qian, Jiahe – ETS Research Report Series, 2020

The finite population correction (FPC) factor is often used to adjust variance estimators for survey data sampled from a finite population without replacement. As a replicated resampling approach, the jackknife approach is usually implemented without the FPC factor incorporated in its variance estimates. A paradigm is proposed to compare the…

Descriptors: Computation, Sampling, Data, Statistical Analysis

A Statistical Procedure for Testing Unusually Frequent Exactly Matching Responses and Nearly Matching Responses. Research Report. ETS RR-17-23

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J.; Lee, Yi-Hsuan – ETS Research Report Series, 2017

In investigations of unusual testing behavior, a common question is whether a specific pattern of responses occurs unusually often within a group of examinees. In many current tests, modern communication techniques can permit quite large numbers of examinees to share keys, or common response patterns, to the entire test. To address this issue,…

Descriptors: Student Evaluation, Testing, Item Response Theory, Maximum Likelihood Statistics

A General Program for Item-Response Analysis That Employs the Stabilized Newton-Raphson Algorithm. Research Report. ETS RR-13-32

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2013

A general program for item-response analysis is described that uses the stabilized Newton-Raphson algorithm. This program is written to be compliant with Fortran 2003 standards and is sufficiently general to handle independent variables, multidimensional ability parameters, and matrix sampling. The ability variables may be either polytomous or…

Descriptors: Predictor Variables, Mathematics, Item Response Theory, Probability

A Study of Frequency Estimation Equipercentile Equating When There Are Large Ability Differences. Research Report. ETS RR-09-45

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Oh, Hyeonjoo J. – ETS Research Report Series, 2009

In operational equating, frequency estimation (FE) equipercentile equating is often excluded from consideration when the old and new groups have a large ability difference. This convention may, in some instances, cause the exclusion of one competitive equating method from the set of methods under consideration. In this report, we study the…

Descriptors: Equated Scores, Computation, Statistical Analysis, Test Items

Outliers in Assessments. Research Report. ETS RR-08-41

Peer reviewed
PDF on ERIC

Download full text

Haberman, Shelby J. – ETS Research Report Series, 2008

Outliers in assessments are often treated as a nuisance for data analysis; however, they can also assist in quality assurance. Their frequency can suggest problems with form codes, scanning accuracy, ability of examinees to enter responses as they intend, or exposure of items.

Descriptors: Educational Assessment, Quality Assurance, Scores, Regression (Statistics)

Model-Based Weighting and Comparisons: Research Report. ETS RR-08-17

Peer reviewed
PDF on ERIC

Download full text

Qian, Jiahe – ETS Research Report Series, 2008

In survey research, sometimes the formation of groupings, or aggregations of cases on which to make an inference, are of importance. Of particular interest are the situations where the cases aggregated carry useful information that has been transferred from a sample employed in a previous study. For example, a school to be included in the sample…

Descriptors: Surveys, Models, High Schools, School Effectiveness

Fitting the Structured General Diagnostic Model to NAEP Data. Research Report. ETS RR-08-27

Peer reviewed
PDF on ERIC

Download full text

Xu, Xueli; von Davier, Matthias – ETS Research Report Series, 2008

Xu and von Davier (2006) demonstrated the feasibility of using the general diagnostic model (GDM) to analyze National Assessment of Educational Progress (NAEP) proficiency data. Their work showed that the GDM analysis not only led to conclusions for gender and race groups similar to those published in the NAEP Report Card, but also allowed…

Descriptors: National Competency Tests, Models, Data Analysis, Reading Tests

Locating the Structural Zeros for Internal Anchor Tests: Including the Case of Rounded Formula Scores. Research Report. ETS RR-05-22

Peer reviewed
PDF on ERIC

Download full text

Holland, Paul W. – ETS Research Report Series, 2005

There are test-equating situations in which it may be appropriate to fit a loglinear or other type of probability model to the joint distribution of a total score on a test and a score on part of that test. For anchor test designs, this situation arises for internal anchor tests, which are embedded within the total test. Similarly, a part-whole…

Descriptors: Test Items, Equated Scores, Probability, Statistical Analysis

Confidence Intervals for Proportion Estimates in Complex Samples. Research Report. ETS RR-06-21

Peer reviewed
PDF on ERIC

Download full text

Oranje, Andreas – ETS Research Report Series, 2006

Confidence intervals are an important tool to indicate uncertainty of estimates and to give an idea of probable values of an estimate if a different sample from the population was drawn or a different sample of measures was used. Standard symmetric confidence intervals for proportion estimates based on a normal approximation can yield bounds…

Descriptors: Computation, Statistical Analysis, National Competency Tests, Comparative Analysis

Bayesian Network Models for Local Dependence among Observable Outcome Variables. Research Report. ETS RR-06-36

Peer reviewed
PDF on ERIC

Download full text

Almond, Russell G.; Mulder, Joris; Hemat, Lisa A.; Yan, Duanli – ETS Research Report Series, 2006

Bayesian network models offer a large degree of flexibility for modeling dependence among observables (item outcome variables) from the same task that may be dependent. This paper explores four design patterns for modeling locally dependent observations from the same task: (1) No context--Ignore dependence among observables; (2) Compensatory…

Descriptors: Bayesian Statistics, Networks, Models, Design

Effects of Preexamination Disclosure of Essay Prompts for the GRE® Analytical Writing Assessment. ETS GRE® Board Research Report No. 01-07R. ETS RR-05-01

Peer reviewed
PDF on ERIC

Download full text

Powers, Donald E. – ETS Research Report Series, 2005

This study examined how the practice of prepublishing prompts used on the writing section of the Graduate Record Examinations® (GRE®) General Test impacts test-preparation behavior, test performance, test validity, and examinee perceptions of the value of prompt prepublication. Researchers imposed modest experimental control over how participants…

Descriptors: Essays, Prompting, Cues, Writing Tests

Weighting Procedures and the Cluster Forming Algorithm for Delete-k Jackknife Variance Estimation for Institutional Surveys. Research Report. ETS RR-06-15

Peer reviewed
PDF on ERIC

Download full text

Qian, Jiahe – ETS Research Report Series, 2006

Weighting and variance estimation are two statistical issues involved in survey data analysis for large-scale assessment programs such as the Higher Education Information and Communication Technology (ICT) Literacy Assessment. Because survey data are always acquired by probability sampling, to draw unbiased or almost unbiased inferences for the…

Descriptors: Weighted Scores, Sampling, Statistical Analysis, Higher Education

Investigating Differences in Examinee Performance between Computer-Based and Handwritten Essays. Research Report. ETS RR-04-18

Peer reviewed
PDF on ERIC

Download full text

Yu, Lei; Livingston, Samuel A.; Larkin, Kevin C.; Bonett, John – ETS Research Report Series, 2004

This study compared essay scores from paper-based and computer-based versions of a writing test for prospective teachers. Scores for essays in the paper-based version averaged nearly half a standard deviation higher than those in the computer-based version, after applying a statistical control for demographic differences between the groups of…

Descriptors: Essays, Writing (Composition), Computer Assisted Testing, Technology Uses in Education

Inside Sourcefinder: Predicting the Acceptability Status of Candidate Reading-Comprehension Source Documents. Research Report. ETS RR-06-24

Peer reviewed
PDF on ERIC

Download full text

Sheehan, Kathleen M.; Kostin, Irene; Futagi, Yoko; Hemat, Ramin; Zuckerman, Daniel – ETS Research Report Series, 2006

This paper describes the development, implementation, and evaluation of an automated system for predicting the acceptability status of candidate reading-comprehension stimuli extracted from a database of journal and magazine articles. The system uses a combination of classification and regression techniques to predict the probability that a given…

Descriptors: Automation, Prediction, Reading Comprehension, Classification

An Investigation of the Impact of Composition Medium on the Quality of TOEFL Writing Scores. TOEFL® Research Report. RR-72. ETS RR-04-29

Peer reviewed
PDF on ERIC

Download full text

Wolfe, Edward W.; Manalo, Jonathan R. – ETS Research Report Series, 2005

This study examined scores from 133,906 operationally scored Test of English as a Foreign Language™ (TOEFL®) essays to determine whether the choice of composition medium has any impact on score quality for subgroups of test-takers. Results of analyses demonstrate that (a) scores assigned to word-processed essays are slightly more reliable than…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores

Haberman, Shelby J.	3
Qian, Jiahe	3
Almond, Russell G.	1
Bonett, John	1
Futagi, Yoko	1
Guo, Hongwen	1
Hemat, Lisa A.	1
Hemat, Ramin	1
Holland, Paul W.	1
Kostin, Irene	1
Larkin, Kevin C.	1
Lee, Yi-Hsuan	1
Livingston, Samuel A.	1
Manalo, Jonathan R.	1
Mulder, Joris	1
Oh, Hyeonjoo J.	1
Oranje, Andreas	1
Powers, Donald E.	1
Sheehan, Kathleen M.	1
Wolfe, Edward W.	1
Xu, Xueli	1
Yan, Duanli	1
Yu, Lei	1
Zuckerman, Daniel	1
von Davier, Matthias	1
More ▼