NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20260
Since 20250
Since 2022 (last 5 years)0
Since 2017 (last 10 years)0
Since 2007 (last 20 years)4
Audience
Researchers2
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 17 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Guo, Hongwen; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2011
Nonparametric or kernel regression estimation of item response curves (IRCs) is often used in item analysis in testing programs. These estimates are biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. Accuracy of this estimation is a concern theoretically and operationally.…
Descriptors: Testing Programs, Measurement, Item Analysis, Error of Measurement
New York State Education Department, 2015
This technical report provides an overview of the New York State Alternate Assessment (NYSAA), including a description of the purpose of the NYSAA, the processes utilized to develop and implement the NYSAA program, and Stakeholder involvement in those processes. By comparing the intent of the NYSAA with its process and design, the validity of the…
Descriptors: Alternative Assessment, Grade 3, Grade 4, Grade 5
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Moses, Tim; Liu, Jinghua; Tan, Adele; Deng, Weiling; Dorans, Neil J. – ETS Research Report Series, 2013
In this study, differential item functioning (DIF) methods utilizing 14 different matching variables were applied to assess DIF in the constructed-response (CR) items from 6 forms of 3 mixed-format tests. Results suggested that the methods might produce distinct patterns of DIF results for different tests and testing programs, in that the DIF…
Descriptors: Test Construction, Multiple Choice Tests, Test Items, Item Analysis
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Klein, Stephen P.; Bolus, Roger – 1983
A solution to reduce the likelihood of one examinee copying another's answers on large scale tests that require all examinees to answer the same set of questions is to use multiple test forms that differ in terms of item ordering. This study was conducted to determine whether varying the sequence in which blocks of items were presented to…
Descriptors: Adults, Cheating, Cost Effectiveness, Item Analysis
Grosswald, Jules – 1975
Much of the intrinsic wealth of planning and instructional information available from achievement testing programs goes untapped in typical reporting procedures. Large-scale programs reporting only pupil scores and the results of aggregating those scores stop far short of the purposes intended and fail to realize the potential of such information.…
Descriptors: Achievement Tests, Data Analysis, Decision Making, Evaluation Methods
New Mexico State Dept. of Education, Santa Fe. Evaluation, Assessment, and Testing Unit. – 1974
This survey of the standardized testing program summarizes the data accumulated from the most recent administration of selected instruments in October 1973. It compares these findings with information from previous years and points to a few trends and possible conclusions. Assessment of mental abilities--1973-74 is presented for grade 1, and…
Descriptors: Academic Achievement, Comparative Analysis, Ethnic Groups, Grade 1
McLean, Les – 1985
Data gathered from large-scale assessments in the Ontario Assessment Instrument Pool (OAIP) are examined. Implications for science instruction are to be found at the item level; the items should not involve more than two or three steps if the responses are to be informative. Items are collected and linked to provincial curriculum guidelines. These…
Descriptors: College Bound Students, Instructional Improvement, Item Analysis, Item Banks
New Jersey State Dept. of Education, Trenton. – 1986
This Technical Report provides descriptions and summary data that assist measurement specialists in assessing the procedures used in developing New Jersey's High School Proficiency Test (HSPT), the technical qualities of the tests, and the statewide results obtained from its use. The data summarized in this report were collected during the various…
Descriptors: Equated Scores, Graduation Requirements, High Schools, Item Analysis
Bennett, Randy Elliot; And Others – 1985
This study examined the psychometric characteristics of the Scholastic Aptitude Test (SAT) administered under special conditions for nine handicapped groups. Four psychometric characteristics were studied: level of test performance, test reliability, speededness, and extent of unexpected differential item performance. Psychometric comparisons were…
Descriptors: Aptitude Tests, College Entrance Examinations, Disabilities, Hearing Impairments
Hertzog, James F., Comp.; Seiverling, Richard F., Comp. – 1979
This manual is designed to accompany reports prepared by the Pennsylvania Department of Education for the 505 school districts in that state, summarizing results of the Educational Quality Assessment. The normative sample is described as being stratified on two dimensions: district enrollment and wealth. Administrative procedures are summarized.…
Descriptors: Affective Objectives, Cognitive Objectives, Criterion Referenced Tests, Educational Assessment
District of Columbia Public Schools, Washington, DC. Div. of Quality Assurance. – 1985
The Comprehensive Tests of Basic Skills, Expanded Edition (CTBS), was administered in 1984 to students in the District of Columbia Public Schools. This report includes: (1) an overview covering the purpose of testing, a description of the subject areas tested by the CTBS, and a glossary of technical terms; (2) a narrative and tabular summary of…
Descriptors: Academic Achievement, Achievement Tests, Basic Skills, Elementary Secondary Education
Hertzog, James F., Comp.; Seiverling, Richard F., Comp. – 1979
This manual is designed to accompany reports prepared by the Pennsylvania Department of Education for the 505 school districts in that state, summarizing results of the Educational Quality Assessment. The normative sample is described as being stratified on two dimensions: district enrollment and wealth. Administrative procedures are summarized.…
Descriptors: Affective Objectives, Cognitive Objectives, Criterion Referenced Tests, Educational Assessment
Clarke, S. C. T.; And Others – 1977
To compare achievement standards of 1977 with those of 1956, three tests were administered in their original form to all third graders in a large school district. Approximately 3500 students in 1956 and 4500 in 1977 were administered the California Achievement Tests, the California Short Form Tests of Mental Maturity, and the Gates Advanced…
Descriptors: Academic Achievement, Achievement Gains, Achievement Tests, Arithmetic
Hertzog, James F., Comp.; Seiverling, Richard F., Comp. – 1979
This manual is designed to accompany reports prepared by the Pennsylvania Department of Education for the 505 school districts in that state, summarizing results of the Educational Quality Assessment. The normative sample is described as being stratified on two dimensions: district enrollment and wealth. Administrative procedures are summarized.…
Descriptors: Affective Objectives, Cognitive Objectives, Criterion Referenced Tests, Educational Assessment
Previous Page | Next Page »
Pages: 1  |  2