ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	1
Since 2007 (last 20 years)	3

Descriptor

Error of Measurement	9
Scoring Formulas	9
Test Reliability	9
Test Items	4
Test Construction	3
True Scores	3
Computer Simulation	2
Cutting Scores	2
Guessing (Tests)	2
Interrater Reliability	2
Item Analysis	2
Mastery Tests	2
Multiple Choice Tests	2
Sample Size	2
Scores	2
Test Length	2
Test Validity	2
Accuracy	1
Administration	1
Alternative Assessment	1
Benchmarking	1
Child Abuse	1
Computation	1
Computer Programs	1
Confidence Testing	1
More ▼

Source

Assessment & Evaluation in…	1
Child Abuse and Neglect: The…	1
Educational Sciences: Theory…	1
Educational and Psychological…	1
Journal of Educational…	1
Measurement and Evaluation in…	1
National Center for Research…	1

Author

Huynh, Huynh	2
Bardhoshi, Gerta	1
Berger, Dale E.	1
Brennan, Robert L.	1
Burton, Richard F.	1
Cureton, Edward E.	1
Erdogan, Semra	1
Erford, Bradley T.	1
Griffin, Noelle	1
Kaya, Irem Ersöz	1
Niemi, David	1
Saunders, Joseph C.	1
Selvi, Hüseyin	1
Temel, Gülhan Orekici	1
Tsujimoto, Richard N.	1
Vallone, Julia	1
Wang, Haiwen	1
Wang, Jia	1
More ▼

Publication Type

Journal Articles	5
Reports - Research	5
Reports - Evaluative	2
Opinion Papers	1
Reports - Descriptive	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education

Audience

Location

Mississippi	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

Comprehensive Tests of Basic…

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Processes and Procedures for Estimating Score Reliability and Precision

Peer reviewed

Direct link

Bardhoshi, Gerta; Erford, Bradley T. – Measurement and Evaluation in Counseling and Development, 2017

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, a), test-retest, alternate forms, interscorer, and…

Descriptors: Scores, Test Reliability, Accuracy, Pretests Posttests

Investigation of Coefficient of Individual Agreement in Terms of Sample Size, Random and Monotone Missing Ratio, and Number of Repeated Measures

Peer reviewed
PDF on ERIC

Download full text

Temel, Gülhan Orekici; Erdogan, Semra; Selvi, Hüseyin; Kaya, Irem Ersöz – Educational Sciences: Theory and Practice, 2016

Studies based on longitudinal data focus on the change and development of the situation being investigated and allow for examining cases regarding education, individual development, cultural change, and socioeconomic improvement in time. However, as these studies require taking repeated measures in different time periods, they may include various…

Descriptors: Investigations, Sample Size, Longitudinal Studies, Interrater Reliability

Reliability of Multiple-Choice Tests is the Proportion of Variance Which is True Variance

Peer reviewed

Cureton, Edward E. – Educational and Psychological Measurement, 1971

A rebuttal of Frary's 1969 article in Educational and Psychological Measurement. (MS)

Descriptors: Error of Measurement, Guessing (Tests), Multiple Choice Tests, Scoring Formulas

Reliability of Composite Measurements Based on the m Highest of n Equivalent Components.

Peer reviewed

Huynh, Huynh – Journal of Educational Statistics, 1986

Under the assumptions of classical measurement theory and the condition of normality, a formula is derived for the reliability of composite scores. The formula represents an extension of the Spearman-Brown formula to the case of truncated data. (Author/JAZ)

Descriptors: Computer Simulation, Error of Measurement, Expectancy Tables, Scoring Formulas

Predicting/Preventing Child Abuse: Value of Utility Maximizing Cutting Scores.

Tsujimoto, Richard N.; Berger, Dale E. – Child Abuse and Neglect: The International Journal, 1988

Two criteria are discussed for determining cutting scores on a predictor variable for identifying cases of likely child abuse--utility maximizing and error minimizing. Utility maximizing is the preferable criterion, as it optimizes the balance between the costs of incorrect decisions and the benefits of correct decisions. (Author/JDD)

Descriptors: Child Abuse, Cost Effectiveness, Cutting Scores, Error of Measurement

Multiple Choice and True/False Tests: Reliability Measures and Some Implications of Negative Marking

Peer reviewed

Direct link

Burton, Richard F. – Assessment & Evaluation in Higher Education, 2004

The standard error of measurement usefully provides confidence limits for scores in a given test, but is it possible to quantify the reliability of a test with just a single number that allows comparison of tests of different format? Reliability coefficients do not do this, being dependent on the spread of examinee attainment. Better in this…

Descriptors: Multiple Choice Tests, Error of Measurement, Test Reliability, Test Items

Consideration for Sample Size in Reliability Studies for Mastery Tests. Publication Series in Mastery Testing.

Download full text

Saunders, Joseph C.; Huynh, Huynh – 1980

In most reliability studies, the precision of a reliability estimate varies inversely with the number of examinees (sample size). Thus, to achieve a given level of accuracy, some minimum sample size is required. An approximation for this minimum size may be made if some reasonable assumptions regarding the mean and standard deviation of the test…

Descriptors: Cutting Scores, Difficulty Level, Error of Measurement, Mastery Tests

Recommendations for Building a Valid Benchmark Assessment System: Second Report to the Jackson Public Schools. CRESST Report 724

Download full text

Niemi, David; Wang, Jia; Wang, Haiwen; Vallone, Julia; Griffin, Noelle – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2007

There are usually many testing activities going on in a school, with different tests serving different purposes, thus organization and planning are key in creating an efficient system in assessing the most important educational objectives. In the ideal case, an assessment system will be able to inform on student learning, instruction and…

Descriptors: School Administration, Educational Objectives, Administration, Public Schools

The Evaluation of Mastery Test Items. Final Report.

Download full text

Brennan, Robert L. – 1974

The first four chapters of this report primarily provide an extensive, critical review of the literature with regard to selected aspects of the criterion-referenced and mastery testing fields. Major topics treated include: (a) definitions, distinctions, and background, (b) the relevance of classical test theory, (c) validity and procedures for…

Descriptors: Computer Programs, Confidence Testing, Criterion Referenced Tests, Error of Measurement