Publication Date
| In 2026 | 0 |
| Since 2025 | 0 |
| Since 2022 (last 5 years) | 0 |
| Since 2017 (last 10 years) | 0 |
| Since 2007 (last 20 years) | 5 |
Descriptor
| Error of Measurement | 16 |
| Scoring Formulas | 16 |
| Test Items | 5 |
| Test Reliability | 5 |
| Cutting Scores | 4 |
| Test Theory | 4 |
| True Scores | 4 |
| Essay Tests | 3 |
| Higher Education | 3 |
| Mathematical Models | 3 |
| Medical Students | 3 |
Author
| Huynh, Huynh | 2 |
| Attali, Yigal | 1 |
| Bakker, J. | 1 |
| Beek, F. J. A. | 1 |
| Berger, Dale E. | 1 |
| Brennan, Robert L. | 1 |
| Dimoliatis, Ioannis D. K. | 1 |
| Erdogan, Semra | 1 |
| Glass, Gene V. | 1 |
| Haaring, C. | 1 |
| Holster, Trevor A. | 1 |
Publication Type
| Reports - Research | 16 |
| Journal Articles | 11 |
| Speeches/Meeting Papers | 4 |
| Opinion Papers | 1 |
Education Level
| Higher Education | 3 |
| Postsecondary Education | 3 |
| Elementary Secondary Education | 1 |
Audience
| Researchers | 2 |
Assessments and Surveys
| Comprehensive Tests of Basic… | 1 |
Ravesloot, C. J.; Van der Schaaf, M. F.; Muijtjens, A. M. M.; Haaring, C.; Kruitwagen, C. L. J. J.; Beek, F. J. A.; Bakker, J.; Van Schaik, J.P.J.; Ten Cate, Th. J. – Advances in Health Sciences Education, 2015
Formula scoring (FS) is the use of a "don't know" option (DKO) combined with the subtraction of points for wrong answers. Its effect on the construct validity and reliability of progress test scores is a subject of discussion. Choosing the DKO may be affected not only by knowledge level but also by risk-taking tendency, and may thus introduce construct-irrelevant…
Descriptors: Scoring Formulas, Tests, Scores, Construct Validity
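The abstract does not restate the scoring rule, but classical formula scoring computes rights minus wrongs divided by (k − 1) for k-option items, with omitted ("don't know") responses scoring zero. A minimal sketch, not the authors' implementation:

```python
def formula_score(rights: int, wrongs: int, options: int) -> float:
    """Classical formula score: rights minus wrongs / (options - 1).
    Omitted ("don't know") items contribute nothing, so examinees are
    not rewarded for blind guessing in expectation."""
    return rights - wrongs / (options - 1)

# 40-item test with 5-option items: 30 right, 6 wrong, 4 omitted
print(formula_score(30, 6, 5))  # -> 28.5
```

Under random guessing on a 5-option item, the expected penalty (1/5 chance of +1, 4/5 chance of −1/4) is zero, which is the rationale for the (k − 1) divisor.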
Temel, Gülhan Orekici; Erdogan, Semra; Selvi, Hüseyin; Kaya, Irem Ersöz – Educational Sciences: Theory and Practice, 2016
Studies based on longitudinal data focus on the change and development of the situation under investigation and allow cases to be examined over time with respect to education, individual development, cultural change, and socioeconomic improvement. However, as these studies require repeated measures taken at different time periods, they may include various…
Descriptors: Investigations, Sample Size, Longitudinal Studies, Interrater Reliability
Holster, Trevor A.; Lake, J. – Language Assessment Quarterly, 2016
Stewart questioned Beglar's use of Rasch analysis of the Vocabulary Size Test (VST) and advocated the use of 3-parameter logistic item response theory (3PLIRT) on the basis that it models a non-zero lower asymptote for items, often called a "guessing" parameter. In support of this theory, Stewart presented fit statistics derived from…
Descriptors: Guessing (Tests), Item Response Theory, Vocabulary, Language Tests
Dimoliatis, Ioannis D. K.; Jelastopulu, Eleni – Universal Journal of Educational Research, 2013
The surgical theatre educational environment measures STEEM, OREEM, and mini-STEEM for students (student-STEEM) contain a thus far disregarded systematic overestimation (OE) due to inaccurate percentage calculation. The aim of the present study was to investigate the magnitude of this systematic bias and to suggest a correction for it. After an…
Descriptors: Educational Environment, Scores, Grade Prediction, Academic Standards
Peer reviewed: Mitchelmore, M. C. – British Journal of Educational Psychology, 1981
This paper presents a scientific rationale for deciding the number of points to use on a grading scale in any given assessment situation. The rationale is applied to two common methods of assessment (multiple-choice and essay tests) and an example of a composite assessment. (Author/SJL)
Descriptors: Error of Measurement, Essay Tests, Grading, Higher Education
Peer reviewed: McGaw, Barry; Glass, Gene V. – American Educational Research Journal, 1980
There are difficulties in expressing effect sizes on a common metric when some studies use transformed scales to express group differences, or use factorial designs or covariance adjustments to obtain a reduced error term. A common metric on which effect sizes may be standardized is described. (Author/RL)
Descriptors: Control Groups, Error of Measurement, Mathematical Models, Research Problems
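The common metric the abstract refers to builds on the standardized mean difference. Glass's delta, which standardizes the group difference by the control-group standard deviation, is one widely used form; this is a sketch of that general idea, not the paper's full procedure for transformed scales or factorial designs:

```python
def glass_delta(mean_treat: float, mean_ctrl: float, sd_ctrl: float) -> float:
    """Glass's delta: the treatment-control mean difference expressed
    in control-group standard deviation units, so effects from studies
    with different raw scales can be compared on one metric."""
    return (mean_treat - mean_ctrl) / sd_ctrl

# Treatment mean 105, control mean 100, control SD 10
print(glass_delta(105.0, 100.0, 10.0))  # -> 0.5
```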
Peer reviewed: Kleven, Thor Arnfinn – Scandinavian Journal of Educational Research, 1979
Assuming different values of the standard error of measurement, the relation of scale coarseness to the total amount of error is studied on the basis of the probability distribution of error. The analyses are performed within two models of error and with two criteria for the amount of error. (Editor/SJL)
Descriptors: Cutting Scores, Error of Measurement, Goodness of Fit, Grading
Peer reviewed: Huynh, Huynh – Journal of Educational Statistics, 1986
Under the assumptions of classical measurement theory and the condition of normality, a formula is derived for the reliability of composite scores. The formula represents an extension of the Spearman-Brown formula to the case of truncated data. (Author/JAZ)
Descriptors: Computer Simulation, Error of Measurement, Expectancy Tables, Scoring Formulas
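The truncated-data extension itself is not given in the abstract, but the classical Spearman-Brown prophecy formula it generalizes can be sketched:

```python
def spearman_brown(rho: float, k: float) -> float:
    """Spearman-Brown prophecy formula: predicted reliability of a test
    lengthened by factor k, given the reliability rho of the original
    test (k = 2 means doubling the number of parallel items)."""
    return k * rho / (1 + (k - 1) * rho)

# Doubling a test whose reliability is .60
print(spearman_brown(0.60, 2))  # -> 0.75
```

With k = 1 the formula returns rho unchanged, and as k grows the predicted reliability approaches 1, which matches the classical-test-theory intuition that adding parallel items averages out measurement error.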
Attali, Yigal – ETS Research Report Series, 2007
Because there is no commonly accepted view of what makes for good writing, automated essay scoring (AES) ideally should be able to accommodate different theoretical positions, certainly at the level of state standards but also perhaps among teachers at the classroom level. This paper presents a practical approach and an interactive computer…
Descriptors: Computer Assisted Testing, Automation, Essay Tests, Scoring
Tsujimoto, Richard N.; Berger, Dale E. – Child Abuse and Neglect: The International Journal, 1988
Two criteria are discussed for determining cutting scores on a predictor variable for identifying cases of likely child abuse: utility maximizing and error minimizing. Utility maximizing is the preferable criterion, as it optimizes the balance between the costs of incorrect decisions and the benefits of correct decisions. (Author/JDD)
Descriptors: Child Abuse, Cost Effectiveness, Cutting Scores, Error of Measurement
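The abstract names the utility-maximizing criterion but gives no procedure. A hypothetical sketch of selecting the cut that maximizes total utility, with made-up payoff values for the four decision outcomes, might look like:

```python
def best_cutting_score(cases, payoffs):
    """Pick the cutting score that maximizes total utility.
    cases: list of (predictor_score, is_positive) pairs.
    payoffs: dict with keys 'tp', 'fp', 'tn', 'fn' giving the benefit
    (positive) or cost (negative) of each decision outcome."""
    candidates = sorted({score for score, _ in cases})

    def utility(cut):
        total = 0.0
        for score, positive in cases:
            flagged = score >= cut  # flag cases at or above the cut
            if flagged:
                total += payoffs['tp'] if positive else payoffs['fp']
            else:
                total += payoffs['fn'] if positive else payoffs['tn']
        return total

    return max(candidates, key=utility)

# Hypothetical data: false negatives are costliest, so the chosen cut
# balances missed cases against false alarms.
cases = [(1, False), (2, False), (3, True), (4, True), (5, True)]
payoffs = {'tp': 10.0, 'tn': 1.0, 'fp': -5.0, 'fn': -20.0}
print(best_cutting_score(cases, payoffs))  # -> 3
```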
Peer reviewed: Lord, Frederic M. – Journal of Educational Measurement, 1984
Four methods are outlined for estimating or approximating from a single test administration the standard error of measurement of number-right test score at specified ability levels or cutting scores. The methods are illustrated and compared on one set of real test data. (Author)
Descriptors: Academic Ability, Cutting Scores, Error of Measurement, Scoring Formulas
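Lord compares four methods, none reproduced in the abstract; as an illustration of the general idea, one classical single-administration approximation is the conditional SEM under the binomial error model (an assumption here, not necessarily one of Lord's four):

```python
import math

def binomial_csem(number_right: int, n_items: int) -> float:
    """Conditional standard error of measurement of a number-right
    score x on an n-item test under the binomial error model:
    sqrt(x * (n - x) / (n - 1)). Largest for middle scores, zero at
    the extremes."""
    x = number_right
    return math.sqrt(x * (n_items - x) / (n_items - 1))

# CSEM at a cutting score of 30 on a 40-item test
print(binomial_csem(30, 40))
```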
Saunders, Joseph C.; Huynh, Huynh – 1980
In most reliability studies, the precision of a reliability estimate varies inversely with the number of examinees (sample size). Thus, to achieve a given level of accuracy, some minimum sample size is required. An approximation for this minimum size may be made if some reasonable assumptions regarding the mean and standard deviation of the test…
Descriptors: Cutting Scores, Difficulty Level, Error of Measurement, Mastery Tests
Littlefield, John H.; And Others – 1983
Observational ratings of student clinical performance are influenced by factors other than the quality of the performance. Individual raters may be more stringent or lenient than their colleagues. In this medical school setting, multiple raters evaluated each student. To reduce the influence of "error" due to differences among raters, each rater…
Descriptors: Bias, Error of Measurement, Higher Education, Interrater Reliability
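The abstract is truncated before describing the adjustment. One common leniency correction, assumed here and not necessarily the authors' method, recenters each rater's scores on the grand mean:

```python
def adjust_for_rater_leniency(ratings):
    """Hypothetical leniency correction: shift every score by the gap
    between the grand mean and the rater's own mean, so systematically
    stringent and lenient raters are pulled to a common center.
    ratings: dict mapping rater name -> list of scores."""
    all_scores = [s for scores in ratings.values() for s in scores]
    grand = sum(all_scores) / len(all_scores)
    return {
        rater: [s + (grand - sum(scores) / len(scores)) for s in scores]
        for rater, scores in ratings.items()
    }

ratings = {"lenient": [8, 9, 10], "stringent": [4, 5, 6]}
# Both raters saw comparable students; after adjustment their scores align.
print(adjust_for_rater_leniency(ratings))  # both map to [6.0, 7.0, 8.0]
```

This only removes a constant per-rater shift; differences in rater spread or rater-by-student interactions would need a more elaborate model.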
Livingston, Samuel A. – 1986
This paper deals with test fairness regarding a test consisting of two parts: (1) a "common" section, taken by all students; and (2) a "variable" section, in which some students may answer a different set of questions from other students. For example, a test taken by several thousand students each year contains a common multiple-choice portion and…
Descriptors: Difficulty Level, Error of Measurement, Essay Tests, Mathematical Models
Tatsuoka, Kikumi – 1980
This paper presents a new method for estimating a given latent trait variable by the least-squares approach. The beta weights are obtained recursively with the help of Fourier series and are expressed as functions of the item parameters of the response curves. The values of the latent trait variable estimated by this method and by the maximum likelihood method…
Descriptors: Computer Assisted Testing, Error of Measurement, Higher Education, Latent Trait Theory
