Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 1 |
| Since 2017 (last 10 years) | 1 |
| Since 2007 (last 20 years) | 1 |
Descriptor
| Comparative Testing | 3 |
| Test Reliability | 3 |
| Test Validity | 2 |
| Achievement Tests | 1 |
| Black Students | 1 |
| College Students | 1 |
| Computer Assisted Testing | 1 |
| Context Effect | 1 |
| Correlation | 1 |
| Essay Tests | 1 |
| Essays | 1 |
| More ▼ | |
Source
| Journal of Educational… | 3 |
Author
| Breland, Hunter M. | 1 |
| Gaynor, Judith L. | 1 |
| Hamid Mohammadi | 1 |
| Mark J. Gierl | 1 |
| Ryan, Katherine E. | 1 |
| Tahereh Firoozi | 1 |
Publication Type
| Journal Articles | 3 |
| Reports - Research | 3 |
| Speeches/Meeting Papers | 1 |
Education Level
| Higher Education | 1 |
| Postsecondary Education | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
| Test of Standard Written… | 1 |
What Works Clearinghouse Rating
Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian
Peer reviewedBreland, Hunter M.; Gaynor, Judith L. – Journal of Educational Measurement, 1979
Over 2,000 writing samples were collected from four undergraduate institutions and compared, where possible, with scores on a multiple-choice test. High correlations between ratings of the writing samples and multiple-choice test scores were obtained. Samples contributed substantially to the prediction of both college grades and writing…
Descriptors: Achievement Tests, Comparative Testing, Correlation, Essay Tests
Peer reviewedRyan, Katherine E. – Journal of Educational Measurement, 1991
The reliability of Mantel-Haenszel (MH) indexes across samples of examinees and sample sizes and their robustness to item context effects were investigated with data for 670 African-American and 5,015 white students from the Second International Mathematics Study. MH procedures can be used to detect differential item functioning. (SLD)
Descriptors: Black Students, Comparative Testing, Context Effect, Evaluation Criteria

Direct link
