NotesFAQContact Us
Collection
Advanced
Search Tips
Back to results
Peer reviewed Peer reviewed
Direct linkDirect link
ERIC Number: EJ1405350
Record Type: Journal
Publication Date: 2024
Pages: 11
Abstractor: As Provided
ISBN: N/A
ISSN: N/A
EISSN: EISSN-1939-1382
Available Date: N/A
Automatically Difficulty Grading Method for English Reading Corpus with Multifeature Embedding Based on a Pretrained Language Model
IEEE Transactions on Learning Technologies, v17 p474-484 2024
Graded reading is one of the important ways of English learning. How to automatically judge and grade the difficulty of the English reading corpus is of great significance for precision teaching and personalized learning. However, the current rule-based readability assessment methods have some limitations, such as low efficiency and poor accuracy. In particular, these traditional methods usually lack semantics, which is crucial for students to understand the reading material. Meanwhile, they are difficult to be mapped to the difficulty level, which is not conducive to flexible application in actual personalized teaching. In this study, a method for grading the difficulty of the English reading corpus is proposed. This approach makes use of a pretrained language model and feature fusion embedding to make the most of multifeature data when training. First, based on linguists' evaluations of the variables influencing the difficulty of English reading corpus, three primary statistical features--sentence length, word length, and the number of prepositions--are taken into consideration. Then, the semantic features and part-of-speech features of the text are learned by a pretrained language model and long short-term memory, respectively, to capture polysemy features and fine-grained semantic representations that are difficult to represent with traditional models. Finally, multifeature embedding extractions are fused to grade the difficulty of the English reading corpus. Extensive experimental comparisons on a self-built dataset and two datasets that are freely accessible with various models indicate that our method outperforms the others in the task of grading the difficulty of English reading corpora.
Institute of Electrical and Electronics Engineers, Inc. 445 Hoes Lane, Piscataway, NJ 08854. Tel: 732-981-0060; Web site: http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=4620076
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A
Author Affiliations: N/A