ERIC Number: EJ1334636
Record Type: Journal
Publication Date: 2021
Pages: 13
Abstractor: As Provided
ISBN: N/A
ISSN: EISSN-2209-0959
EISSN: N/A
Available Date: N/A
Automated Identification of Discourse Markers Using the NLP Approach: The Case of "Okay"
Sanosi, Abdulaziz; Abdalla, Mohamed
Australian Journal of Applied Linguistics, v4 n3 p119-131 2021
This study aimed to examine the potentials of the NLP approach in detecting discourse markers (DMs), namely okay, in transcribed spoken data. One hundred thirty-eight concordance lines were presented to human referees to judge the functions of okay in them as a DM or Non-DM. After that, the researchers used a Python script written according to the POS tagging scheme of the NLTK library to set rules for identifying cases where okay is used as non-DM. The output of the script was compared to the reference human-annotated data. The results showed that the script could accurately identify the function of okay as DM or non-DM in 92% of the cases. The inaccuracy of detecting the rest was found to be caused by a lack of proper and detailed punctuations. The main implications of the results are that new NLP approaches can detect DMS; however, proper punctuation is required to enable the proper identification of DMs. In accordance with the findings, the researcher recommended adopting the approach after conducting further comprehensive studies.
Descriptors: Natural Language Processing, Computational Linguistics, Programming Languages, Accuracy, Punctuation, Discourse Analysis, Identification, Evaluators, Comparative Analysis, Computer Software
Castledown Publishers. Level 9, 440 Collins Street, Melbourne, Victoria, 3000, Australia. Tel: +61-3-7003-8355; e-mail: contact@castledown-publishers.com.au; Web site: https://www.castledown.com.au/
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A
Author Affiliations: N/A