ERIC Number: EJ1406168
Record Type: Journal
Publication Date: 2023
Pages: 17
Abstractor: As Provided
ISBN: N/A
ISSN: ISSN-1364-5579
EISSN: EISSN-1464-5300
Available Date: N/A
Does Accuracy Matter? Methodological Considerations When Using Automated Speech-to-Text for Social Science Research
Steven J. Pentland; Christie M. Fuller; Lee A. Spitzley; Douglas P. Twitchell
International Journal of Social Research Methodology, v26 n6 p661-677 2023
The analysis of spoken language has been integral to a breadth of research in social science and beyond. However, for analyses to occur with efficiency, language must be in the form of computer-readable text. Historically, the speech-to-text process has occurred manually using human transcriptionists. Automated speech recognition (ASR) is advertised as an efficient and inexpensive alternative, but research shows this method of speech-to-text is prone to error. This paper investigates the viability of using error prone ASR transcriptions as part of the methodological process of language analysis. Results show that at the individual feature level, analysis of ASR transcriptions differ dramatically from human transcriptions. However, when the same features are used for classification, a common machine learning task, performance results between ASR and human transcriptions are similar. We present these findings and conclude with a discussion on the methodological considerations for researchers who opt to use automated speech recognition for social science research.
Descriptors: Accuracy, Social Science Research, Classification, Reading Processes, Speech Communication, Transcripts (Written Records), Audio Equipment, Computational Linguistics, Language Research, Error Patterns, Computer Software, Research Problems
Routledge. Available from: Taylor & Francis, Ltd. 530 Walnut Street Suite 850, Philadelphia, PA 19106. Tel: 800-354-1420; Tel: 215-625-8900; Fax: 215-207-0050; Web site: http://www.tandf.co.uk/journals
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A
Author Affiliations: N/A