Comparing text mining and manual coding methods: Analysing interview data on quality of care in long-term care for older adults

Coen Hacking*, Hilde Verbeek, Jan P H Hamers, Sil Aarts

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

OBJECTIVES: In long-term care for older adults, large amounts of text are collected relating to the quality of care, such as transcribed interviews. Researchers currently analyze textual data manually to gain insights, which is a time-consuming process. Text mining could provide a solution, as this methodology can be used to analyze large amounts of text automatically. This study aims to compare text mining to manual coding with regard to sentiment analysis and thematic content analysis. METHODS: Data were collected from interviews with residents (n = 21), family members (n = 20), and care professionals (n = 20). Text mining models were developed and compared to the manual approach. The results of the manual and text mining approaches were evaluated based on three criteria: accuracy, consistency, and expert feedback. Accuracy assessed the similarity between the two approaches, while consistency determined whether each individual approach found the same themes in similar text segments. Expert feedback served as a representation of the perceived correctness of the text mining approach. RESULTS: An accuracy analysis revealed that more than 80% of the text segments were assigned the same themes and sentiment using both text mining and manual approaches. Interviews coded with text mining demonstrated higher consistency compared to those coded manually. Expert feedback identified certain limitations in both the text mining and manual approaches. CONCLUSIONS AND IMPLICATIONS: While these analyses highlighted the current limitations of text mining, they also exposed certain inconsistencies in manual analysis. This information suggests that text mining has the potential to be an effective and efficient tool for analysing large volumes of textual data in the context of long-term care for older adults.
Original languageEnglish
Article numbere0292578
Number of pages13
JournalPLOS ONE
Volume18
Issue number11
DOIs
Publication statusPublished - 8 Nov 2023

Keywords

  • Humans
  • Aged
  • Long-Term Care
  • Data Mining/methods
  • Sentiment Analysis
  • Text Messaging
  • Quality of Health Care

Cite this