Using machine learning to develop customer insights from user-generated content
(2024) In Journal of Retailing and Consumer Services 81.- Abstract
- Uncovering customer insights (CI) is indispensable for contemporary marketing strategies. The widespread availability of user-generated content (UGC) presents a unique opportunity for firms to gain a nuanced understanding of their customers. However, the size and complexity of UGC datasets pose significant challenges for traditional market research methods, limiting their effectiveness in this context. To address this challenge, this study leverages natural language processing (NLP) and machine learning (ML) techniques to extract nuanced insights from UGC. By integrating sentiment analysis and topic modeling algorithms, we analyzed a dataset of approximately four million X posts (formerly tweets) encompassing 20 global brands across... (More)
- Uncovering customer insights (CI) is indispensable for contemporary marketing strategies. The widespread availability of user-generated content (UGC) presents a unique opportunity for firms to gain a nuanced understanding of their customers. However, the size and complexity of UGC datasets pose significant challenges for traditional market research methods, limiting their effectiveness in this context. To address this challenge, this study leverages natural language processing (NLP) and machine learning (ML) techniques to extract nuanced insights from UGC. By integrating sentiment analysis and topic modeling algorithms, we analyzed a dataset of approximately four million X posts (formerly tweets) encompassing 20 global brands across industries. The findings reveal primary brand-related emotions and identify the top 10 keywords indicative of brand-related sentiment. Using FedEx as a case study, we identify five prominent areas of customer concern: parcel tracking, small business services, the firm's comparative performance, package delivery dynamics, and customer service. Overall, this study offers a roadmap for academics to navigate the complex landscape of generating CI from UGC datasets. It thus raises pertinent practical implications, including boosting customer service, refining marketing strategies, and better understanding customer needs and preferences, thereby contributing to more effective, more responsive business strategies. (Less)
Please use this url to cite or link to this publication:
https://lup.lub.lu.se/record/a481d938-bea9-4a96-859f-d525650cadc8
- author
- Mustak, Mekhail ; Hallikainen, Heli ; Laukkanen, Tommi ; Plé, Loïc ; Hollebeek, Linda LU and Aleem, Majid
- organization
- publishing date
- 2024-11
- type
- Contribution to journal
- publication status
- published
- subject
- keywords
- Customer insights, User-generated content, UGC, Sentiment analysis, Topic modeling, Artificial intelligence, Machine learning, Natural language processing, NLP, Marketing, Big data
- in
- Journal of Retailing and Consumer Services
- volume
- 81
- article number
- 104034
- publisher
- Elsevier
- external identifiers
-
- scopus:85201377662
- ISSN
- 1873-1384
- DOI
- 10.1016/j.jretconser.2024.104034
- language
- English
- LU publication?
- yes
- id
- a481d938-bea9-4a96-859f-d525650cadc8
- date added to LUP
- 2024-08-15 15:02:22
- date last changed
- 2024-10-30 11:30:07
@article{a481d938-bea9-4a96-859f-d525650cadc8, abstract = {{Uncovering customer insights (CI) is indispensable for contemporary marketing strategies. The widespread availability of user-generated content (UGC) presents a unique opportunity for firms to gain a nuanced understanding of their customers. However, the size and complexity of UGC datasets pose significant challenges for traditional market research methods, limiting their effectiveness in this context. To address this challenge, this study leverages natural language processing (NLP) and machine learning (ML) techniques to extract nuanced insights from UGC. By integrating sentiment analysis and topic modeling algorithms, we analyzed a dataset of approximately four million X posts (formerly tweets) encompassing 20 global brands across industries. The findings reveal primary brand-related emotions and identify the top 10 keywords indicative of brand-related sentiment. Using FedEx as a case study, we identify five prominent areas of customer concern: parcel tracking, small business services, the firm's comparative performance, package delivery dynamics, and customer service. Overall, this study offers a roadmap for academics to navigate the complex landscape of generating CI from UGC datasets. It thus raises pertinent practical implications, including boosting customer service, refining marketing strategies, and better understanding customer needs and preferences, thereby contributing to more effective, more responsive business strategies.}}, author = {{Mustak, Mekhail and Hallikainen, Heli and Laukkanen, Tommi and Plé, Loïc and Hollebeek, Linda and Aleem, Majid}}, issn = {{1873-1384}}, keywords = {{Customer insights; User-generated content; UGC; Sentiment analysis; Topic modeling; Artificial intelligence; Machine learning; Natural language processing; NLP; Marketing; Big data}}, language = {{eng}}, publisher = {{Elsevier}}, series = {{Journal of Retailing and Consumer Services}}, title = {{Using machine learning to develop customer insights from user-generated content}}, url = {{http://dx.doi.org/10.1016/j.jretconser.2024.104034}}, doi = {{10.1016/j.jretconser.2024.104034}}, volume = {{81}}, year = {{2024}}, }