Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Using machine learning to develop customer insights from user-generated content

Mustak, Mekhail ; Hallikainen, Heli ; Laukkanen, Tommi ; Plé, Loïc ; Hollebeek, Linda LU and Aleem, Majid (2024) In Journal of Retailing and Consumer Services 81.
Abstract
Uncovering customer insights (CI) is indispensable for contemporary marketing strategies. The widespread availability of user-generated content (UGC) presents a unique opportunity for firms to gain a nuanced understanding of their customers. However, the size and complexity of UGC datasets pose significant challenges for traditional market research methods, limiting their effectiveness in this context. To address this challenge, this study leverages natural language processing (NLP) and machine learning (ML) techniques to extract nuanced insights from UGC. By integrating sentiment analysis and topic modeling algorithms, we analyzed a dataset of approximately four million X posts (formerly tweets) encompassing 20 global brands across... (More)
Uncovering customer insights (CI) is indispensable for contemporary marketing strategies. The widespread availability of user-generated content (UGC) presents a unique opportunity for firms to gain a nuanced understanding of their customers. However, the size and complexity of UGC datasets pose significant challenges for traditional market research methods, limiting their effectiveness in this context. To address this challenge, this study leverages natural language processing (NLP) and machine learning (ML) techniques to extract nuanced insights from UGC. By integrating sentiment analysis and topic modeling algorithms, we analyzed a dataset of approximately four million X posts (formerly tweets) encompassing 20 global brands across industries. The findings reveal primary brand-related emotions and identify the top 10 keywords indicative of brand-related sentiment. Using FedEx as a case study, we identify five prominent areas of customer concern: parcel tracking, small business services, the firm's comparative performance, package delivery dynamics, and customer service. Overall, this study offers a roadmap for academics to navigate the complex landscape of generating CI from UGC datasets. It thus raises pertinent practical implications, including boosting customer service, refining marketing strategies, and better understanding customer needs and preferences, thereby contributing to more effective, more responsive business strategies. (Less)
Please use this url to cite or link to this publication:
author
; ; ; ; and
organization
publishing date
type
Contribution to journal
publication status
published
subject
keywords
Customer insights, User-generated content, UGC, Sentiment analysis, Topic modeling, Artificial intelligence, Machine learning, Natural language processing, NLP, Marketing, Big data
in
Journal of Retailing and Consumer Services
volume
81
article number
104034
publisher
Elsevier
external identifiers
  • scopus:85201377662
ISSN
1873-1384
DOI
10.1016/j.jretconser.2024.104034
language
English
LU publication?
yes
id
a481d938-bea9-4a96-859f-d525650cadc8
date added to LUP
2024-08-15 15:02:22
date last changed
2024-10-30 11:30:07
@article{a481d938-bea9-4a96-859f-d525650cadc8,
  abstract     = {{Uncovering customer insights (CI) is indispensable for contemporary marketing strategies. The widespread availability of user-generated content (UGC) presents a unique opportunity for firms to gain a nuanced understanding of their customers. However, the size and complexity of UGC datasets pose significant challenges for traditional market research methods, limiting their effectiveness in this context. To address this challenge, this study leverages natural language processing (NLP) and machine learning (ML) techniques to extract nuanced insights from UGC. By integrating sentiment analysis and topic modeling algorithms, we analyzed a dataset of approximately four million X posts (formerly tweets) encompassing 20 global brands across industries. The findings reveal primary brand-related emotions and identify the top 10 keywords indicative of brand-related sentiment. Using FedEx as a case study, we identify five prominent areas of customer concern: parcel tracking, small business services, the firm's comparative performance, package delivery dynamics, and customer service. Overall, this study offers a roadmap for academics to navigate the complex landscape of generating CI from UGC datasets. It thus raises pertinent practical implications, including boosting customer service, refining marketing strategies, and better understanding customer needs and preferences, thereby contributing to more effective, more responsive business strategies.}},
  author       = {{Mustak, Mekhail and Hallikainen, Heli and Laukkanen, Tommi and Plé, Loïc and Hollebeek, Linda and Aleem, Majid}},
  issn         = {{1873-1384}},
  keywords     = {{Customer insights; User-generated content; UGC; Sentiment analysis; Topic modeling; Artificial intelligence; Machine learning; Natural language processing; NLP; Marketing; Big data}},
  language     = {{eng}},
  publisher    = {{Elsevier}},
  series       = {{Journal of Retailing and Consumer Services}},
  title        = {{Using machine learning to develop customer insights from user-generated content}},
  url          = {{http://dx.doi.org/10.1016/j.jretconser.2024.104034}},
  doi          = {{10.1016/j.jretconser.2024.104034}},
  volume       = {{81}},
  year         = {{2024}},
}