Advanced

Active Learning and Visual Analytics for Stance Classification with ALVA

Kucher, Kostiantyn; Paradis, Carita LU ; Sahlgren, Magnus and Kerren, Andreas (2017) In ACM Transactions on Interactive Intelligent Systems (TiiS)
Abstract
The automatic detection and classification of stance (e.g., certainty or agreement) in text data using natural language processing and machine learning methods create an opportunity to gain insight into the speakers' attitudes towards their own and other people's utterances. However, identifying stance in text presents many challenges related to training data collection and classifier training. In order to facilitate the entire process of training a stance classifier, we propose a visual analytics approach, called ALVA, for text data annotation and visualization. ALVA's interplay with the stance classifier follows an active learning strategy in order to select suitable candidate utterances for manual annotation. Our approach supports... (More)
The automatic detection and classification of stance (e.g., certainty or agreement) in text data using natural language processing and machine learning methods create an opportunity to gain insight into the speakers' attitudes towards their own and other people's utterances. However, identifying stance in text presents many challenges related to training data collection and classifier training. In order to facilitate the entire process of training a stance classifier, we propose a visual analytics approach, called ALVA, for text data annotation and visualization. ALVA's interplay with the stance classifier follows an active learning strategy in order to select suitable candidate utterances for manual annotation. Our approach supports annotation process management and provides the annotators with a clean user interface for labeling utterances with multiple stance categories. ALVA also contains a visualization method to help analysts of the annotation and training process gain a better understanding of the categories used by the annotators. The visualization uses a novel visual representation, called CatCombos, which groups individual annotation items by the combination of stance categories. Additionally, our system makes a visualization of a vector space model available that is itself based on utterances. ALVA is already being used by our domain experts in linguistics and computational linguistics in order to improve the understanding of stance phenomena and to build a stance classifier for applications such as social media monitoring. (Less)
Please use this url to cite or link to this publication:
author
organization
publishing date
type
Contribution to journal
publication status
in press
subject
keywords
visualization,, stance visualization, , active learning, text visualization, sentiment visualization, annotation, visual analytics, sentiment analysis, stance analysis, NLP, text analytics
in
ACM Transactions on Interactive Intelligent Systems (TiiS)
publisher
ACM
ISSN
2160-6455
language
English
LU publication?
yes
id
d8370ae3-e09e-4f69-8389-82aeca79b462
date added to LUP
2017-08-08 16:43:51
date last changed
2017-08-30 16:03:45
@article{d8370ae3-e09e-4f69-8389-82aeca79b462,
  abstract     = {The automatic detection and classification of stance (e.g., certainty or agreement) in text data using natural language processing and machine learning methods create an opportunity to gain insight into the speakers' attitudes towards their own and other people's utterances. However, identifying stance in text presents many challenges related to training data collection and classifier training. In order to facilitate the entire process of training a stance classifier, we propose a visual analytics approach, called ALVA, for text data annotation and visualization. ALVA's interplay with the stance classifier follows an active learning strategy in order to select suitable candidate utterances for manual annotation. Our approach supports annotation process management and provides the annotators with a clean user interface for labeling utterances with multiple stance categories. ALVA also contains a visualization method to help analysts of the annotation and training process gain a better understanding of the categories used by the annotators. The visualization uses a novel visual representation, called CatCombos, which groups individual annotation items by the combination of stance categories. Additionally, our system makes a visualization of a vector space model available that is itself based on utterances. ALVA is already being used by our domain experts in linguistics and computational linguistics in order to improve the understanding of stance phenomena and to build a stance classifier for applications such as social media monitoring.},
  author       = {Kucher, Kostiantyn and Paradis, Carita and Sahlgren, Magnus and Kerren, Andreas},
  issn         = {2160-6455},
  keyword      = {visualization,,stance visualization, ,active learning,text visualization,sentiment visualization,annotation,visual analytics,sentiment analysis,stance analysis, NLP,text analytics},
  language     = {eng},
  publisher    = {ACM},
  series       = {ACM Transactions on Interactive Intelligent Systems (TiiS)},
  title        = {Active Learning and Visual Analytics for Stance Classification with ALVA},
  year         = {2017},
}