Target-Based Sentiment Analysis as a Sequence-Tagging Task

Zoe Gerolemou; Jan Scholtes

Advanced Computing Sciences

Research output: Contribution to conference › Paper › Academic

Abstract

By focusing on the online-reviews domain, this study aims to provide a complete solution to the sentiment-analysis task, consisting of its three constituent components: opinion holder, polarity of the underlying sentiment, and target. For the purposes of this research, several challenges related to the nature of the problem are addressed, such as class imbalance and the need for meaningful linguistic data-augmentation techniques to increase the size of the training set and make the use of Long Short-Term Memory models (LSTMs) possible. For both challenges, new effective approaches are proposed and evaluated. As a means of quantifying class imbalance, the Minority-to-Majority Ratio (M2MR) is introduced. The two subtasks of target and polarity detection are tackled using machine-learning means. To support the training process, a new data set, which combines sentences from two different review-based corpora, was constructed. In our research, the best-performing LSTM-based models make use of the context-sensitive BERT embeddings and yield F1-Scores of 0.9263 and 0.8911 over all possible classes for the polarity and target components, respectively.
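
To make the sequence-tagging framing and the class-imbalance measure concrete, here is a minimal sketch. It rests on assumptions, not on the paper: the abstract does not define M2MR, so the function below reads the name literally as the count of the rarest class divided by the count of the most frequent class. The example sentence, the BIO-style tag names, and the function name `minority_to_majority_ratio` are all hypothetical.

```python
from collections import Counter


def minority_to_majority_ratio(labels):
    """Assumed reading of M2MR: rarest-class count / most-frequent-class count.

    A value of 1.0 means perfectly balanced classes; values near 0 mean
    strong imbalance. This definition is an assumption based on the name only.
    """
    counts = Counter(labels)
    if not counts:
        raise ValueError("labels must be non-empty")
    return min(counts.values()) / max(counts.values())


# Hypothetical review sentence, tagged token by token for the target subtask.
# The BIO scheme (B- = begin, I- = inside, O = outside) is an assumption.
tokens = ["The", "battery", "life", "is", "great",
          "but", "the", "screen", "scratches", "easily"]
target_tags = ["O", "B-TARGET", "I-TARGET", "O", "O",
               "O", "O", "B-TARGET", "O", "O"]

# Each token gets exactly one tag, which is what makes this sequence tagging.
assert len(tokens) == len(target_tags)

ratio = minority_to_majority_ratio(target_tags)
print(f"tag counts: {dict(Counter(target_tags))}")
print(f"M2MR (assumed definition): {ratio:.4f}")
```

Even in this toy sentence, most tokens carry the `O` tag, which illustrates why token-level tagging of review text tends to be imbalanced and why a single summary number for that imbalance can be useful.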

Original language	English
Number of pages	15
Publication status	Published - 1 Nov 2019
Event	BNAIC 2019 - VU, Brussels, Belgium; Duration: 7 Nov 2019 → 8 Nov 2019

Conference

Conference	BNAIC 2019
Country/Territory	Belgium
City	Brussels
Period	7/11/19 → 8/11/19

Cite this

@conference{e31cb6f0b72347ba86596cdc3b04ab8c,

title = "Target-Based Sentiment Analysis as a Sequence-Tagging Task",

author = "Zoe Gerolemou and Jan Scholtes",

note = "Funding Information: Supported by ZyLAB Technologies B.V., Amsterdam, the Netherlands. Publisher Copyright: {\textcopyright} 2019 for this paper by its authors.; BNAIC 2019 ; Conference date: 07-11-2019 Through 08-11-2019",

year = "2019",

month = nov,

day = "1",

language = "English",

}

TY - CONF

T1 - Target-Based Sentiment Analysis as a Sequence-Tagging Task

AU - Gerolemou, Zoe

AU - Scholtes, Jan

PY - 2019/11/1

Y1 - 2019/11/1

M3 - Paper

T2 - BNAIC 2019

Y2 - 7 November 2019 through 8 November 2019

ER -