Information Extraction for Social Media

M. B. Habib; M. van Keulen

doi:10.3115/V1/W14-6202

Information Extraction for Social Media

M. B. Habib, M. van Keulen

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceeding › Academic › peer-review

54 Downloads (Pure)

Abstract

The rapid growth in IT in the last two decades has led to a growth in the amount of information available online. A new style for sharing information is social media. Social media is a continuously instantly updated source of information. In this position paper, we propose a framework for Information Extraction (IE) from unstructured user generated contents on social media. The framework proposes solutions to overcome the IE challenges in this domain such as the short context, the noisy sparse contents and the uncertain contents. To overcome the challenges facing IE from social media, State-Of-The-Art approaches need to be adapted to suit the nature of social media posts. The key components and aspects of our proposed framework are noisy text filtering, named entity extraction, named entity disambiguation, feedback loops, and uncertainty handling.

Original language	English
Title of host publication	Proceedings of the Third Workshop on Semantic Web and Information Extraction (SWAIE 2014), Dublin, Ireland
Place of Publication	Dublin
Publisher	Association for Computational Linguistics
Pages	9-16
Number of pages	8
Volume	W14-62
DOIs	https://doi.org/10.3115/V1/W14-6202
Publication status	Published - 1 Aug 2014
Externally published	Yes

Keywords

Information Extraction
Social Media
Twitter

Access to Document

10.3115/V1/W14-6202Licence: CC BY

paper (1)Final published version, 194 KB

Cite this

@inproceedings{d741dfefedb24d35b64ccdb57eff74d1,

title = "Information Extraction for Social Media",

abstract = "The rapid growth in IT in the last two decades has led to a growth in the amount of information available online. A new style for sharing information is social media. Social media is a continuously instantly updated source of information. In this position paper, we propose a framework for Information Extraction (IE) from unstructured user generated contents on social media. The framework proposes solutions to overcome the IE challenges in this domain such as the short context, the noisy sparse contents and the uncertain contents. To overcome the challenges facing IE from social media, State-Of-The-Art approaches need to be adapted to suit the nature of social media posts. The key components and aspects of our proposed framework are noisy text filtering, named entity extraction, named entity disambiguation, feedback loops, and uncertainty handling.",

keywords = "Information Extraction, Social Media, Twitter",

author = "Habib, {M. B.} and Keulen, {M. van}",

note = "http://eprints.eemcs.utwente.nl/24959/",

year = "2014",

month = aug,

day = "1",

doi = "10.3115/V1/W14-6202",

language = "English",

volume = "W14-62",

pages = "9--16",

booktitle = "Proceedings of the Third Workshop on Semantic Web and Information Extraction (SWAIE 2014), Dublin, Ireland",

publisher = "Association for Computational Linguistics",

}

Information Extraction for Social Media. / Habib, M. B.; Keulen, M. van.
Proceedings of the Third Workshop on Semantic Web and Information Extraction (SWAIE 2014), Dublin, Ireland. Vol. W14-62 Dublin: Association for Computational Linguistics, 2014. p. 9-16.

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceeding › Academic › peer-review

TY - GEN

T1 - Information Extraction for Social Media

AU - Habib, M. B.

AU - Keulen, M. van

N1 - http://eprints.eemcs.utwente.nl/24959/

PY - 2014/8/1

Y1 - 2014/8/1

N2 - The rapid growth in IT in the last two decades has led to a growth in the amount of information available online. A new style for sharing information is social media. Social media is a continuously instantly updated source of information. In this position paper, we propose a framework for Information Extraction (IE) from unstructured user generated contents on social media. The framework proposes solutions to overcome the IE challenges in this domain such as the short context, the noisy sparse contents and the uncertain contents. To overcome the challenges facing IE from social media, State-Of-The-Art approaches need to be adapted to suit the nature of social media posts. The key components and aspects of our proposed framework are noisy text filtering, named entity extraction, named entity disambiguation, feedback loops, and uncertainty handling.

AB - The rapid growth in IT in the last two decades has led to a growth in the amount of information available online. A new style for sharing information is social media. Social media is a continuously instantly updated source of information. In this position paper, we propose a framework for Information Extraction (IE) from unstructured user generated contents on social media. The framework proposes solutions to overcome the IE challenges in this domain such as the short context, the noisy sparse contents and the uncertain contents. To overcome the challenges facing IE from social media, State-Of-The-Art approaches need to be adapted to suit the nature of social media posts. The key components and aspects of our proposed framework are noisy text filtering, named entity extraction, named entity disambiguation, feedback loops, and uncertainty handling.

KW - Information Extraction

KW - Social Media

KW - Twitter

U2 - 10.3115/V1/W14-6202

DO - 10.3115/V1/W14-6202

M3 - Conference article in proceeding

VL - W14-62

SP - 9

EP - 16

BT - Proceedings of the Third Workshop on Semantic Web and Information Extraction (SWAIE 2014), Dublin, Ireland

PB - Association for Computational Linguistics

CY - Dublin

ER -