Semantic Correlation Graph Embedding

W.W. Wang*, Y.C. Han, S. Bromuri, M. Dumontier

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceeding › Academic › peer-review

Abstract

Many data sets include categorical features in the form of nominal and ordinal features. However, most machine learning algorithms cannot deal with categorical features directly because they require numerical input features. Categorical embeddings are an effective approach to converting categorical features into numerical vectors. This work proposes a novel embedding approach, called Semantic Correlation Graph Embedding, to create embeddings from knowledge graphs. The approach constructs a semantic correlation graph of triplets among the categorical features to learn numerical embeddings. Our approach aims to uncover relationships taking place in categorical data in terms of low-level knowledge and semantics that may help group the features of the data sets under semantic entities. Three distinct embedding models are proposed according to how the graph is constructed. The results are evaluated with two public data sets. They show that the learned embeddings produce a statistically significant improvement in the performance of the classification tasks in terms of AUC, F1 score, precision, and recall.
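The abstract does not spell out how the semantic correlation graph is built or trained, so the following is only a minimal sketch of the general idea, assuming that triplets link categorical values that co-occur in the same row and that plausibility is scored in the TransE style (one of the listed keywords). The helper names `build_triplets` and `transe_score`, the toy columns, and the random vectors are hypothetical illustrations, not the authors' implementation.

```python
import numpy as np


def build_triplets(rows, columns):
    """Build (head, relation, tail) triplets from co-occurring categorical values.

    Assumption: every pair of columns in a row yields one triplet; the paper's
    three graph-construction variants are not described in the abstract.
    """
    triplets = set()
    for row in rows:
        for i in range(len(columns)):
            for j in range(i + 1, len(columns)):
                head = f"{columns[i]}={row[i]}"
                tail = f"{columns[j]}={row[j]}"
                relation = f"{columns[i]}->{columns[j]}"
                triplets.add((head, relation, tail))
    return sorted(triplets)


def transe_score(h, r, t):
    """TransE distance ||h + r - t||: smaller means a more plausible triplet."""
    return float(np.linalg.norm(h + r - t))


# Toy categorical data set (hypothetical).
columns = ["colour", "size"]
rows = [("red", "small"), ("red", "large"), ("blue", "small")]
triplets = build_triplets(rows, columns)

# Untrained random vectors stand in for learned embeddings.
rng = np.random.default_rng(0)
dim = 4
entities = sorted({e for h, _, t in triplets for e in (h, t)})
relations = sorted({r for _, r, _ in triplets})
entity_vecs = {e: rng.normal(size=dim) for e in entities}
relation_vecs = {r: rng.normal(size=dim) for r in relations}

scores = {
    (h, r, t): transe_score(entity_vecs[h], relation_vecs[r], entity_vecs[t])
    for h, r, t in triplets
}
```

In a trained model the entity vectors would be optimised so that observed triplets score lower than corrupted ones, and the resulting vectors could then be fed to a downstream classifier such as logistic regression.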
Original language: English
Title of host publication: 2022 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE)
Publisher: IEEE
Number of pages: 10
ISBN (Print): 9781665467100
Publication status: Published - 2022
Event: IEEE International Conference on Fuzzy Systems - Padua, Italy
Duration: 18 Jul 2022 – 23 Jul 2022

Publication series

Series: IEEE International Fuzzy Systems Conference Proceedings
ISSN: 1544-5615

Conference

Conference: IEEE International Conference on Fuzzy Systems
Abbreviated title: FUZZ-IEEE 2022
Country/Territory: Italy
City: Padua
Period: 18/07/22 – 23/07/22

Keywords

  • Categorical data
  • Logistic regression
  • Knowledge graph
  • Graph embedding
  • TransE
