Abstract
Dynamic graph representation learning strategies rely on different neural architectures to capture the graph evolution over time. However, the underlying neural architectures require a large number of parameters to train and suffer from high online inference latency, that is, several model parameters have to be updated when new data arrive online. In this study, we propose Distill2Vec, a knowledge distillation strategy to train a compact model with a low number of trainable parameters, so as to reduce the latency of online inference while keeping model accuracy high. We design a distillation loss function based on the Kullback-Leibler divergence to transfer the knowledge acquired by a teacher model trained on offline data to a small-size student model for online data. Our experiments on publicly available datasets show the superiority of our proposed model over several state-of-the-art approaches, with relative gains of up to 5% in the link prediction task. In addition, we demonstrate the effectiveness of our knowledge distillation strategy in terms of the number of required parameters, where Distill2Vec achieves a compression ratio of up to 7:100 compared with baseline approaches. For reproduction purposes, our implementation is publicly available at https://stefanosantaris.github.io/Distill2Vec.
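To illustrate the kind of loss the abstract describes, the sketch below shows a generic Kullback-Leibler distillation term in PyTorch, where a small student model is trained to match the softened output distribution of a larger teacher. This is a minimal sketch under assumptions, not the paper's released implementation: the function names (`distillation_loss`, `student_loss`), the temperature value, and the weighting factor `alpha` are illustrative choices, and the task loss is a generic cross-entropy rather than the paper's link prediction objective.

```python
import torch
import torch.nn.functional as F


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL divergence between softened teacher and student distributions.

    Both inputs are raw (pre-softmax) scores of shape (batch, num_classes).
    The temperature and the T^2 scaling follow standard knowledge
    distillation practice; they are assumptions, not values from the paper.
    """
    # Soften both distributions with the temperature.
    student_log_probs = F.log_softmax(student_logits / temperature, dim=-1)
    teacher_probs = F.softmax(teacher_logits / temperature, dim=-1)
    # kl_div expects log-probabilities for the student and probabilities
    # for the teacher; T^2 keeps gradient magnitudes comparable.
    return F.kl_div(student_log_probs, teacher_probs,
                    reduction="batchmean") * temperature ** 2


def student_loss(student_logits, teacher_logits, targets, alpha=0.5):
    """Combine the student's own task loss with the distillation term.

    alpha is an illustrative weighting between the two terms.
    """
    task = F.cross_entropy(student_logits, targets)
    distill = distillation_loss(student_logits, teacher_logits)
    return alpha * task + (1.0 - alpha) * distill
```

In this setup only the compact student is updated when new data arrive online, while the teacher, trained offline, supplies fixed soft targets; this is the general mechanism by which the number of trainable parameters, and hence the online inference latency, is reduced.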
Original language | English |
---|---|
Title of host publication | 2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) |
Editors | M Atzmuller, M Coscia, R Missaoui |
Publisher | IEEE Xplore |
Pages | 60-64 |
Number of pages | 5 |
ISBN (Electronic) | 978-1-7281-1056-1 |
ISBN (Print) | 978-1-7281-1057-8 |
DOIs | |
Publication status | Published - 10 Dec 2020 |
Event | 2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), The Hague, Netherlands. Duration: 7 Dec 2020 → 10 Dec 2020 |
Conference
Conference | 2020 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) |
---|---|
Period | 7/12/20 → 10/12/20 |
Keywords
- Analytical models
- Data models
- Predictive models
- Social networking (online)
- Task analysis
- model compression
- Dynamic graph representation learning
- knowledge distillation