A Data Driven Similarity Measure and Example Mapping Function for General , Unlabelled Data Sets

Damien Lejeune; Kurt Driessens

doi:10.3233/978-1-61499-672-9-158

A Data Driven Similarity Measure and Example Mapping Function for General , Unlabelled Data Sets

^*Corresponding author for this work

Robots, Agents, Interaction

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceeding › Academic › peer-review

Abstract

Deep networks such as autoencoders and deep belief nets are able to construct alternative, and often informative, repre-sentations of unlabeled data by searching for (hidden) structure and correlations between the features chosen to represent the data and combining them into new features that allow sparse representations of the data. These representations have been chosen to often increase the accuracy of further classification or regression accuracy when compared to the original, often human chosen representations. In this work, we attempt an investigation of the relation between such discovered representations found using related but differently repre-sented sets of examples. To this end, we combine the cross-domain comparison capabilities of unsupervised manifold alignment with the unsupervised feature construction of deep belief nets, resulting in an example mapping function that allows re-encoding examples from any source to any target task. Using the t-Distributed Stochastic Neighbour Embedding technique to map translated and real exam-ples to a lower dimensional space, we employ KL-divergence to de-fine a dissimilarity measure between data sets enabling us to measure found representation similarities between domains.

Original language	English
Title of host publication	Proceedings of the 22nd European Conference on Artificial Intelligence (ECAI'16)
Pages	158-166
Number of pages	9
DOIs	https://doi.org/10.3233/978-1-61499-672-9-158
Publication status	Published - 2016

Publication series

Series	Proceedings of the 22nd European Conference on Artificial Intelligence (ECAI'16)

Access to Document

10.3233/978-1-61499-672-9-158Licence: CC BY

Cite this

@inproceedings{a209a8a30aa64757872d07856d687f75,

title = "A Data Driven Similarity Measure and Example Mapping Function for General , Unlabelled Data Sets",

abstract = "Deep networks such as autoencoders and deep belief nets are able to construct alternative, and often informative, repre-sentations of unlabeled data by searching for (hidden) structure and correlations between the features chosen to represent the data and combining them into new features that allow sparse representations of the data. These representations have been chosen to often increase the accuracy of further classification or regression accuracy when compared to the original, often human chosen representations. In this work, we attempt an investigation of the relation between such discovered representations found using related but differently repre-sented sets of examples. To this end, we combine the cross-domain comparison capabilities of unsupervised manifold alignment with the unsupervised feature construction of deep belief nets, resulting in an example mapping function that allows re-encoding examples from any source to any target task. Using the t-Distributed Stochastic Neighbour Embedding technique to map translated and real exam-ples to a lower dimensional space, we employ KL-divergence to de-fine a dissimilarity measure between data sets enabling us to measure found representation similarities between domains.",

author = "Damien Lejeune and Kurt Driessens",

year = "2016",

doi = "10.3233/978-1-61499-672-9-158",

language = "English",

isbn = "9781614996729",

series = "Proceedings of the 22nd European Conference on Artificial Intelligence (ECAI'16)",

pages = "158--166",

booktitle = "Proceedings of the 22nd European Conference on Artificial Intelligence (ECAI'16)",

}

A Data Driven Similarity Measure and Example Mapping Function for General , Unlabelled Data Sets. / Lejeune, Damien; Driessens, Kurt.
Proceedings of the 22nd European Conference on Artificial Intelligence (ECAI'16). 2016. p. 158-166 (Proceedings of the 22nd European Conference on Artificial Intelligence (ECAI'16)).

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceeding › Academic › peer-review

TY - GEN

T1 - A Data Driven Similarity Measure and Example Mapping Function for General , Unlabelled Data Sets

AU - Lejeune, Damien

AU - Driessens, Kurt

PY - 2016

Y1 - 2016

N2 - Deep networks such as autoencoders and deep belief nets are able to construct alternative, and often informative, repre-sentations of unlabeled data by searching for (hidden) structure and correlations between the features chosen to represent the data and combining them into new features that allow sparse representations of the data. These representations have been chosen to often increase the accuracy of further classification or regression accuracy when compared to the original, often human chosen representations. In this work, we attempt an investigation of the relation between such discovered representations found using related but differently repre-sented sets of examples. To this end, we combine the cross-domain comparison capabilities of unsupervised manifold alignment with the unsupervised feature construction of deep belief nets, resulting in an example mapping function that allows re-encoding examples from any source to any target task. Using the t-Distributed Stochastic Neighbour Embedding technique to map translated and real exam-ples to a lower dimensional space, we employ KL-divergence to de-fine a dissimilarity measure between data sets enabling us to measure found representation similarities between domains.

AB - Deep networks such as autoencoders and deep belief nets are able to construct alternative, and often informative, repre-sentations of unlabeled data by searching for (hidden) structure and correlations between the features chosen to represent the data and combining them into new features that allow sparse representations of the data. These representations have been chosen to often increase the accuracy of further classification or regression accuracy when compared to the original, often human chosen representations. In this work, we attempt an investigation of the relation between such discovered representations found using related but differently repre-sented sets of examples. To this end, we combine the cross-domain comparison capabilities of unsupervised manifold alignment with the unsupervised feature construction of deep belief nets, resulting in an example mapping function that allows re-encoding examples from any source to any target task. Using the t-Distributed Stochastic Neighbour Embedding technique to map translated and real exam-ples to a lower dimensional space, we employ KL-divergence to de-fine a dissimilarity measure between data sets enabling us to measure found representation similarities between domains.

U2 - 10.3233/978-1-61499-672-9-158

DO - 10.3233/978-1-61499-672-9-158

M3 - Conference article in proceeding

SN - 9781614996729

T3 - Proceedings of the 22nd European Conference on Artificial Intelligence (ECAI'16)

SP - 158

EP - 166

BT - Proceedings of the 22nd European Conference on Artificial Intelligence (ECAI'16)

ER -