Largest Source Subset Selection for Instance Transfer

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingAcademicpeer-review

Abstract

Instance-transfer learning has emerged as a promising learning framework to boost performance of prediction models on newly-arrived tasks. The success of the framework depends on the relevance of the source data to the target data. This paper proposes a new approach to source data selection for instance-transfer learning. The approach is capable of selecting the largest subset S of the source data which relevance to the target data is statistically guaranteed to be the highest among any superset of S. The approach is formally described and theoretically justified. Experimental results on real-world data sets demonstrate that the approach outperforms existing instance selection methods.

Original languageEnglish
Title of host publicationProceedings of The 7th Asian Conference on Machine Learning
Place of PublicationHong Kong, China
Pages423-438
Number of pages16
Publication statusPublished - 2015

Fingerprint

Dive into the research topics of 'Largest Source Subset Selection for Instance Transfer'. Together they form a unique fingerprint.

Cite this