Concept Basis Extraction for Latent Space Interpretation of Image Classifiers

Research output: Contribution to journalConference article in journalAcademicpeer-review

Abstract

Previous research has shown that, to a large-extend, deep feature representations of image-patches that belong to the same semantic concept, lie in the same direction of an image classifier’s feature space. Conventional approaches compute these directions using annotated data, forming an interpretable feature space basis (also referred as concept basis). Unsupervised Interpretable Basis Extraction (UIBE) was recently proposed as a novel method that can suggest an interpretable basis without annotations. In this work, we show that the addition of a classification loss term to the unsupervised basis search, can lead to bases suggestions that align even more with interpretable concepts. This loss term enforces the basis vectors to point towards directions that maximally influence the classifier’s predictions, exploiting concept knowledge encoded by the network. We evaluate our work by deriving a concept basis for three popular convolutional networks, trained on three different datasets. Experiments show that our contributions enhance the interpretability of the learned bases, according to the interpretability metrics, by up-to +45.8% relative improvement. As additional practical contribution, we report hyper-parameters, found by hyper-parameter search in controlled benchmarks, that can serve as a starting point for applications of the proposed method in real-world scenarios that lack annotations.
Original languageEnglish
Pages (from-to)417-424
Number of pages8
JournalVISIGRAPP. Proceedings
Volume3
DOIs
Publication statusPublished - 2024
Event19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2024 - Rome, Italy
Duration: 27 Feb 202429 Feb 2024
Conference number: 19
https://portal.insticc.org/SubmissionDeadlines/63e42b885652b110e22e62c9

Keywords

  • Computer Vision
  • Concept Basis
  • Deep Learning
  • Explainable AI
  • Interpretability
  • Interpretable Basis
  • Unsupervised Learning

Fingerprint

Dive into the research topics of 'Concept Basis Extraction for Latent Space Interpretation of Image Classifiers'. Together they form a unique fingerprint.

Cite this