Explaining Boolean Classifiers with Non-monotonic Background Theories

Tjitze Rienstra*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingAcademicpeer-review

7 Downloads (Pure)

Abstract

Understanding why a classifier makes a certain prediction is crucial in high-stakes applications. It is also one of the central problems studied in the field of Explainable AI. To accurately explain predictions of a classifier, it is essential to take information about relationships between features into account. Many approaches, however, ignore this information. We address this problem in the context of symbolically encoded boolean classifiers. Darwiche and Hirth proposed the notion of sufficient reason (also called PI explanation or abductive explanation) to explain predictions of such classifiers. We show that sufficient reasons may be inaccurate and overly verbose, as they ignore information about relationships between features. We propose to represent this information using preferential models, which we use to encode hard as well as soft constraints between features. Preferential models define non-monotonic consequence relations that encode statements such as “birds typically fly” and “penguins typically don’t fly”. We introduce a number of ways to define reasons in the presence of background knowledge about the feature space, and we analyse these notions by means of general principles that characterise their behaviour.
Original languageEnglish
Title of host publicationArtificial Intelligence and Machine Learning - 35th Benelux Conference, BNAIC/Benelearn 2023, Revised Selected Papers
EditorsFrans A. Oliehoek, Manon Kok, Sicco Verwer
PublisherSpringer
Pages174-188
Number of pages15
Volume2187 CCIS
ISBN (Print)9783031746499
DOIs
Publication statusPublished - 2025
Event35th Benelux Conference on Artificial Intelligence and Machine Learning, BNAIC/Benelearn 2023 - TU Delft, Delft, Netherlands
Duration: 8 Nov 202310 Nov 2023
https://bnaic2023.tudelft.nl

Publication series

SeriesCommunications in Computer and Information Science
Volume2187 CCIS
ISSN1865-0929

Conference

Conference35th Benelux Conference on Artificial Intelligence and Machine Learning, BNAIC/Benelearn 2023
Country/TerritoryNetherlands
CityDelft
Period8/11/2310/11/23
Internet address

Keywords

  • Explainable AI
  • Machine Learning
  • Non-monotonic Reasoning

Fingerprint

Dive into the research topics of 'Explaining Boolean Classifiers with Non-monotonic Background Theories'. Together they form a unique fingerprint.

Cite this