Abstract
Understanding why a classifier makes a certain prediction is crucial in high-stakes applications. It is also one of the central problems studied in the field of Explainable AI. To accurately explain predictions of a classifier, it is essential to take information about relationships between features into account. Many approaches, however, ignore this information. We address this problem in the context of symbolically encoded boolean classifiers. Darwiche and Hirth proposed the notion of sufficient reason (also called PI explanation or abductive explanation) to explain predictions of such classifiers. We show that sufficient reasons may be inaccurate and overly verbose, as they ignore information about relationships between features. We propose to represent this information using preferential models, which we use to encode hard as well as soft constraints between features. Preferential models define non-monotonic consequence relations that encode statements such as “birds typically fly” and “penguins typically don’t fly”. We introduce a number of ways to define reasons in the presence of background knowledge about the feature space, and we analyse these notions by means of general principles that characterise their behaviour.
Original language | English |
---|---|
Title of host publication | Artificial Intelligence and Machine Learning - 35th Benelux Conference, BNAIC/Benelearn 2023, Revised Selected Papers |
Editors | Frans A. Oliehoek, Manon Kok, Sicco Verwer |
Publisher | Springer |
Pages | 174-188 |
Number of pages | 15 |
Volume | 2187 CCIS |
ISBN (Print) | 9783031746499 |
DOIs | |
Publication status | Published - 2025 |
Event | 35th Benelux Conference on Artificial Intelligence and Machine Learning, BNAIC/Benelearn 2023 - TU Delft, Delft, Netherlands Duration: 8 Nov 2023 → 10 Nov 2023 https://bnaic2023.tudelft.nl |
Publication series
Series | Communications in Computer and Information Science |
---|---|
Volume | 2187 CCIS |
ISSN | 1865-0929 |
Conference
Conference | 35th Benelux Conference on Artificial Intelligence and Machine Learning, BNAIC/Benelearn 2023 |
---|---|
Country/Territory | Netherlands |
City | Delft |
Period | 8/11/23 → 10/11/23 |
Internet address |
Keywords
- Explainable AI
- Machine Learning
- Non-monotonic Reasoning