Interpretable cost-sensitive regression through one-step boosting

Thomas Decorte; Jakob Raymaekers; Tim Verdonck

doi:10.1016/j.dss.2023.114024

Interpretable cost-sensitive regression through one-step boosting

Thomas Decorte, Jakob Raymaekers, Tim Verdonck^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

3 Downloads (Pure)

Abstract

In most practical prediction problems, such as regression and classification, the different types of prediction errors are not equally costly in the decision-making process. Although there exist numerous real-world cost-sensitive regression problems, ranging from loan charge-off forecasting to house price predictions, the literature on cost-sensitive learning mainly focuses on classification and only a few solutions are proposed for regression problems. These regressions are typically characterized by an asymmetric cost structure, where over- and underpredictions of a similar magnitude face vastly different costs. In this paper, we present a one-step boosting method (OSB) for cost-sensitive regression. The proposed methodology leverages a secondary learner to incorporate cost-sensitivity into an already trained cost-insensitive regression model. The secondary learner is defined as a linear function of certain variables deemed interesting for cost-sensitivity. These variables do not necessarily need to be the same as in the already trained model. An efficient optimization algorithm is achieved through iteratively reweighted least squares using the asymmetric cost function. The obtained results become interpretable through bootstrapping, enabling decision makers to distinguish important variables for cost-sensitivity as well as facilitating statistical inference. Applying different cost functions and various initial cost-insensitive learning methods on several public datasets consistently yields a significant reduction in the average misprediction cost, illustrating the excellent performance of our approach.

Original language	English
Article number	114024
Number of pages	13
Journal	Decision Support Systems
Volume	175
Early online date	10 Jun 2023
DOIs	https://doi.org/10.1016/j.dss.2023.114024
Publication status	Published - Dec 2023

Keywords

Asymmetric costs
Boosting
Cost-sensitive regression
Data mining
Interpretability

Access to Document

10.1016/j.dss.2023.114024

Full TextFinal published version, 974 KBLicence: Taverne

Cite this

@article{c489feafc6c54af89b218b990d6f8677,

title = "Interpretable cost-sensitive regression through one-step boosting",

abstract = "In most practical prediction problems, such as regression and classification, the different types of prediction errors are not equally costly in the decision-making process. Although there exist numerous real-world cost-sensitive regression problems, ranging from loan charge-off forecasting to house price predictions, the literature on cost-sensitive learning mainly focuses on classification and only a few solutions are proposed for regression problems. These regressions are typically characterized by an asymmetric cost structure, where over- and underpredictions of a similar magnitude face vastly different costs. In this paper, we present a one-step boosting method (OSB) for cost-sensitive regression. The proposed methodology leverages a secondary learner to incorporate cost-sensitivity into an already trained cost-insensitive regression model. The secondary learner is defined as a linear function of certain variables deemed interesting for cost-sensitivity. These variables do not necessarily need to be the same as in the already trained model. An efficient optimization algorithm is achieved through iteratively reweighted least squares using the asymmetric cost function. The obtained results become interpretable through bootstrapping, enabling decision makers to distinguish important variables for cost-sensitivity as well as facilitating statistical inference. Applying different cost functions and various initial cost-insensitive learning methods on several public datasets consistently yields a significant reduction in the average misprediction cost, illustrating the excellent performance of our approach.",

keywords = "Asymmetric costs, Boosting, Cost-sensitive regression, Data mining, Interpretability",

author = "Thomas Decorte and Jakob Raymaekers and Tim Verdonck",

note = "data source:",

year = "2023",

month = dec,

doi = "10.1016/j.dss.2023.114024",

language = "English",

volume = "175",

journal = "Decision Support Systems",

issn = "0167-9236",

publisher = "Elsevier",

}

TY - JOUR

T1 - Interpretable cost-sensitive regression through one-step boosting

AU - Decorte, Thomas

AU - Raymaekers, Jakob

AU - Verdonck, Tim

N1 - data source:

PY - 2023/12

Y1 - 2023/12

N2 - In most practical prediction problems, such as regression and classification, the different types of prediction errors are not equally costly in the decision-making process. Although there exist numerous real-world cost-sensitive regression problems, ranging from loan charge-off forecasting to house price predictions, the literature on cost-sensitive learning mainly focuses on classification and only a few solutions are proposed for regression problems. These regressions are typically characterized by an asymmetric cost structure, where over- and underpredictions of a similar magnitude face vastly different costs. In this paper, we present a one-step boosting method (OSB) for cost-sensitive regression. The proposed methodology leverages a secondary learner to incorporate cost-sensitivity into an already trained cost-insensitive regression model. The secondary learner is defined as a linear function of certain variables deemed interesting for cost-sensitivity. These variables do not necessarily need to be the same as in the already trained model. An efficient optimization algorithm is achieved through iteratively reweighted least squares using the asymmetric cost function. The obtained results become interpretable through bootstrapping, enabling decision makers to distinguish important variables for cost-sensitivity as well as facilitating statistical inference. Applying different cost functions and various initial cost-insensitive learning methods on several public datasets consistently yields a significant reduction in the average misprediction cost, illustrating the excellent performance of our approach.

AB - In most practical prediction problems, such as regression and classification, the different types of prediction errors are not equally costly in the decision-making process. Although there exist numerous real-world cost-sensitive regression problems, ranging from loan charge-off forecasting to house price predictions, the literature on cost-sensitive learning mainly focuses on classification and only a few solutions are proposed for regression problems. These regressions are typically characterized by an asymmetric cost structure, where over- and underpredictions of a similar magnitude face vastly different costs. In this paper, we present a one-step boosting method (OSB) for cost-sensitive regression. The proposed methodology leverages a secondary learner to incorporate cost-sensitivity into an already trained cost-insensitive regression model. The secondary learner is defined as a linear function of certain variables deemed interesting for cost-sensitivity. These variables do not necessarily need to be the same as in the already trained model. An efficient optimization algorithm is achieved through iteratively reweighted least squares using the asymmetric cost function. The obtained results become interpretable through bootstrapping, enabling decision makers to distinguish important variables for cost-sensitivity as well as facilitating statistical inference. Applying different cost functions and various initial cost-insensitive learning methods on several public datasets consistently yields a significant reduction in the average misprediction cost, illustrating the excellent performance of our approach.

KW - Asymmetric costs

KW - Boosting

KW - Cost-sensitive regression

KW - Data mining

KW - Interpretability

U2 - 10.1016/j.dss.2023.114024

DO - 10.1016/j.dss.2023.114024

M3 - Article

SN - 0167-9236

VL - 175

JO - Decision Support Systems

JF - Decision Support Systems

M1 - 114024

ER -