Symmetry Principles in Optimization Problems: an application to Protein Stability Prediction

Katrien Bernaerts; F. Pucci; M. Rooman; D. Gillis; Dimitri Gilis

doi:10.1016/j.ifacol.2015.05.068

Symmetry Principles in Optimization Problems: an application to Protein Stability Prediction

Katrien Bernaerts^*, F. Pucci, M. Rooman, D. Gillis, Dimitri Gilis

^*Corresponding author for this work

Research output: Contribution to journal › Conference article in journal › Academic › peer-review

Abstract

In this paper, we show how the adequate use of the intrinsic symmetry of a system when setting up its model structure can avoid unwanted biases in the parameter optimization phase. The playground of our analysis is the prediction of protein thermodynamic stability changes upon single amino acid substitutions (point mutations). Using a simple artificial neural network (ANN), sixteen different energy-like contributions are combined to predict the change in folding free energy (Delta Delta G). We show that the presence of terms violating the symmetry under inverse mutations induces a bias towards the dataset on which the ANN is trained, even if a strict n-fold cross-validation procedure is performed. A completely symmetric free energy functional is then introduced, which gives predictions that are slightly less efficient in terms of root mean square error with respect to the experimental Delta Delta G's, but appear to be basically independent of the training dataset and are thus more satisfactory. (C) 2015, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.

Original language	English
Pages (from-to)	458-463
Number of pages	6
Journal	IFAC-PapersOnLine
Volume	48
Issue number	1
DOIs	https://doi.org/10.1016/j.ifacol.2015.05.068
Publication status	Published - 2015
Event	8th Vienna International Conference on Mathematical Modelling - Vienna, Austria Duration: 18 Feb 2015 → 20 Feb 2015

Access to Document

10.1016/j.ifacol.2015.05.068

Cite this

@article{6b474fd9f8784c9fbef793cd2457e6b2,

title = "Symmetry Principles in Optimization Problems: an application to Protein Stability Prediction",

abstract = "In this paper, we show how the adequate use of the intrinsic symmetry of a system when setting up its model structure can avoid unwanted biases in the parameter optimization phase. The playground of our analysis is the prediction of protein thermodynamic stability changes upon single amino acid substitutions (point mutations). Using a simple artificial neural network (ANN), sixteen different energy-like contributions are combined to predict the change in folding free energy (Delta Delta G). We show that the presence of terms violating the symmetry under inverse mutations induces a bias towards the dataset on which the ANN is trained, even if a strict n-fold cross-validation procedure is performed. A completely symmetric free energy functional is then introduced, which gives predictions that are slightly less efficient in terms of root mean square error with respect to the experimental Delta Delta G's, but appear to be basically independent of the training dataset and are thus more satisfactory. (C) 2015, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.",

author = "Katrien Bernaerts and F. Pucci and M. Rooman and D. Gillis and Dimitri Gilis",

year = "2015",

doi = "10.1016/j.ifacol.2015.05.068",

language = "English",

volume = "48",

pages = "458--463",

journal = "IFAC-PapersOnLine",

issn = "2405-8963",

publisher = "IFAC Secretariat",

number = "1",

note = "8th Vienna International Conference on Mathematical Modelling, MATHMOD 2015 ; Conference date: 18-02-2015 Through 20-02-2015",

}

TY - JOUR

T1 - Symmetry Principles in Optimization Problems

T2 - 8th Vienna International Conference on Mathematical Modelling

AU - Bernaerts, Katrien

AU - Pucci, F.

AU - Rooman, M.

AU - Gillis, D.

AU - Gilis, Dimitri

PY - 2015

Y1 - 2015

N2 - In this paper, we show how the adequate use of the intrinsic symmetry of a system when setting up its model structure can avoid unwanted biases in the parameter optimization phase. The playground of our analysis is the prediction of protein thermodynamic stability changes upon single amino acid substitutions (point mutations). Using a simple artificial neural network (ANN), sixteen different energy-like contributions are combined to predict the change in folding free energy (Delta Delta G). We show that the presence of terms violating the symmetry under inverse mutations induces a bias towards the dataset on which the ANN is trained, even if a strict n-fold cross-validation procedure is performed. A completely symmetric free energy functional is then introduced, which gives predictions that are slightly less efficient in terms of root mean square error with respect to the experimental Delta Delta G's, but appear to be basically independent of the training dataset and are thus more satisfactory. (C) 2015, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.

AB - In this paper, we show how the adequate use of the intrinsic symmetry of a system when setting up its model structure can avoid unwanted biases in the parameter optimization phase. The playground of our analysis is the prediction of protein thermodynamic stability changes upon single amino acid substitutions (point mutations). Using a simple artificial neural network (ANN), sixteen different energy-like contributions are combined to predict the change in folding free energy (Delta Delta G). We show that the presence of terms violating the symmetry under inverse mutations induces a bias towards the dataset on which the ANN is trained, even if a strict n-fold cross-validation procedure is performed. A completely symmetric free energy functional is then introduced, which gives predictions that are slightly less efficient in terms of root mean square error with respect to the experimental Delta Delta G's, but appear to be basically independent of the training dataset and are thus more satisfactory. (C) 2015, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved.

U2 - 10.1016/j.ifacol.2015.05.068

DO - 10.1016/j.ifacol.2015.05.068

M3 - Conference article in journal

SN - 2405-8963

VL - 48

SP - 458

EP - 463

JO - IFAC-PapersOnLine

JF - IFAC-PapersOnLine

IS - 1

Y2 - 18 February 2015 through 20 February 2015

ER -