Beware of the Simulated DAG! Causal Discovery Benchmarks May Be Easy To Game

Alexander Reisach; Christof Seiler; Sebastian Weichwald

Alexander Reisach^*, Christof Seiler, Sebastian Weichwald

^*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceeding › Academic › peer-review

Abstract

Simulated DAG models may exhibit properties that, perhaps inadvertently, render their structure identifiable and unexpectedly affect structure learning algorithms. Here, we show that marginal variance tends to increase along the causal order for generically sampled additive noise models. We introduce varsortability as a measure of the agreement between the order of increasing marginal variance and the causal order. For commonly sampled graphs and model parameters, we show that the remarkable performance of some continuous structure learning algorithms can be explained by high varsortability and matched by a simple baseline method. Yet, this performance may not transfer to real-world data where varsortability may be moderate or dependent on the choice of measurement scales. On standardized data, the same algorithms fail to identify the ground-truth DAG or its Markov equivalence class. While standardization removes the pattern in marginal variance, we show that data generating processes that incur high varsortability also leave a distinct covariance pattern that may be exploited even after standardization. Our findings challenge the significance of generic benchmarks with independently drawn parameters. The code is available at https://github.com/Scriddie/Varsortability.
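The variance pattern described in the abstract can be illustrated with a small simulation. The following is a minimal sketch and not the authors' implementation: the function names (`simulate_linear_anm`, `order_agreement`, `standardize`), the fully connected graph, the weight range, and the unit-variance Gaussian noise are all assumptions made for illustration. The pairwise score used here is a simplified stand-in for varsortability, and the paper's exact definition may differ.

```python
import random
import statistics


def simulate_linear_anm(n_nodes=5, n_samples=2000, seed=0):
    """Draw samples from a linear additive noise model on a fully connected DAG.

    Node j has every node i < j as a parent, so 0, 1, ..., n_nodes - 1 is a
    causal order. Weight range and unit-variance Gaussian noise are assumptions.
    """
    rng = random.Random(seed)
    # weights[j][i] is the coefficient of parent i in the equation for node j.
    weights = [
        [rng.choice([-1.0, 1.0]) * rng.uniform(0.5, 2.0) for _ in range(j)]
        for j in range(n_nodes)
    ]
    data = []
    for _ in range(n_samples):
        x = []
        for j in range(n_nodes):
            parents = sum(w * xi for w, xi in zip(weights[j], x))
            x.append(parents + rng.gauss(0.0, 1.0))
        data.append(x)
    return data


def column_variances(data):
    """Sample variance of each variable (column) in a list of rows."""
    return [statistics.variance(col) for col in zip(*data)]


def order_agreement(variances, causal_order):
    """Fraction of (earlier, later) pairs whose variance increases; ties count 0.5.

    Simplified illustrative score, not necessarily the paper's definition.
    """
    score, pairs = 0.0, 0
    for a in range(len(causal_order)):
        for b in range(a + 1, len(causal_order)):
            v_early = variances[causal_order[a]]
            v_late = variances[causal_order[b]]
            score += 1.0 if v_early < v_late else 0.5 if v_early == v_late else 0.0
            pairs += 1
    return score / pairs


def standardize(data):
    """Rescale every variable to zero mean and unit sample variance."""
    cols = list(zip(*data))
    means = [statistics.fmean(c) for c in cols]
    sds = [statistics.stdev(c) for c in cols]
    return [[(v - m) / s for v, m, s in zip(row, means, sds)] for row in data]


if __name__ == "__main__":
    raw = simulate_linear_anm()
    order = list(range(len(raw[0])))
    print("raw variances:", column_variances(raw))
    print("agreement with causal order:", order_agreement(column_variances(raw), order))
    print("standardized variances:", column_variances(standardize(raw)))
```

After `standardize`, every variable has sample variance 1 up to floating-point error, so the order of marginal variances no longer carries information about the causal order. This mirrors the abstract's point that standardization removes the variance pattern.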

Original language	English
Title of host publication	Advances in Neural Information Processing Systems 34 pre-proceedings (NeurIPS 2021)
Editors	M. Ranzato, A. Beygelzimer, P.S. Liang, J.W. Vaughan, Y. Dauphin
Publisher	Neural Information Processing Systems Foundation
Number of pages	13
Publication status	Published - 2021
Event	35th Conference on Neural Information Processing Systems (NeurIPS), virtual-only; duration: 6 Dec 2021 → 14 Dec 2021; https://nips.cc/Conferences/2021

Publication series

Series	Advances in Neural Information Processing Systems
Volume	34
ISSN	1049-5258

Conference

Conference	35th Conference on Neural Information Processing Systems (NeurIPS)
Period	6 Dec 2021 → 14 Dec 2021
Internet address	https://nips.cc/Conferences/2021

Keywords

NETWORKS

Access to Document

https://proceedings.neurips.cc/paper/2021/file/e987eff4a7c7b7e580d659feb6f60c1a-Paper.pdf

Cite this

Reisach, A., Seiler, C., & Weichwald, S. (2021). Beware of the Simulated DAG! Causal Discovery Benchmarks May Be Easy To Game. In M. Ranzato, A. Beygelzimer, P. S. Liang, J. W. Vaughan, & Y. Dauphin (Eds.), Advances in Neural Information Processing Systems 34 pre-proceedings (NeurIPS 2021). Neural Information Processing Systems Foundation. https://proceedings.neurips.cc/paper/2021/file/e987eff4a7c7b7e580d659feb6f60c1a-Paper.pdf

Reisach, Alexander ; Seiler, Christof ; Weichwald, Sebastian. / Beware of the Simulated DAG! Causal Discovery Benchmarks May Be Easy To Game. Advances in Neural Information Processing Systems 34 pre-proceedings (NeurIPS 2021). editor / M. Ranzato ; A. Beygelzimer ; P.S. Liang ; J.W. Vaughan ; Y. Dauphin. Neural Information Processing Systems Foundation, 2021. (Advances in Neural Information Processing Systems, Vol. 34).

@inproceedings{e880018b05a84ecba89f2493e36e1ee8,

title = "Beware of the Simulated DAG! Causal Discovery Benchmarks May Be Easy To Game",

abstract = "Simulated DAG models may exhibit properties that, perhaps inadvertently, render their structure identifiable and unexpectedly affect structure learning algorithms. Here, we show that marginal variance tends to increase along the causal order for generically sampled additive noise models. We introduce varsortability as a measure of the agreement between the order of increasing marginal variance and the causal order. For commonly sampled graphs and model parameters, we show that the remarkable performance of some continuous structure learning algorithms can be explained by high varsortability and matched by a simple baseline method. Yet, this performance may not transfer to real-world data where varsortability may be moderate or dependent on the choice of measurement scales. On standardized data, the same algorithms fail to identify the ground-truth DAG or its Markov equivalence class. While standardization removes the pattern in marginal variance, we show that data generating processes that incur high varsortability also leave a distinct covariance pattern that may be exploited even after standardization. Our findings challenge the significance of generic benchmarks with independently drawn parameters. The code is available at https://github.com/Scriddie/Varsortability.",

keywords = "NETWORKS",

author = "Alexander Reisach and Christof Seiler and Sebastian Weichwald",

year = "2021",

language = "English",

series = "Advances in Neural Information Processing Systems",

editor = "M. Ranzato and A. Beygelzimer and P.S. Liang and J.W. Vaughan and Y. Dauphin",

booktitle = "Advances in Neural Information Processing Systems 34 pre-proceedings (NeurIPS 2021)",

publisher = "Neural Information Processing Systems Foundation",

address = "United States",

note = "35th Conference on Neural Information Processing Systems (NeurIPS) ; Conference date: 06-12-2021 Through 14-12-2021",

url = "https://nips.cc/Conferences/2021",

}

Reisach, A, Seiler, C & Weichwald, S 2021, Beware of the Simulated DAG! Causal Discovery Benchmarks May Be Easy To Game. in M Ranzato, A Beygelzimer, PS Liang, JW Vaughan & Y Dauphin (eds), Advances in Neural Information Processing Systems 34 pre-proceedings (NeurIPS 2021). Neural Information Processing Systems Foundation, Advances in Neural Information Processing Systems, vol. 34, 35th Conference on Neural Information Processing Systems (NeurIPS), 6/12/21. <https://proceedings.neurips.cc/paper/2021/file/e987eff4a7c7b7e580d659feb6f60c1a-Paper.pdf>

Beware of the Simulated DAG! Causal Discovery Benchmarks May Be Easy To Game. / Reisach, Alexander; Seiler, Christof; Weichwald, Sebastian.
Advances in Neural Information Processing Systems 34 pre-proceedings (NeurIPS 2021). ed. / M. Ranzato; A. Beygelzimer; P.S. Liang; J.W. Vaughan; Y. Dauphin. Neural Information Processing Systems Foundation, 2021. (Advances in Neural Information Processing Systems, Vol. 34).

TY - GEN

T1 - Beware of the Simulated DAG! Causal Discovery Benchmarks May Be Easy To Game

AU - Reisach, Alexander

AU - Seiler, Christof

AU - Weichwald, Sebastian

PY - 2021

Y1 - 2021

N2 - Simulated DAG models may exhibit properties that, perhaps inadvertently, render their structure identifiable and unexpectedly affect structure learning algorithms. Here, we show that marginal variance tends to increase along the causal order for generically sampled additive noise models. We introduce varsortability as a measure of the agreement between the order of increasing marginal variance and the causal order. For commonly sampled graphs and model parameters, we show that the remarkable performance of some continuous structure learning algorithms can be explained by high varsortability and matched by a simple baseline method. Yet, this performance may not transfer to real-world data where varsortability may be moderate or dependent on the choice of measurement scales. On standardized data, the same algorithms fail to identify the ground-truth DAG or its Markov equivalence class. While standardization removes the pattern in marginal variance, we show that data generating processes that incur high varsortability also leave a distinct covariance pattern that may be exploited even after standardization. Our findings challenge the significance of generic benchmarks with independently drawn parameters. The code is available at https://github.com/Scriddie/Varsortability.

AB - Simulated DAG models may exhibit properties that, perhaps inadvertently, render their structure identifiable and unexpectedly affect structure learning algorithms. Here, we show that marginal variance tends to increase along the causal order for generically sampled additive noise models. We introduce varsortability as a measure of the agreement between the order of increasing marginal variance and the causal order. For commonly sampled graphs and model parameters, we show that the remarkable performance of some continuous structure learning algorithms can be explained by high varsortability and matched by a simple baseline method. Yet, this performance may not transfer to real-world data where varsortability may be moderate or dependent on the choice of measurement scales. On standardized data, the same algorithms fail to identify the ground-truth DAG or its Markov equivalence class. While standardization removes the pattern in marginal variance, we show that data generating processes that incur high varsortability also leave a distinct covariance pattern that may be exploited even after standardization. Our findings challenge the significance of generic benchmarks with independently drawn parameters. The code is available at https://github.com/Scriddie/Varsortability.

KW - NETWORKS

M3 - Conference article in proceeding

T3 - Advances in Neural Information Processing Systems

BT - Advances in Neural Information Processing Systems 34 pre-proceedings (NeurIPS 2021)

A2 - Ranzato, M.

A2 - Beygelzimer, A.

A2 - Liang, P.S.

A2 - Vaughan, J.W.

A2 - Dauphin, Y.

PB - Neural Information Processing Systems Foundation

T2 - 35th Conference on Neural Information Processing Systems (NeurIPS)

Y2 - 6 December 2021 through 14 December 2021

ER -

Reisach A, Seiler C, Weichwald S. Beware of the Simulated DAG! Causal Discovery Benchmarks May Be Easy To Game. In Ranzato M, Beygelzimer A, Liang PS, Vaughan JW, Dauphin Y, editors, Advances in Neural Information Processing Systems 34 pre-proceedings (NeurIPS 2021). Neural Information Processing Systems Foundation. 2021. (Advances in Neural Information Processing Systems, Vol. 34).
