PredNet and Predictive Coding: A Critical Review

Roshan Rane; Edit Szügyi; Vageesh Saxena; André Ofner; Sebastian Stober

doi:10.1145/3372278.3390694

Research output: Contribution to conference › Paper › Academic

Abstract

PredNet, a deep predictive coding network developed by Lotter et al., combines a biologically inspired architecture based on the propagation of prediction error with self-supervised representation learning in video. While the architecture has drawn a lot of attention and various extensions of the model exist, there is a lack of a critical analysis. We fill in the gap by evaluating PredNet both as an implementation of the predictive coding theory and as a self-supervised video prediction model using a challenging video action classification dataset. We design an extended model to test if conditioning future frame predictions on the action class of the video improves the model performance. We show that PredNet does not yet completely follow the principles of predictive coding. The proposed top-down conditioning leads to a performance gain on synthetic data, but does not scale up to the more complex real-world action classification dataset. Our analysis is aimed at guiding future research on similar architectures based on the predictive coding theory.

Original language	English
Pages	233-241
DOIs	https://doi.org/10.1145/3372278.3390694
Publication status	Published - 8 Jun 2020

Access to Document

10.1145/3372278.3390694 (Licence: CC BY)

https://arxiv.org/abs/1906.11902

Cite this

@conference{185c5c22e28c43ec90f4ab3943c17625,

title = "PredNet and Predictive Coding: A Critical Review",

abstract = "PredNet, a deep predictive coding network developed by Lotter et al., combines a biologically inspired architecture based on the propagation of prediction error with self-supervised representation learning in video. While the architecture has drawn a lot of attention and various extensions of the model exist, there is a lack of a critical analysis. We fill in the gap by evaluating PredNet both as an implementation of the predictive coding theory and as a self-supervised video prediction model using a challenging video action classification dataset. We design an extended model to test if conditioning future frame predictions on the action class of the video improves the model performance. We show that PredNet does not yet completely follow the principles of predictive coding. The proposed top-down conditioning leads to a performance gain on synthetic data, but does not scale up to the more complex real-world action classification dataset. Our analysis is aimed at guiding future research on similar architectures based on the predictive coding theory.",

author = "Roshan Rane and Edit Sz{\"u}gyi and Vageesh Saxena and Andr{\'e} Ofner and Sebastian Stober",

note = "Publisher Copyright: {\textcopyright} 2020 ACM.",

year = "2020",

month = jun,

day = "8",

doi = "10.1145/3372278.3390694",

language = "English",

pages = "233--241",

}

TY - CONF

T1 - PredNet and Predictive Coding: A Critical Review

AU - Rane, Roshan

AU - Szügyi, Edit

AU - Saxena, Vageesh

AU - Ofner, André

AU - Stober, Sebastian

PY - 2020/6/8

Y1 - 2020/6/8

AB - PredNet, a deep predictive coding network developed by Lotter et al., combines a biologically inspired architecture based on the propagation of prediction error with self-supervised representation learning in video. While the architecture has drawn a lot of attention and various extensions of the model exist, there is a lack of a critical analysis. We fill in the gap by evaluating PredNet both as an implementation of the predictive coding theory and as a self-supervised video prediction model using a challenging video action classification dataset. We design an extended model to test if conditioning future frame predictions on the action class of the video improves the model performance. We show that PredNet does not yet completely follow the principles of predictive coding. The proposed top-down conditioning leads to a performance gain on synthetic data, but does not scale up to the more complex real-world action classification dataset. Our analysis is aimed at guiding future research on similar architectures based on the predictive coding theory.

U2 - 10.1145/3372278.3390694

DO - 10.1145/3372278.3390694

M3 - Paper

SP - 233

EP - 241

ER -