2017

Overcoming Catastrophic Forgetting in Neural Networks

J. Kirkpatrick, Razvan Pascanu, N. C. Rabinowitz, J. Veness, G. Desjardins, A. A. Rusu, K. Milan, J. Quan, T. Ramalho, A. G. Barwinska, Demis Hassabis, C. Clopath, D. Kumaran, Raia Hadsell

Cite Score: 85

AI summary

This paper introduces elastic weight consolidation (EWC), an algorithm that overcomes catastrophic forgetting by selectively slowing down learning on the weights that are important for previously learned tasks. The authors demonstrate EWC on a set of classification tasks based on the MNIST handwritten digit dataset and on several Atari 2600 games learned sequentially.

Main Contributions

  • Introduces elastic weight consolidation (EWC), an algorithm that mitigates catastrophic forgetting in neural networks by selectively slowing down learning on the weights that are important for previously learned tasks.
  • EWC leverages a Bayesian perspective, tempering new learning with a prior based on the posterior distribution of parameters from previous tasks.
  • Demonstrates that EWC enables continual learning in deep neural networks without significant performance degradation on previously learned tasks.
  • Applies EWC to a sequence of supervised learning tasks built from the permuted MNIST dataset, where it retains performance on earlier tasks.
  • Extends EWC to reinforcement learning, where a single agent learns several Atari 2600 games sequentially.
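
The contributions above can be illustrated with a small sketch. It is not the authors' implementation: it assumes the commonly stated form of the EWC objective, in which the new-task loss is augmented with a quadratic penalty (lambda / 2) * sum_i F_i * (theta_i - theta_star_i)^2, where theta_star holds the weights learned on the earlier task and F_i is a diagonal Fisher information estimate. The function names (diagonal_fisher, ewc_penalty, ewc_penalty_grad, ewc_step) and the plain-list representation of weights are hypothetical choices made for illustration only.

```python
from typing import List, Sequence


def diagonal_fisher(per_example_grads: Sequence[Sequence[float]]) -> List[float]:
    """Estimate a diagonal Fisher term per weight as the mean squared
    per-example gradient of the log-likelihood (an assumed, common estimator)."""
    n = len(per_example_grads)
    dim = len(per_example_grads[0])
    return [sum(g[i] ** 2 for g in per_example_grads) / n for i in range(dim)]


def ewc_penalty(theta: Sequence[float], theta_star: Sequence[float],
                fisher: Sequence[float], lam: float) -> float:
    """Quadratic penalty (lam / 2) * sum_i F_i * (theta_i - theta_star_i)^2."""
    return 0.5 * lam * sum(
        f * (t - s) ** 2 for f, t, s in zip(fisher, theta, theta_star)
    )


def ewc_penalty_grad(theta: Sequence[float], theta_star: Sequence[float],
                     fisher: Sequence[float], lam: float) -> List[float]:
    """Gradient of the penalty: lam * F_i * (theta_i - theta_star_i)."""
    return [lam * f * (t - s) for f, t, s in zip(fisher, theta, theta_star)]


def ewc_step(theta: Sequence[float], new_task_grad: Sequence[float],
             theta_star: Sequence[float], fisher: Sequence[float],
             lam: float, lr: float) -> List[float]:
    """One gradient-descent step on (new-task loss + EWC penalty).

    new_task_grad is the gradient of the new task's loss at theta; how it is
    computed is outside the scope of this sketch.
    """
    pen = ewc_penalty_grad(theta, theta_star, fisher, lam)
    return [t - lr * (g + p) for t, g, p in zip(theta, new_task_grad, pen)]
```

In this sketch a weight with a large Fisher value is pulled back toward its old value whenever the new task moves it, while a weight with a Fisher value of zero is updated exactly as in ordinary gradient descent. That is one way to read "selectively slowing down learning" on important weights.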

Abstract

The ability to learn tasks in a sequential fashion is crucial to the development of artificial intelligence. Neural networks are not, in general, capable of this, and it has been widely thought that catastrophic forgetting is an inevitable feature of connectionist models. We show that it is possible to overcome this limitation and train networks that can maintain expertise on tasks which they have not experienced for a long time. Our approach remembers old tasks by selectively slowing down learning on the weights important for those tasks. We demonstrate our approach is scalable and effective by solving a set of classification tasks based on the MNIST handwritten digit dataset and by learning several Atari 2600 games sequentially.

