2017

Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks

Chelsea Finn, P. Abbeel, Sergey Levine

citations

Cite Score

90

AI summary

This paper introduces Model-Agnostic Meta-Learning (MAML), a novel meta-learning algorithm for fast adaptation of deep networks across classification, regression, and reinforcement learning tasks, achieving state-of-the-art few-shot image classification results and accelerating policy gradient reinforcement learning.

Main Contributions

  • Proposes a model- and task-agnostic meta-learning algorithm (MAML) that trains model parameters for fast adaptation to new tasks.
  • Demonstrates the algorithm's effectiveness across different model types (fully connected, convolutional networks) and domains (few-shot regression, image classification, reinforcement learning).
  • Achieves state-of-the-art performance on few-shot image classification benchmarks compared to specialized one-shot learning methods.
  • Shows that MAML accelerates reinforcement learning in the presence of task variability, outperforming direct pretraining as initialization.
  • Provides an algorithm that can be readily applied to regression and can accelerate reinforcement learning.

Abstract

We propose an algorithm for meta-learning that is model-agnostic, in the sense that it is compatible with any model trained with gradient descent and applicable to a variety of different learning problems, including classification, regression, and reinforcement learning. The goal of meta-learning is to train a model on a variety of learning tasks, such that it can solve new learning tasks using only a small number of training samples. In our approach, the parameters of the model are explicitly trained such that a small number of gradient steps with a small amount of training data from a new task will produce good generalization performance on that task. In effect, our method trains the model to be easy to fine-tune. We demonstrate that this approach leads to state-of-the-art performance on two few-shot image classification benchmarks, produces good results on few-shot regression, and accelerates fine-tuning for policy gradient reinforcement learning with neural network policies.

Citation Graph

Loading graph...

References [40]

Sort:
Filter:

D. P. Kingma, Jimmy Lei Ba - 2014

49 papers in library cite

S. Ioffe, Christian Szegedy - 2015

18 papers in library cite

Ian J. Goodfellow, J. Shlens, Christian Szegedy - 2015

4 papers in library cite

R. Williams - 1992

11 papers in library cite

M. Abadi, Akshat Agarwal, P. Barham, E. Brevdo, Ziru Chen, C. Citro, G. Corrado, A. Davis, Jeffrey Dean, M. Devin, Sanjay Ghemawat, I. Goodfellow, A. Harp, Geoffrey Irving, M. Isard, Y. Jia, R. Jozefowicz, Lukasz Kaiser, M. Kudlur, J. Levenberg, D. Mane, R. Monga, S. Moore, D. Murray, Christopher Olah, M. Schuster, J. Shlens, B. Steiner, Ilya Sutskever, K. Talwar, P. Tucker, Vincent Vanhoucke, V. Vasudevan, F. Viegas, Oriol Vinyals, P. Warden, M. Wattenberg, M. Wicke, Y. Yu, Xiaoqiang Zheng - 2015

11 papers in library cite

J. Kirkpatrick, Razvan Pascanu, N. C. Rabinowitz, J. Veness, G. Desjardins, A. A. Rusu, K. Milan, J. Quan, T. Ramalho, A. G. Barwinska, Demis Hassabis, C. Clopath, D. Kumaran, Raia Hadsell - 2017

5 papers in library cite

J. Donahue, Y. Jia, Oriol Vinyals, J. Hoffman, N. Zhang, E. Tzeng, Trevor Darrell - 2014

15 papers in library cite

Sepp Hochreiter, A. Steven Younger, Peter R. Conwell - 2001

4 papers in library cite

Surya Ganguli - 2014

9 papers in library cite

Yoshua Bengio, Samy Bengio, Jocelyn Cloutier - 1990

1 paper in library cites

John Schulman, Sergey Levine, P. Abbeel, Michael I. Jordan, P. Moritz - 2015

4 papers in library cite

Oriol Vinyals, C. Blundell, T. Lillicrap, Daan Wierstra - 2016

2 papers in library cite

G. Koch - 2015

1 paper in library cites

S. Ravi, Hugo Larochelle - 2017

2 papers in library cite

Adam Santoro, S. Bartunov, M. Botvinick, Daan Wierstra, T. Lillicrap - 2016

2 papers in library cite

T. Salimans, D. A. Kingma, D. P. Diederik - 2016

4 papers in library cite

Sebastian Thrun, L. Pratt - 1998

3 papers in library cite

D. Ha, Andrew Dai, Quoc V. Le - 2016

3 papers in library cite

Y. Duan, John Schulman, X. Chen, P. L. Bartlett, Ilya Sutskever, P. Abbeel - 2016

2 papers in library cite

K. Li, Jitendra Malik - 2017

1 paper in library cites

E. Parisotto, Jimmy Lei Ba, Ruslan Salakhutdinov - 2015

2 papers in library cite

D. J. Rezende, S. Mohamed, Ivo Danihelka, K. Gregor, Daan Wierstra - 2016

2 papers in library cite

Jürgen Schmidhuber - 1992

4 papers in library cite

M. Andrychowicz, M. Denil, S. Gomez, M. W. Hoffman, D. Pfau, T. Schaul, N. D. Freitas - 2016

3 papers in library cite

E. Todorov, T. Erez, Y. Tassa - 2012

3 papers in library cite

Y. Duan, X. Chen, R. Houthooft, R. Rein, John Schulman, P. Abbeel - 2016

2 papers in library cite

Pranav Shyam, S. Gupta, A. Dukkipati - 2017

1 paper in library cites

P. Krahenbuhl, C. Doersch, J. Donahue, Trevor Darrell - 2016

1 paper in library cites

M. Husken, C. Goerick - 2000

1 paper in library cites

D. Maclaurin, David Duvenaud, R. A. Adams - 2015

1 paper in library cites

J. X. Wang, Z. K. Nelson, D. Tirumala, H. Soyer, J. Z. Leibo, Rémi Munos, C. Blundell, D. Kumaran, M. Botvinick - 2016

1 paper in library cites

Lukasz Kaiser, O. Nachum, A. Roy, Samy Bengio - 2017

1 paper in library cites

T. Munkhdalai, H. Yu - 2017

1 paper in library cites

D. K. Naik, R. J. Mammone - 1992

1 paper in library cites

Samy Bengio, Samy Bengio, Y. Yoshua, C. Jocelyn, J. Gecsei - 1992

1 paper in library cites

B. M. Lake, Ruslan Salakhutdinov, J. Gross, Joshua B. Tenenbaum - 2011

1 paper in library cites

M. Rei - 2015

1 paper in library cites

J. Snell, K. Swersky, Richard S. Zemel - 2017

1 paper in library cites

Harri Edwards, A. Storkey - 2017

1 paper in library cites

Cited by

4

papers in your library

Cites

22

papers in your library

Read

on November 5, 2025

Your review

Tags

Paper Aliases

No aliases