2018
Cite Score
10
AI summary
This paper introduces Iterated Amplification, a training strategy that builds training signals for complex problems by combining solutions to simpler subproblems, showing efficient learning of complex behaviors in algorithmic environments without external reward functions.
Main Contributions
Abstract
Many real world learning tasks involve complex or hard-to-specify objectives, and using an easier-to-specify proxy can lead to poor performance or misaligned behavior. One solution is to have humans provide a training signal by demonstrating or judging performance, but this approach fails if the task is too complicated for a human to directly evaluate. We propose Iterated Amplification, an alternative training strategy which progressively builds up a training signal for difficult problems by combining solutions to easier subproblems. Iterated Amplification is closely related to Expert Iteration (Anthony et al., 2017; Silver et al., 2017b), except that it uses no external reward function. We present results in algorithmic environments, showing that Iterated Amplification can efficiently learn complex behaviors.
Citation Graph
References [23]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin - 2017
47 papers in library cite
Paul Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, Dario Amodei - 2017
11 papers in library cite
Oriol Vinyals, M. Fortunato, Navdeep Jaitly - 2015
10 papers in library cite
Lukasz Kaiser, Ilya Sutskever - 2016
5 papers in library cite
Andrew Y. Ng, S. Russell - 2000
3 papers in library cite
Dario Amodei, Christopher Olah, Jacob Steinhardt, Paul Christiano, John Schulman, D. Mane - 2016
6 papers in library cite
Geoffrey Irving, Paul Christiano, Dario Amodei - 2018
8 papers in library cite
D. Silver, J. Schrittwieser, K. Simonyan, I. Antonoglou, A. Huang, A. Guez, T. Hubert, L. Baker, M. Lai, A. Bolton, Yanru Chen, T. Lillicrap, F. Hui, L. Sifre, G. V. D. Driessche, T. Graepel, Demis Hassabis - 2017
7 papers in library cite
Alex Graves, G. Wayne, M. Reynolds, T. Harley, Ivo Danihelka, A. G. Barwinska, S. G. Colmenarejo, Edward Grefenstette, T. Ramalho, J. Agapiou, A. P. Badia, K. M. Hermann, Y. Zwols, G. Ostrovski, A. Cain, H. King, C. Summerfield, Phil Blunsom, Koray Kavukcuoglu, Demis Hassabis - 2016
5 papers in library cite
D. Silver, T. Hubert, J. Schrittwieser, I. Antonoglou, M. Lai, A. Guez, M. Lanctot, L. Sifre, D. Kumaran, T. Graepel, T. Lillicrap, K. Simonyan, Demis Hassabis - 2017
5 papers in library cite
N. Bostrom - 2014
5 papers in library cite
T. Anthony, Z. Tian, D. Barber - 2017
4 papers in library cite
D. H. Menell, S. Russell, P. Abbeel, A. D. Dragan - 2016
3 papers in library cite
Chelsea Finn, Sergey Levine, P. Abbeel - 2016
3 papers in library cite
J. Macglashan, M. K. Ho, R. Loftin, B. Peng, D. Roberts, M. E. Taylor, M. L. Littman - 2017
3 papers in library cite
W. B. Knox, P. Stone - 2009
3 papers in library cite
J. Cai, R. Shin, Dawn Song - 2017
3 papers in library cite
J. Lehman, J. Clune, D. Misevic, C. Adami, J. Beaulieu, P. J. Bentley, S. Bernard, G. Belson, D. M. Bryson, N. Cheney, A. Cully, S. Doncieux, F. C. Dyer, K. O. Ellefsen, R. Feldt, S. Fischer, S. Forrest, A. Frenoy, C. Gagne, L. L. Goff, L. M. Grabowski, B. Hodjat, Frank Hutter, L. Keller, C. Knibbe, P. Krcah, R. E. Lenski, Hod Lipson, R. Maccurdy, C. Maestre, R. Miikkulainen, S. Mitri, D. E. Moriarty, J. B. Mouret, Anh Nguyen, C. Ofria, M. Parizeau, D. Parsons, R. T. Pennock, W. F. Punch, T. S. Ray, M. Schoenauer, E. Shulte, K. Sims, K. O. Stanley, F. Taddei, D. Tarapore, S. Thibault, W. Weimer, R. Watson, Jason Yosinski - 2018
3 papers in library cite
P. Abbeel, A. Ng - 2004
2 papers in library cite
Jack Clark, Dario Amodei - 2016
2 papers in library cite
P. M. Pilarski, M. R. Dawson, T. Degris, F. Fahimi, J. P. Carey, Richard S. Sutton - 2011
2 papers in library cite
A. Nowak, J. Bruna - 2016
1 paper in library cites
Arvind Neelakantan, Quoc V. Le, Ilya Sutskever - 2015
1 paper in library cites
Cited by
7
papers in your library
Cites
7
papers in your library
Read
on May 27, 2026
Your review
Tags
Paper Aliases
No aliases