Cite Score
72
AI summary
This paper introduces Pointer Networks (Ptr-Nets) that use attention mechanisms as pointers to solve combinatorial optimization problems with variable-sized output dictionaries. Ptr-Nets are applied to planar convex hulls, Delaunay triangulations, and the Travelling Salesman Problem, improving over sequence-to-sequence models and generalizing to variable-sized outputs.
Main Contributions
Abstract
We introduce a new neural architecture to learn the conditional probability of an output sequence with elements that are discrete tokens corresponding to positions in an input sequence. Such problems cannot be trivially addressed by existent approaches such as sequence-to-sequence [1] and Neural Turing Machines [2], because the number of target classes in each step of the output depends on the length of the input, which is variable. Problems such as sorting variable sized sequences, and various combinatorial optimization problems belong to this class. Our model solves the problem of variable size output dictionaries using a recently proposed mechanism of neural attention. It differs from the previous attention attempts in that, instead of using attention to blend hidden units of an encoder to a context vector at each decoder step, it uses attention as a pointer to select a member of the input sequence as the output. We call this architecture a Pointer Net (Ptr-Net). We show Ptr-Nets can be used to learn approximate solutions to three challenging geometric problems – finding planar convex hulls, computing Delaunay triangulations, and the planar Travelling Salesman Problem – using training examples alone. Ptr-Nets not only improve over sequence-to-sequence with input attention, but also allow us to generalize to variable size output dictionaries. We show that the learnt models generalize beyond the maximum lengths they were trained on. We hope our results on these tasks will encourage a broader exploration of neural learning for discrete problems.
Citation Graph
References [20]
Sepp Hochreiter, Jürgen Schmidhuber - 1997
94 papers in library cite
D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014
59 papers in library cite
D. E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams - 1986
46 papers in library cite
Ilya Sutskever, Oriol Vinyals, Quoc V. Le - 2014
58 papers in library cite
Dumitru Erhan - 2015
11 papers in library cite
Alex Graves, G. Wayne, Ivo Danihelka - 2014
18 papers in library cite
Jason Weston, S. Chopra, Antoine Bordes - 2015
18 papers in library cite
Geoffrey Hinton - 2015
9 papers in library cite
A. Robinson - 1994
9 papers in library cite
J. Donahue, L. Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, K. Saenko, Trevor Darrell - 2014
4 papers in library cite
N. Srivastava, E. Mansimov, Ruslan Salakhutdinov - 2015
3 papers in library cite
Wojciech Zaremba, Ilya Sutskever - 2014
8 papers in library cite
R. L. Graham - 1972
1 paper in library cites
2015
1 paper in library cites
F. P. Preparata, S. J. Hong - 1977
1 paper in library cites
R. Bellman - 1962
1 paper in library cites
S. Rebay - 1993
1 paper in library cites
R. A. Jarvis - 1973
1 paper in library cites
2015
1 paper in library cites
Cited by
10
papers in your library
Cites
12
papers in your library
Read
on October 30, 2025
Your review
Tags
Paper Aliases
No aliases