2015

Pointer Networks

Oriol Vinyals, M. Fortunato, Navdeep Jaitly

citations

Cite Score

72

AI summary

This paper introduces Pointer Networks (Ptr-Nets) that use attention mechanisms as pointers to solve combinatorial optimization problems with variable-sized output dictionaries. Ptr-Nets are applied to planar convex hulls, Delaunay triangulations, and the Travelling Salesman Problem, improving over sequence-to-sequence models and generalizing to variable-sized outputs.

Main Contributions

  • Introduces the Pointer Net architecture, which uses attention as a pointer to handle variable length dictionaries.
  • Applies Pointer Nets to three geometric problems: convex hulls, Delaunay triangulations, and the Travelling Salesman Problem (TSP).
  • Demonstrates that Pointer Nets improve over sequence-to-sequence models with input attention.
  • Shows that Pointer Nets can generalize to variable size output dictionaries, even beyond the maximum lengths they were trained on.
  • Achieves competitive performance on small-scale TSP problems, demonstrating the potential of data-driven approaches for computationally intractable problems.

Abstract

We introduce a new neural architecture to learn the conditional probability of an output sequence with elements that are discrete tokens corresponding to positions in an input sequence. Such problems cannot be trivially addressed by existent approaches such as sequence-to-sequence [1] and Neural Turing Machines [2], because the number of target classes in each step of the output depends on the length of the input, which is variable. Problems such as sorting variable sized sequences, and various combinatorial optimization problems belong to this class. Our model solves the problem of variable size output dictionaries using a recently proposed mechanism of neural attention. It differs from the previous attention attempts in that, instead of using attention to blend hidden units of an encoder to a context vector at each decoder step, it uses attention as a pointer to select a member of the input sequence as the output. We call this architecture a Pointer Net (Ptr-Net). We show Ptr-Nets can be used to learn approximate solutions to three challenging geometric problems – finding planar convex hulls, computing Delaunay triangulations, and the planar Travelling Salesman Problem – using training examples alone. Ptr-Nets not only improve over sequence-to-sequence with input attention, but also allow us to generalize to variable size output dictionaries. We show that the learnt models generalize beyond the maximum lengths they were trained on. We hope our results on these tasks will encourage a broader exploration of neural learning for discrete problems.

Citation Graph

Loading graph...

References [20]

Sort:
Filter:

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014

59 papers in library cite

D. E. Rumelhart, Geoffrey E. Hinton, Ronald J. Williams - 1986

46 papers in library cite

Ilya Sutskever, Oriol Vinyals, Quoc V. Le - 2014

58 papers in library cite

Dumitru Erhan - 2015

11 papers in library cite

Alex Graves, G. Wayne, Ivo Danihelka - 2014

18 papers in library cite

Jason Weston, S. Chopra, Antoine Bordes - 2015

18 papers in library cite

Geoffrey Hinton - 2015

9 papers in library cite

A. Robinson - 1994

9 papers in library cite

J. Donahue, L. Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, K. Saenko, Trevor Darrell - 2014

4 papers in library cite

N. Srivastava, E. Mansimov, Ruslan Salakhutdinov - 2015

3 papers in library cite

Wojciech Zaremba, Ilya Sutskever - 2014

8 papers in library cite

R. L. Graham - 1972

1 paper in library cites

Missing author list

2015

1 paper in library cites

F. P. Preparata, S. J. Hong - 1977

1 paper in library cites

R. Bellman - 1962

1 paper in library cites

R. A. Jarvis - 1973

1 paper in library cites

Missing author list

2015

1 paper in library cites

Missing author list

2015

1 paper in library cites

Cited by

10

papers in your library

Cites

12

papers in your library

Read

on October 30, 2025

Your review

Tags

Paper Aliases

No aliases