2016

Long Short-Term Memory-Networks for Machine Reading

Mirella Lapata

citations

Cite Score

51

AI summary

This paper introduces the Long Short-Term Memory-Network (LSTMN), a machine reading simulator that extends the LSTM architecture with a memory network and attention mechanism. It achieves comparable or better performance than state-of-the-art models on language modeling, sentiment analysis, and natural language inference tasks.

Main Contributions

  • Introduces the Long Short-Term Memory-Network (LSTMN) for machine reading.
  • Replaces the single memory cell of an LSTM with a memory network, enabling adaptive memory usage and induction of relations among tokens.
  • Demonstrates how to integrate the LSTMN with an encoder-decoder architecture.
  • Achieves performance comparable or better to state-of-the-art models on language modeling, sentiment analysis, and natural language inference.
  • Introduces an intra-attention mechanism that induces undirected relations among tokens and optimizes the entire network in downstream tasks.

Abstract

In this paper we address the question of how to render sequence-level networks better at handling structured input. We propose a machine reading simulator which processes text incrementally from left to right and performs shallow reasoning with memory and attention. The reader extends the Long Short-Term Memory architecture with a memory network in place of a single memory cell. This enables adaptive memory usage during recurrence with neural attention, offering a way to weakly induce relations among tokens. The system is initially designed to process a single sequence but we also demonstrate how to integrate it with an encoder-decoder architecture. Experiments on language modeling, sentiment analysis, and natural language inference show that our model matches or outperforms the state of the art.

Citation Graph

Loading graph...

References [50]

Sort:
Filter:

D. P. Kingma, Jimmy Lei Ba - 2014

49 papers in library cite

Sepp Hochreiter, Jürgen Schmidhuber - 1997

94 papers in library cite

Jeffrey Pennington, Richard Socher, Christopher D. Manning - 2014

31 papers in library cite

D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014

59 papers in library cite

Kyunghyun Cho, B. V. Merrienboer, C. G. Gulcehre, D. Bahdanau, F. Bougares, Holger Schwenk, Yoshua Bengio - 2014

38 papers in library cite

Yoon Kim - 2014

8 papers in library cite

Yoshua Bengio, Patrice Simard, Paolo Frasconi - 1994

31 papers in library cite

Quoc Le, Tomas Mikolov - 2014

13 papers in library cite

Richard Socher, A. Perelygin, Jeffrey Wu, J. Chuang, C. Manning, A. Ng, Christopher Potts - 2013

24 papers in library cite

Razvan Pascanu, Tomas Mikolov, Yoshua Bengio - 2013

21 papers in library cite

Tomas Mikolov, M. Karafiat, Lukas Burget, Jan Cernocky, Sanjeev Khudanpur - 2010

36 papers in library cite

Alex Graves - 2013

27 papers in library cite

Samuel R. Bowman, G. Angeli, Christopher Potts, Christopher D. Manning - 2015

25 papers in library cite

Phil Blunsom, Edward Grefenstette, N. Kalchbrenner - 2014

7 papers in library cite

K. M. Hermann, T. Kocisky, Edward Grefenstette, L. Espeholt, W. Kay, M. Suleyman, Phil Blunsom - 2015

31 papers in library cite

Alexander M. Rush, S. Chopra, Jason Weston - 2015

13 papers in library cite

S. Sukhbaatar, A. Szlam, Jason Weston, Rob Fergus - 2015

18 papers in library cite

Ido Dagan, O. Glickman, Bernardo Magnini - 2005

19 papers in library cite

Jason Weston, S. Chopra, Antoine Bordes - 2015

18 papers in library cite

Richard Socher, Eric H. Huang, J. Pennin, C. Manning, A. Ng - 2011

10 papers in library cite

Tim Rocktaschel, Edward Grefenstette, K. Hermann, T. Kocisky, Phil Blunsom - 2016

5 papers in library cite

K. S. Tai, Richard Socher, Christopher D. Manning - 2015

6 papers in library cite

A. Kumar, O. Irsoy, P. Ondruska, M. Iyyer, J. Bradbury, I. Gulrajani, Victor Zhong, R. Paulus, Richard Socher - 2015

9 papers in library cite

J. Chung, C. G. Gulcehre, Kyunghyun Cho, Yoshua Bengio - 2015

3 papers in library cite

J. Koutnik, K. Greff, Faustino Gomez, Jürgen Schmidhuber - 2014

4 papers in library cite

Wojciech Zaremba, Ilya Sutskever - 2014

8 papers in library cite

Shijie Wang, J. J. Jiang - 2016

3 papers in library cite

Tomas Mikolov, Armand Joulin, S. Chopra, M. Mathieu, Marc'aurelio Ranzato - 2015

8 papers in library cite

Sepp Hochreiter - 1991

18 papers in library cite

S. Bowman, J. Gauthier, Abhinav Rastogi, R. Gupta, C. Manning, Christopher Potts - 2016

5 papers in library cite

Caiming Xiong, S. Merity, Richard Socher - 2016

5 papers in library cite

Edward Grefenstette, K. Hermann, M. Suleyman, Phil Blunsom - 2015

5 papers in library cite

C. Dyer, M. Ballesteros, W. Ling, A. Matthews, N. Smith - 2015

2 papers in library cite

Fanqing Meng, Z. L. Lu, Zhuowen Tu, H. Li, Qian Liu - 2015

1 paper in library cites

Peter Clark, P. Harrison, N. Balasubramanian - 2013

1 paper in library cites

Dan Klein, C. Manning - 2004

1 paper in library cites

O. Irsoy, C. Cardie - 2014

1 paper in library cites

K. Yao, T. Cohn, K. Vylomova, K. Duh, C. Dyer - 2015

1 paper in library cites

K. Rayner - 1998

1 paper in library cites

A. Fader, S. Soderland, Oren Etzioni - 2011

1 paper in library cites

S. Frank, R. Bod - 2011

1 paper in library cites

M. Tanenhaus, M. S. Knowlton, K. Eberhard, J. Sedivy - 1995

1 paper in library cites

Jacob Andreas, M. Rohrbach, Trevor Darrell, Dan Klein - 2016

1 paper in library cites

L. Konieczny - 2000

1 paper in library cites

T. Lei, R. Barzilay, T. Jaakkola - 2015

1 paper in library cites

Oren Etzioni, A. Fader, J. Christensen, S. Soderland, Mausam - 2011

1 paper in library cites

F. Ferreira, J. Henderson - 1991

1 paper in library cites

K. Tran, A. Bisazza, C. Monz - 2016

1 paper in library cites

H. Poon, P. Domingos - 2010

1 paper in library cites

Cited by

8

papers in your library

Cites

28

papers in your library

Read

on October 17, 2025

Your review

Tags

Paper Aliases

No aliases