2016
Cite Score
34
AI summary
This paper introduces a neural model using LSTMs and a word-by-word neural attention mechanism for recognizing textual entailment, achieving state-of-the-art accuracy of 83.5% on the SNLI dataset, demonstrating improved reasoning capabilities through qualitative analysis of attention weights.
Main Contributions
Abstract
While most approaches to automatically recognizing entailment relations have used classifiers employing hand engineered features derived from complex natural language processing pipelines, in practice their performance has been only slightly better than bag-of-word pair classifiers using only lexical similarity. The only attempt so far to build an end-to-end differentiable neural network for entailment failed to outperform such a simple similarity classifier. In this paper, we propose a neural model that reads two sentences to determine entailment using long short-term memory units. We extend this model with a word-by-word neural attention mechanism that encourages reasoning over entailments of pairs of words and phrases. Furthermore, we present a qualitative analysis of attention weights produced by this model, demonstrating such reasoning capabilities. On a large entailment dataset this model outperforms the previous best neural model and a classifier with engineered features by a substantial margin. It is the first generic end-to-end differentiable system that achieves state-of-the-art accuracy on a textual entailment dataset.
Citation Graph
References [30]
D. P. Kingma, Jimmy Lei Ba - 2014
49 papers in library cite
Sepp Hochreiter, Jürgen Schmidhuber - 1997
94 papers in library cite
Tomas Mikolov, Ilya Sutskever, K. Chen, G. S. Corrado, Jeffrey Dean - 2013
32 papers in library cite
D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014
59 papers in library cite
Ilya Sutskever, Oriol Vinyals, Quoc V. Le - 2014
58 papers in library cite
K. Xu, Jimmy Lei Ba, R. Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, R. Zemel, Yoshua Bengio - 2015
12 papers in library cite
Alex Graves - 2013
27 papers in library cite
Samuel R. Bowman, G. Angeli, Christopher Potts, Christopher D. Manning - 2015
25 papers in library cite
V. Mnih, N. Heess, Alex Graves - 2014
5 papers in library cite
K. M. Hermann, T. Kocisky, Edward Grefenstette, L. Espeholt, W. Kay, M. Suleyman, Phil Blunsom - 2015
31 papers in library cite
Oriol Vinyals, M. Fortunato, Navdeep Jaitly - 2015
10 papers in library cite
Wojciech Zaremba, Ilya Sutskever, Oriol Vinyals - 2014
22 papers in library cite
Alexander M. Rush, S. Chopra, Jason Weston - 2015
13 papers in library cite
Alex Graves, G. Wayne, Ivo Danihelka - 2014
18 papers in library cite
S. Sukhbaatar, A. Szlam, Jason Weston, Rob Fergus - 2015
18 papers in library cite
Ido Dagan, O. Glickman, Bernardo Magnini - 2005
19 papers in library cite
Richard Socher, Eric H. Huang, J. Pennin, C. Manning, A. Ng - 2011
10 papers in library cite
Geoffrey Hinton - 2015
9 papers in library cite
Armand Joulin, Tomas Mikolov - 2015
9 papers in library cite
Alex Graves, Jürgen Schmidhuber - 2005
14 papers in library cite
B. Hu, Z. L. Lu, H. Li, Qinlang Chen - 2014
2 papers in library cite
Marco Marelli, L. Bentivogli, M. Baroni, R. Bernardi, S. Menini, R. Zamparelli - 2014
7 papers in library cite
A. Lai, J. Hockenmaier - 2014
5 papers in library cite
Edward Grefenstette, K. Hermann, M. Suleyman, Phil Blunsom - 2015
5 papers in library cite
J. Chorowski, D. Bahdanau, D. Serdyuk, Kyunghyun Cho, Yoshua Bengio - 2015
3 papers in library cite
W. Yin, Hinrich Schutze - 2015
2 papers in library cite
J. Zhao, T. T. Zhu, M. Lan - 2014
2 papers in library cite
S. Jimenez, G. Duenas, J. Baquero, A. Gelbukh, A. J. D. Batiz, A. Mendiz'abal - 2014
2 papers in library cite
G. Angeli, Christopher D. Manning - 2014
1 paper in library cites
I. Beltagy, S. Roller, P. Cheng, K. Erk, R. J. Mooney - 2015
1 paper in library cites
Cited by
5
papers in your library
Cites
21
papers in your library
Read
on October 23, 2025
Your review
Tags
Paper Aliases
No aliases