2016
Cite Score
52
AI summary
This paper introduces a decomposable attention model for natural language inference, achieving state-of-the-art results on the SNLI dataset with fewer parameters than previous LSTM-based approaches, by aligning local text substructure and aggregating the information using attention mechanisms.
Main Contributions
Abstract
We propose a simple neural architecture for natural language inference. Our approach uses attention to decompose the problem into subproblems that can be solved separately, thus making it trivially parallelizable. On the Stanford Natural Language Inference (SNLI) dataset, we obtain state-of-the-art results with almost an order of magnitude fewer parameters than previous work and without relying on any word-order information. Adding intra-sentence attention that takes a minimum amount of order into account yields further improvements.
Citation Graph
References [31]
Sepp Hochreiter, Jürgen Schmidhuber - 1997
94 papers in library cite
N. Srivastava, Geoffrey E. Hinton, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov - 2014
20 papers in library cite
Jeffrey Pennington, Richard Socher, Christopher D. Manning - 2014
31 papers in library cite
D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014
59 papers in library cite
John Duchi, Elad Hazan, Yoram Singer - 2011
19 papers in library cite
M. Abadi, Akshat Agarwal, P. Barham, E. Brevdo, Ziru Chen, C. Citro, G. Corrado, A. Davis, Jeffrey Dean, M. Devin, Sanjay Ghemawat, I. Goodfellow, A. Harp, Geoffrey Irving, M. Isard, Y. Jia, R. Jozefowicz, Lukasz Kaiser, M. Kudlur, J. Levenberg, D. Mane, R. Monga, S. Moore, D. Murray, Christopher Olah, M. Schuster, J. Shlens, B. Steiner, Ilya Sutskever, K. Talwar, P. Tucker, Vincent Vanhoucke, V. Vasudevan, F. Viegas, Oriol Vinyals, P. Warden, M. Wattenberg, M. Wicke, Y. Yu, Xiaoqiang Zheng - 2015
11 papers in library cite
Xavier Glorot, Antoine Bordes, Yoshua Bengio - 2011
17 papers in library cite
Yann Lecun, B. Boser, John S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackel - 1990
10 papers in library cite
Samuel R. Bowman, G. Angeli, Christopher Potts, Christopher D. Manning - 2015
25 papers in library cite
Mirella Lapata - 2016
8 papers in library cite
Tim Rocktaschel, Edward Grefenstette, K. Hermann, T. Kocisky, Phil Blunsom - 2016
5 papers in library cite
B. Hu, Z. L. Lu, H. Li, Qinlang Chen - 2014
2 papers in library cite
W. Yin, Hinrich Schutze, Bing Xiang, B. Zhou - 2016
1 paper in library cites
I. Vendrov, R. Kiros, Sanja Fidler, R. Urtasun - 2016
4 papers in library cite
Shijie Wang, J. J. Jiang - 2016
3 papers in library cite
S. Bowman, J. Gauthier, Abhinav Rastogi, R. Gupta, C. Manning, Christopher Potts - 2016
5 papers in library cite
P. Koehn - 2010
5 papers in library cite
B. Maccartney, Christopher D. Manning - 2009
4 papers in library cite
J. Bos, Katja Markert - 2005
4 papers in library cite
L. Mou, M. Rui, G. Li, Yiheng Xu, Li Zhang, R. Yan, Z. Jin - 2016
3 papers in library cite
Dipanjan Das, Noah A. Smith - 2009
3 papers in library cite
A. Fader, Luke Zettlemoyer, Oren Etzioni - 2013
3 papers in library cite
J. V. Benthem - 2008
2 papers in library cite
E. Marsi, E. Krahmer - 2005
2 papers in library cite
J. J. Katz - 1972
2 papers in library cite
A. Hickl, J. Bensley - 2007
1 paper in library cites
B. Maccartney, M. Galley, Christopher D. Manning - 2008
1 paper in library cites
M. Chang, D. Goldwasser, Dan Roth, Vivek Srikumar - 2010
1 paper in library cites
B. Maccartney, T. Grenager, M. D. Marneffe, D. Cer, Christopher D. Manning - 2006
1 paper in library cites
A. D. Haghighi, Andrew Y. Ng, Christopher D. Manning - 2005
1 paper in library cites
Jacob Andreas, A. Vlachos, S. Clark - 2013
1 paper in library cites
Cited by
11
papers in your library
Cites
15
papers in your library
Read
on August 4, 2025
Your review
Tags
Paper Aliases
No aliases