2017
Cite Score
59
AI summary
This paper introduces the Bi-Directional Attention Flow (BIDAF) network for machine comprehension, utilizing bi-directional attention flow and hierarchical processing to achieve state-of-the-art results on the SQuAD dataset and CNN/DailyMail cloze test.
Main Contributions
Abstract
Machine comprehension (MC), answering a query about a given context paragraph, requires modeling complex interactions between the context and the query. Recently, attention mechanisms have been successfully extended to MC. Typically these methods use attention to focus on a small portion of the context and summarize it with a fixed-size vector, couple attentions temporally, and/or often form a uni-directional attention. In this paper we introduce the Bi-Directional Attention Flow (BIDAF) network, a multi-stage hierarchical process that represents the context at different levels of granularity and uses bi-directional attention flow mechanism to obtain a query-aware context representation without early summarization. Our experimental evaluations show that our model achieves the state-of-the-art results in Stanford Question Answering Dataset (SQUAD) and CNN/DailyMail cloze test.
Citation Graph
References [33]
Sepp Hochreiter, Jürgen Schmidhuber - 1997
94 papers in library cite
N. Srivastava, Geoffrey E. Hinton, Alex Krizhevsky, Ilya Sutskever, Ruslan R. Salakhutdinov - 2014
20 papers in library cite
Jeffrey Pennington, Richard Socher, Christopher D. Manning - 2014
31 papers in library cite
D. Bahdanau, Kyunghyun Cho, Yoshua Bengio - 2014
59 papers in library cite
Yoon Kim - 2014
8 papers in library cite
P. Rajpurkar, J. Zhang, K. Lopyrev, Percy Liang - 2016
37 papers in library cite
Matthew D. Zeiler - 2012
13 papers in library cite
K. M. Hermann, T. Kocisky, Edward Grefenstette, L. Espeholt, W. Kay, M. Suleyman, Phil Blunsom - 2015
31 papers in library cite
Jason Weston, S. Chopra, Antoine Bordes - 2015
18 papers in library cite
M. Richardson, C. J. C. Burges, Erin Renshaw - 2013
16 papers in library cite
F. Hill, Antoine Bordes, S. Chopra, Jason Weston - 2015
14 papers in library cite
Deli Chen, J. Bolton, Christopher D. Manning - 2016
9 papers in library cite
S. Antol, A. Agrawal, J. Lu, M. Mitchell, D. Batra, C. L. Zitnick, D. Parikh - 2015
6 papers in library cite
R. K. Srivastava, K. Greff, Jürgen Schmidhuber - 2015
6 papers in library cite
A. Fukui, D. H. Park, Diyi Yang, A. Rohrbach, Trevor Darrell, M. Rohrbach - 2016
2 papers in library cite
Shijie Wang, J. J. Jiang - 2017
6 papers in library cite
R. Kadlec, M. Schmid, O. Bajgar, Jan Kleindienst - 2016
7 papers in library cite
Caiming Xiong, S. Merity, Richard Socher - 2016
5 papers in library cite
Caiming Xiong, Victor Zhong, Richard Socher - 2017
3 papers in library cite
S. Kobayashi, R. Tian, N. Okazaki, K. Inui - 2016
3 papers in library cite
Y. Yu, Wenxuan Zhang, K. S. Hasan, M. Yu, Bing Xiang, B. Zhou - 2016
3 papers in library cite
B. Dhingra, Haozhe Liu, W. W. Cohen, Ruslan Salakhutdinov - 2016
3 papers in library cite
K. Lee, S. Salant, T. Kwiatkowski, A. P. Parikh, Dipanjan Das, Jonathan Berant - 2017
3 papers in library cite
Y. Shen, P. Huang, Jianfeng Gao, Weizhu Chen - 2017
3 papers in library cite
Y. Cui, Ziru Chen, S. Wei, Shijie Wang, T. Liu, G. Hu - 2016
2 papers in library cite
A. Sordoni, P. Bachman, Yoshua Bengio - 2016
2 papers in library cite
A. Trischler, Z. Ye, X. Yuan, K. Suleman - 2016
2 papers in library cite
M. Malinowski, M. Rohrbach, M. Fritz - 2015
1 paper in library cites
Hu Xu, K. Saenko - 2016
1 paper in library cites
J. Lu, Jihan Yang, D. Batra, D. Parikh - 2016
1 paper in library cites
Zhilin Yang, X. He, Jianfeng Gao, L. Deng, A. Smola - 2015
1 paper in library cites
Yuxuan Zhu, O. Groth, M. S. Bernstein, Li Fei Fei - 2016
1 paper in library cites
Zhilin Yang, B. Dhingra, Y. Yuan, Jiaxi Hu, W. W. Cohen, Ruslan Salakhutdinov - 2016
1 paper in library cites
Cited by
13
papers in your library
Cites
17
papers in your library
Read
on October 24, 2025
Your review
Tags
Paper Aliases
No aliases