2017

Bidirectional Attention Flow for Machine Comprehension

M. Seo, A. Kembhavi, Ali Farhadi, Hannaneh Hajishirzi

Cite Score: 59

AI summary

This paper introduces the Bi-Directional Attention Flow (BIDAF) network for machine comprehension, utilizing bi-directional attention flow and hierarchical processing to achieve state-of-the-art results on the SQuAD dataset and CNN/DailyMail cloze test.

Main Contributions

  • Introduces the Bi-Directional Attention Flow (BIDAF) network, a multi-stage hierarchical architecture for machine comprehension.
  • Employs bi-directional attention flow to obtain a query-aware context representation.
  • Achieves state-of-the-art results on the Stanford Question Answering Dataset (SQuAD).
  • Achieves state-of-the-art results on the CNN/DailyMail cloze test.
  • Provides an in-depth ablation study of the model on the SQuAD development set.

Abstract

Machine comprehension (MC), answering a query about a given context paragraph, requires modeling complex interactions between the context and the query. Recently, attention mechanisms have been successfully extended to MC. Typically these methods use attention to focus on a small portion of the context and summarize it with a fixed-size vector, couple attentions temporally, and/or often form a uni-directional attention. In this paper we introduce the Bi-Directional Attention Flow (BIDAF) network, a multi-stage hierarchical process that represents the context at different levels of granularity and uses a bi-directional attention flow mechanism to obtain a query-aware context representation without early summarization. Our experimental evaluations show that our model achieves state-of-the-art results on the Stanford Question Answering Dataset (SQuAD) and the CNN/DailyMail cloze test.
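
To make "query-aware context representation without early summarization" concrete, here is a minimal sketch of a bi-directional attention step in plain Python. It follows the attention layer as described in the full paper, not anything stated on this page, and it is an illustration under stated assumptions, not the authors' implementation. The function names, the toy vectors, and the constant weight vector `w` are hypothetical; in the real model `w` is learned and the inputs come from recurrent encoders.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def similarity(h, u, w):
    # Trilinear score w . [h; u; h*u] between one context and one query vector.
    feats = h + u + [a * b for a, b in zip(h, u)]
    return sum(wi * fi for wi, fi in zip(w, feats))

def weighted_sum(weights, vectors):
    # Convex combination of equally sized vectors.
    dim = len(vectors[0])
    return [sum(wt * v[k] for wt, v in zip(weights, vectors)) for k in range(dim)]

def bidaf_attention(H, U, w):
    """H: T context vectors, U: J query vectors (all of size d); w: 3*d weights."""
    S = [[similarity(h, u, w) for u in U] for h in H]  # T x J similarity matrix
    # Context-to-query: each context word attends over the query words.
    U_tilde = [weighted_sum(softmax(row), U) for row in S]
    # Query-to-context: weight context words by their best-matching query word.
    b = softmax([max(row) for row in S])
    h_tilde = weighted_sum(b, H)
    # One output vector per context position, so the context is never
    # collapsed into a single fixed-size summary at this stage.
    G = []
    for h, ut in zip(H, U_tilde):
        G.append(h + ut
                 + [a * c for a, c in zip(h, ut)]
                 + [a * c for a, c in zip(h, h_tilde)])
    return G

# Toy inputs (assumed values): 3 context words, 2 query words, d = 2.
H = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
U = [[1.0, 0.0], [0.0, 1.0]]
w = [0.1] * 6
G = bidaf_attention(H, U, w)
print(len(G), len(G[0]))
```

The output keeps one vector of size 4*d for every context position, which is the property the abstract contrasts with methods that summarize the context into a single fixed-size vector.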
