2017

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension

M. Joshi, E. Choi, D. Weld, Luke Zettlemoyer

citations

Cite Score

67

AI summary

This paper introduces TriviaQA, a large-scale reading comprehension dataset with 650K question-answer-evidence triples from trivia enthusiasts, showing complexity, syntactic variability, and the need for multi-sentence reasoning, with baseline models underperforming human accuracy.

Main Contributions

  • Introduces TriviaQA, a dataset with over 650K question-answer-evidence triples, offering a new resource for reading comprehension models.
  • Demonstrates that TriviaQA contains complex questions with syntactic and lexical variability.
  • Shows that TriviaQA requires multi-sentence reasoning.
  • Presents a manual analysis of the dataset quality and the challenges involved.
  • Provides baseline experiments demonstrating that TriviaQA is not easily solved.

Abstract

We present TriviaQA, a challenging reading comprehension dataset containing over 650K question-answer-evidence triples. TriviaQA includes 95K question answer pairs authored by trivia enthusiasts and independently gathered evidence documents, six per question on average, that provide high quality distant supervision for answering the questions. We show that, in comparison to other recently introduced large-scale datasets, TriviaQA (1) has relatively complex, compositional questions, (2) has considerable syntactic and lexical variability between questions and corresponding answer-evidence sentences, and (3) requires more cross sentence reasoning to find answers. We also present two baseline algorithms: a feature based classifier and a state-of-the-art neural network, that performs well on SQuAD reading comprehension. Neither approach comes close to human performance (23% and 40% vs. 80%), suggesting that TriviaQA is a challenging testbed that is worth significant future study.

Citation Graph

Loading graph...

References [33]

Sort:
Filter:

K. Xu, Jimmy Lei Ba, R. Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhutdinov, R. Zemel, Yoshua Bengio - 2015

12 papers in library cite

P. Rajpurkar, J. Zhang, K. Lopyrev, Percy Liang - 2016

37 papers in library cite

K. M. Hermann, T. Kocisky, Edward Grefenstette, L. Espeholt, W. Kay, M. Suleyman, Phil Blunsom - 2015

31 papers in library cite

M. Seo, A. Kembhavi, Ali Farhadi, Hananneh Hajishirzi - 2017

13 papers in library cite

Guokun Lai, Q. Xie, Haozhe Liu, Yining Yang, Eduard Hovy - 2017

11 papers in library cite

M. Richardson, C. J. C. Burges, Erin Renshaw - 2013

16 papers in library cite

D. Paperno, German Kruszewski, A. Lazaridou, N. Q. Pham, R. Bernardi, S. Pezzelle, M. Baroni, G. Boleda, Raquel Fernandez - 2016

12 papers in library cite

F. Hill, Antoine Bordes, S. Chopra, Jason Weston - 2015

14 papers in library cite

Deli Chen, J. Bolton, Christopher D. Manning - 2016

9 papers in library cite

T. N. Nguyen, M. Rosenberg, X. Song, Jianfeng Gao, S. Tiwary, R. Majumder, L. Deng - 2016

8 papers in library cite

Antoine Bordes, Nicolas Usunier, S. Chopra, Jason Weston - 2015

5 papers in library cite

M. Dunn, L. Sagun, M. Higgins, V. U. Guney, V. Cirik, Kyunghyun Cho - 2017

5 papers in library cite

Jonathan Berant, A. Chou, R. Frostig, Percy Liang - 2013

8 papers in library cite

A. Trischler, Tianle Wang, X. Yuan, J. Harris, A. Sordoni, P. Bachman, K. Suleman - 2017

6 papers in library cite

T. Onishi, Haiming Wang, Mohit Bansal, Kevin Gimpel, D. Mcallester - 2016

4 papers in library cite

Yining Yang, W. T. Yih, C. Meek - 2015

4 papers in library cite

M. Iyyer, J. B. Graber, L. Claudino, Richard Socher - 2014

3 papers in library cite

D. Ferrucci, E. Brown, J. C. Carroll, J. Fan, D. Gondek, A. A. Kalyanpur, A. Lally, J. W. Murdock, E. Nyberg, J. Prager, N. Schlaefer, C. Welty - 2013

3 papers in library cite

P. L. Li, Wentao Li, Z. He, Xinpeng Wang, Yue Cao, Jingren Zhou, Weixin Xu - 2016

3 papers in library cite

Haiming Wang, Mohit Bansal, Kevin Gimpel, D. Mcallester - 2015

3 papers in library cite

Q. Wu, C. J. Burges, K. M. Svore, Jianfeng Gao - 2010

2 papers in library cite

E. M. Voorhees, D. M. Tice - 2000

2 papers in library cite

P. Pasupat, Percy Liang - 2015

2 papers in library cite

A. Fader, Luke Zettlemoyer, Oren Etzioni - 2014

2 papers in library cite

He He, J. B. Graber, K. Kwok, H. D. Iii - 2016

1 paper in library cites

J. B. Graber, B. Satinoff, He He, H. D. Iii - 2012

1 paper in library cites

Zhilin Yang, Diyi Yang, C. Dyer, X. He, A. Smola, Eduard Hovy - 2016

1 paper in library cites

M. Joshi, U. Sawant, S. Chakrabarti - 2014

1 paper in library cites

R. Hoffmann, Chiyuan Zhang, X. Ling, Luke Zettlemoyer, D. S. Weld - 2011

1 paper in library cites

Q. Cai, A. Yates - 2013

1 paper in library cites

U. Sawant, S. Chakrabarti - 2013

1 paper in library cites

Sebastian Riedel, L. Yao, Andrew Mccallum - 2010

1 paper in library cites

P. Ferragina, U. Scaiella - 2010

1 paper in library cites

Cited by

18

papers in your library

Cites

12

papers in your library

Read

on December 25, 2025

Your review

Tags

Paper Aliases

No aliases