2018

SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference

Yejin Choi

Cite Score

35

AI summary

This paper introduces SWAG, a new dataset with 113k multiple choice questions for grounded commonsense inference, a task that unifies natural language inference and commonsense reasoning. To address the annotation artifacts and human biases that recur in many existing datasets, the authors propose Adversarial Filtering (AF).

Main Contributions

  • Introduces the task of grounded commonsense inference, unifying natural language inference and commonsense reasoning.
  • Presents SWAG, a new dataset with 113k multiple choice questions.
  • Proposes Adversarial Filtering (AF), a novel procedure that constructs a de-biased dataset by iteratively training an ensemble of stylistic classifiers, and using them to filter the data.
  • Demonstrates that while humans can solve the resulting inference problems with high accuracy (88%), various competitive models struggle on the task.
  • Provides comprehensive analysis that indicates significant opportunities for future research.

Abstract

Given a partial description like "she opened the hood of the car," humans can reason about the situation and anticipate what might come next ("then, she examined the engine"). In this paper, we introduce the task of grounded commonsense inference, unifying natural language inference and commonsense reasoning. We present SWAG, a new dataset with 113k multiple choice questions about a rich spectrum of grounded situations. To address the recurring challenges of the annotation artifacts and human biases found in many existing datasets, we propose Adversarial Filtering (AF), a novel procedure that constructs a de-biased dataset by iteratively training an ensemble of stylistic classifiers, and using them to filter the data. To account for the aggressive adversarial filtering, we use state-of-the-art language models to massively oversample a diverse set of potential counterfactuals. Empirical results demonstrate that while humans can solve the resulting inference problems with high accuracy (88%), various competitive models struggle on our task. We provide comprehensive analysis that indicates significant opportunities for future research.
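The Adversarial Filtering loop described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the function names, the feature-vector inputs, and the nearest-centroid scorer, which stands in for the paper's ensemble of stylistic classifiers. The idea shown is only the general shape: train a classifier on one split to tell real endings from generated ones, then on the held-out split swap out negatives the classifier finds easy for candidates from an oversampled pool that it finds harder.

```python
import random

def train_centroids(reals, fakes):
    """Fit a toy 'stylistic classifier': one centroid per class (an assumption,
    not the paper's ensemble)."""
    dim = len(reals[0])
    mean = lambda rows: [sum(r[d] for r in rows) / len(rows) for d in range(dim)]
    return mean(reals), mean(fakes)

def real_score(model, x):
    """Higher means the toy classifier thinks x looks more like a real ending."""
    real_c, fake_c = model
    dist = lambda c: sum((a - b) ** 2 for a, b in zip(x, c))
    return dist(fake_c) - dist(real_c)

def adversarial_filter(reals, pools, k=3, iterations=10, seed=0):
    """reals[i]: hypothetical feature vector of the true ending of example i.
    pools[i]: feature vectors of oversampled machine-generated endings.
    Returns, per example, the indices into pools[i] of its k negatives."""
    rng = random.Random(seed)
    n = len(reals)
    assigned = [list(range(k)) for _ in range(n)]  # start with the first k candidates
    for _ in range(iterations):
        idx = list(range(n))
        rng.shuffle(idx)
        train, held_out = idx[: n // 2], idx[n // 2:]
        # Train on one half: real endings vs. currently assigned negatives.
        fakes = [pools[i][j] for i in train for j in assigned[i]]
        model = train_centroids([reals[i] for i in train], fakes)
        for i in held_out:
            score = lambda j: real_score(model, pools[i][j])
            gold = real_score(model, reals[i])
            # Unassigned candidates, sorted so the most real-looking is last.
            unused = sorted(
                (j for j in range(len(pools[i])) if j not in assigned[i]),
                key=score,
            )
            for slot, j in enumerate(list(assigned[i])):
                easy = score(j) < gold  # classifier separates it from the gold ending
                if easy and unused and score(unused[-1]) > score(j):
                    assigned[i][slot] = unused.pop()  # swap in a harder negative
    return assigned
```

With a pool that mixes obviously artificial candidates and near-realistic ones, a single iteration moves every held-out example onto the harder candidates while leaving the training half untouched; repeated iterations with fresh random splits gradually cover the whole dataset. A stronger classifier would make the filtering more aggressive, which is why a large oversampled pool of candidates is needed.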
