Papperoni

2013

MCTest: A Challenge Dataset for the Open-Domain Machine Comprehension of Text

M. Richardson, C. J. C. Burges, Erin Renshaw

Open PDF Google Scholar

citations

Cite Score

34

AI summary

This paper introduces MCTest, a new dataset of 500 fictional stories with multiple-choice questions, created using crowdsourcing, to advance open-domain machine comprehension, testing abilities like causal reasoning with a clear metric and restricted world knowledge.

Main Contributions

Introduces MCTest, a freely available dataset for machine comprehension of text.
Presents a scalable crowd-sourcing method to construct a dataset of 500 stories and 2000 questions.
Provides a clear metric for advancement on the machine comprehension of text.
Introduces two baseline systems using lexical features, achieving approximately 58% accuracy on test data.
Releases the dataset to encourage research and innovation in machine comprehension.

Abstract

We present MCTest, a freely available set of stories and associated questions intended for research on the machine comprehension of text. Previous work on machine comprehension (e.g., semantic modeling) has made great strides, but primarily focuses either on limited-domain datasets, or on solving a more restricted goal (e.g., open-domain relation extraction). In contrast, MCTest requires machines to answer multiple-choice reading comprehension questions about fictional stories, directly tackling the high-level goal of open-domain machine comprehension. Reading comprehension can test advanced abilities such as causal reasoning and understanding the world, yet, by being multiple-choice, still provide a clear metric. By being fictional, the answer typically can be found only in the story itself. The stories and questions are also carefully limited to those a young child would understand, reducing the world knowledge that is required for the task. We present the scalable crowd-sourcing methods that allow us to cheaply construct a dataset of 500 stories and 2000 questions. By screening workers (with grammar tests) and stories (with grading), we have ensured that the data is the same quality as another set that we manually edited, but at one tenth the editing cost. By being open-domain, yet carefully restricted, we hope MCTest will serve to encourage research and provide a clear metric for advancement on the machine comprehension of text.

Citation Graph

Loading graph...

References [25]

Sort:

Filter:

[1]The PASCAL Recognising Textual Entailment Challenge

Ido Dagan, O. Glickman, Bernardo Magnini - 2005

19 papers in library cite

It's very nice how they had the foresight to create a challenge that became relevant like 10 years later.

[2]The trec-8 Question Answering Track Evaluation

E. M. Voorhees, D. M. Tice - 1999

5 papers in library cite

[3]Deep Read: A Reading Comprehension System

L. Hirschman, M. Light, E. Breck, J. D. Burger - 1999

2 papers in library cite

[4]A Challenge Set for Advancing Language Modeling

Geoffrey Zweig, C. J. C. Burges - 2012

1 paper in library cites

[5]A Confidence Model for Syntactically-Motivated Entailment Proofs

A. Stern, Ido Dagan - 2011

1 paper in library cites

[6]A Survey of Modern Authorship Attribution Methods

E. Stamatatos - 2009

1 paper in library cites

[7]Age-of-Acquisition Ratings for 30,000 English Words

V. Kuperman, H. S. Gonzalez, M. Brysbaert - 2012

1 paper in library cites

[8]An Entailment-Based Approach to the QA4MRE Challenge

Peter Clark, P. Harrison, Xingcheng Yao - 2012

1 paper in library cites

[9]Automatic Gap-Fill Question Generation From Text Books

M. Agarwal, P. Mannem - 2011

1 paper in library cites

[10]Automatic Multi-Layer Corpus Annotation for Evaluating Question Answering Methods: CBC4Kids

J. L. Leidner, T. Dalmas, B. Webber, J. Bos, C. Grover - 2003

1 paper in library cites

[11]Confidence Driven Unsupervised Semantic Parsing

D. Goldwasser, R. Reichart, J. Clarke, Dan Roth - 2011

1 paper in library cites

[12]Factoid Question Answering Over Unstructured and Structured Content on the Web

S. Cucerzan, E. Agichtein - 2005

1 paper in library cites

[13]Learning Context-Dependent Mappings From Sentences to Logical Form

L. S. Zettlemoyer, Michael Collins - 2009

1 paper in library cites

[14]Learning Strategies for Story Comprehension: A Reinforcement Learning Approach

E. Grois, D. C. Wilkins - 2005

1 paper in library cites

[15]Learning to Parse Database Queries Using Inductive Logic Programming

J. M. Zelle, R. J. Mooney - 1996

1 paper in library cites

Missing year

[16]Lecture Notes in Computer Science

F. D. Buc

1 paper in library cites

[17]Looking Under the Hood: Tools for Diagnosing Your Question Answering Engine

E. Breck, M. Light, G. S. Mann, E. Riloff, B. Brown, P. Anand, M. Rooth, M. Thelen - 2001

1 paper in library cites

[18]Open-Domain Textual Question Answering Techniques

S. M. Harabagiu, S. J. Maiorano, M. A. Pasca - 2003

1 paper in library cites

[19]Reading Comprehension Tests for Computer-Based Understand Evaluation

B. Wellner, L. Ferro, W. Greiff, L. Hirschman - 2005

1 paper in library cites

[20]Running Experiments on Amazon Mechanical Turk

G. Paolacci, J. Chandler, P. Iperirotis - 2010

1 paper in library cites

[21]Story Understanding Resources

E. T. Mueller - 2010

1 paper in library cites

[22]The Labor Economics of Paid Crowdsourcing

J. Horton, L. Chilton - 2010

1 paper in library cites

[23]Toward a Model of Children's Story Comprehension

E. Charniak - 1972

1 paper in library cites

[24]Using Multiple Clause Constructors in Inductive Logic Programming for Semantic Parsing

L. R. Tang, R. J. Mooney - 2001

1 paper in library cites

[25]What Is Left to Be Understood in ATIS?

G. Tur, D. H. Tur, L. Heck - 2010

1 paper in library cites

Cited by

16

papers in your library

Cites

1

papers in your library

Read

on December 26, 2025

Maybe the best dataset paper I have ever read. So well explained, thoroughly thought! It's a shame it's a very small dataset...

Tags

Paper Aliases

No aliases