

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension


We present TriviaQA, a challenging reading comprehension dataset containing over 650K question-answer-evidence triples. TriviaQA includes 95K question-answer pairs authored by trivia enthusiasts and independently gathered evidence documents, six per question on average, that provide high-quality distant supervision for answering the questions. We show that, in comparison to other recently introduced large-scale datasets, TriviaQA (1) has relatively complex, compositional questions, (2) has considerable syntactic and lexical variability between questions and the corresponding answer-evidence sentences, and (3) requires more cross-sentence reasoning to find answers. We also present two baseline algorithms: a feature-based classifier and a state-of-the-art neural network that performs well on SQuAD reading comprehension. Neither approach comes close to human performance (23% and 40% vs. 80%), suggesting that TriviaQA is a challenging test bed that is worth significant future study.


Suggested Resources

The abstract is used to search the content of resources available in Topics. Results are sorted by relevance.

| # | Title | Author | Topic | Medium | Score |
|---|-------|--------|-------|--------|-------|
| 1 | TriviaQA: A Large Scale Dataset for Reading Comprehension and Question Answering | Mandar Joshi, Eunsol Choi, Daniel Weld, Luke Zettlemoyer | 412 | corpus | 298.37 |
| 2 | awesome-qa | seriousmac | 411 | resource | 281.19 |
| 3 | Recent Evolution of QA Datasets and Going Forward | Jiwoong Im | 412 | tutorial | 258.88 |
| 4 | SQuAD: 100,000+ Questions for Machine Comprehension of Text | Pranav Rajpurkar, Jian Zhang, Konstantin Lopyrev, Percy Liang | 999 | paper | 211.28 |
| 5 | The Stanford Question Answering Dataset | Pranav Rajpurkar | 411 | resource | 197.95 |
| 6 | Topics, Trends, and Resources in NLP | Mohit Bansal | 133 | tutorial | 191.21 |
| 7 | Natural language question answering: the view from here | L. Hirschman, R. Gaizauskas | 411 | survey | 183.70 |
| 8 | A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task | Danqi Chen, Jason Bolton, Christopher D. Manning | 999 | paper | 183.06 |
| 9 | Open-Domain Question Answering | Mark Andrew Greenwood | 411 | survey | 181.40 |
| 10 | WikiReading: A Novel Large-scale Language Understanding Task over Wikipedia | Daniel Hewlett, Alexandre Lacoste, Llion Jones, Illia Polosukhin, A... | 999 | paper | 178.36 |
| 11 | Natural Language Processing | Jacob Eisenstein | 711 | survey | 177.69 |
| 12 | Multilingual Relation Extraction using Compositional Universal Schema | Patrick Verga, David Belanger, Emma Strubell, Benjamin Roth, Andrew... | 999 | paper | 176.39 |
| 13 | NLP’s generalization problem, and how researchers are tackling it | Ana Marasovic | 711 | resource | 175.95 |
| 14 | Natural Language Processing | Jacob Eisenstein | 711 | survey | 175.63 |
| 15 | Paraphrase-Driven Learning for Open Question Answering | Anthony Fader, Luke Zettlemoyer, Oren Etzioni | 999 | paper | 175.13 |
| 16 | Key-Value Memory Networks for Directly Reading Documents | Alexander Miller, Adam Fisch, Jesse Dodge, Amir-Hossein Karimi, Ant... | 999 | paper | 174.73 |
| 17 | Opinion mining and sentiment analysis | Bo Pang and Lillian Lee | 381 | survey | 174.12 |
| 18 | Authorship Attribution | Patrick Juola | 965 | survey | 172.05 |
| 19 | Natural Language Processing for Precision Medicine | Hoifung Poon, Chris Quirk, Kristina Toutanova, Scott Wen-tau Yih | 973 | tutorial | 171.07 |
| 20 | TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension | Mandar Joshi, Eunsol Choi, Daniel Weld, Luke Zettlemoyer | 412 | library | 171.06 |
| 21 | An Introduction to Deep Learning for Natural Language Processing | Jianfeng Gao | 711 | tutorial | 170.73 |
| 22 | Natural Language Processing | Jacob Eisenstein | 711 | survey | 169.93 |
| 23 | Learning to Compose Neural Networks for Question Answering | Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Dan Klein | 999 | paper | 169.36 |
| 24 | Open-Domain Question Answering | John Prager | 411 | survey | 167.85 |
| 25 | Summarizing Source Code using a Neural Attention Model | Srinivasan Iyer, Ioannis Konstas, Alvin Cheung, Luke Zettlemoyer | 999 | paper | 167.42 |
| 26 | Memory Networks for Language Understanding | Jason Weston | | tutorial | 166.93 |
| 27 | Memory Networks for Language Understanding | Efstratios Gavves | 745 | lecture | 166.93 |
| 28 | Speech and Language Processing | Daniel Jurafsky, James H. Martin | 133 | survey | 164.91 |
| 29 | Automatic Summarization | Ani Nenkova and Kathleen McKeown | 421 | survey | 163.66 |
| 30 | Generating Natural Questions About an Image | Nasrin Mostafazadeh, Ishan Misra, Jacob Devlin, Margaret Mitchell, ... | 999 | paper | 163.61 |
| 31 | The Elements of Automatic Summarization | Daniel Jacob Gillick | 421 | survey | 163.15 |
| 32 | Tackling the Limits of Deep Learning for NLP | Richard Socher | | resource | 162.45 |
| 33 | Who did What: A Large-Scale Person-Centered Cloze Dataset | Takeshi Onishi, Hai Wang, Mohit Bansal, Kevin Gimpel, David McAllester | 999 | paper | 161.99 |
| 34 | An Introduction to Neural Information Retrieval | Bhaskar Mitra, Nick Craswell | 232 | survey | 160.78 |
| 35 | Combining Natural Logic and Shallow Reasoning for Question Answering | Gabor Angeli, Neha Nayak, Christopher D. Manning | 999 | paper | 160.69 |
| 36 | Long Short-Term Memory-Networks for Machine Reading | Jianpeng Cheng, Li Dong, Mirella Lapata | 999 | paper | 160.58 |
| 37 | Dialog Systems and Chatbots | Daniel Jurafsky, James H. Martin | 445 | survey | 160.20 |
| 38 | Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors | Marco Baroni, Georgiana Dinu, Germán Kruszewski | 999 | paper | 159.84 |
| 39 | Word Sense Disambiguation: A Survey | Roberto Navigli | 391 | survey | 159.77 |
| 40 | Search Engines Information Retrieval in Practice | W. Bruce Croft, Donald Metzler, Trevor Strohman | 612 | survey | 159.71 |
| 41 | Natural Language Processing with Python | Steven Bird, Ewan Klein, Edward Loper | 132 | survey | 158.75 |
| 42 | Textual Entailment | Ido Dagan, Dan Roth, Fabio Massimo Zanzotto | 351 | tutorial | 158.69 |
| 43 | Natural Language Inference, Reading Comprehension and Deep Learning | Christopher Manning | 711 | tutorial | 158.51 |
| 44 | Natural Language Data Management and Interfaces | Yunyao Li, Davood Rafiei | 974 | tutorial | 158.44 |
| 45 | Construction and Querying of Large-scale Knowledge Bases | Xiang Ren, Yu Su, Xifeng Yan | 974 | tutorial | 158.01 |
| 46 | Artificial Intelligence and Games | Georgios N. Yannakakis and Julian Togelius | 825 | survey | 156.46 |
| 47 | A Survey of Paraphrasing and Textual Entailment Methods | Ion Androutsopoulos, Prodromos Malakasiotis | 351 | survey | 155.88 |
| 48 | DrQA | Adam Fisch | 755 | library | 155.82 |
| 49 | Open Domain Question Answering: Techniques, Resources and Systems | Bernardo Magnini | 411 | tutorial | 155.52 |
| 50 | Question Answering Techniques for the World Wide Web | Jimmy Lin, Boris Katz | 411 | tutorial | 154.91 |