View Project


Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access


This paper proposes KB-InfoBot — a multi-turn dialogue agent w hich helps users search Knowledge Bases (KBs) without composing complicated queries. Such goal-oriented dialogue agents typically need to interact with an external database to a ccess real-world knowledge. Previous systems achieved this by issuing a symbolic query to the KB to retrieve entries based on their attributes. However, such symbolic operations break the differ entiability of the system and prevent end-to-end training of neural dialogue agents. In this paper, we address this limitation by replacing symbolic queries with an induced “soft” posterior dist ribution over the KB that indicates which entities the user is interested in. Integrating the soft retrieval process with a reinforcement learner leads to higher task success rate and reward in both s imulations and against real users. We also present a fully neural end-to-end agent, trained entirely from user feedback, and discuss its application towards personalized dialogue agents.



Suggested Topics

Full Matches (full topic name in abstract)

Partial Matches (at least half of words topic name appear in abstract)

Suggested Resources

Uses abstract to search the content of resources available in Topics. Sorted by relevance.

# Title Author Topic Medium Score
1 DEEP LEARNING FOR CHATBOTS, PART 1 - INTRODUCTION Denny Britz 445 tutorial 226.74
2 Neural Information Retrieval: At the End of the Early Years Kezban Dilek Onal, Ye Zhang, Ismail Sengor Altingovde, Md Mustafizur Rahman, Pinar Karagoz, Alex Braylan, Brandon Dang, Heng-Lu Chang, Henna Kim, Quinten McNamara, Aaron Angert, Edward Banner, Vivek K 713 resource 222.73
3 Rohan #2: Artificial intelligence, ?Progress/?Time Rohan Kapur 811 tutorial 208.91
4 Highlights of EMNLP 2017: Exciting Datasets, Return of the Clusters, and More! Sebastian Ruder 641 resource 199.48
5 A Beginner’s Guide to Deep Reinforcement Learning Adam Gibson, Chris Nicholson, Josh Patterson 857 library 191.63
6 Recent Advances in Document Summarization Jin-ge Yao, Xiaojun Wan, Jianguo Xiao 421 survey 189.54
7 Clustering cliques for graph-based summarization of the biomedical research literature Han Zhang, Marcelo Fiszman, Dongwook Shin, Bartomiej Wilkowski, Thomas Rindflesch 999 paper 183.60
8 Building Cross-Lingual End-to-End Product Search with Tensorflow Han Xiao 731 resource 177.01
9 The Definitive Guide to Natural Language Processing Javier Couto 133 tutorial 174.14
10 Machine Learning for Humans Vishal Maini, Samer Sabri 134 tutorial 172.15
11 Analyzing the Meaning of Sentences Steven Bird, Ewan Klein, Edward Loper 721 course 171.32
12 Introduction to Learning to Trade with Reinforcement Learning Denny Britz 857 resource 169.59
13 Simple Beginner’s guide to Reinforcement Learning & its implementation Faizan Shaikh 713 tutorial 169.25
14 Similarity-driven Semantic Role Induction via Graph Partitioning Joel Lang, Mirella Lapata 999 paper 168.46
15 The 7 NLP Techniques That Will Change How You Communicate in the Future (Part I) James Le 112 resource 167.38
16 Codra: A Novel Discriminative Framework for Rhetorical Analysis Shafiq Joty, Giuseppe Carenini, Raymond T. Ng 999 paper 167.37
17 Summaries and notes on Deep Learning research papers Denny Britz 713 resource 166.82
18 Recurrent Neural Networks Tutorial, Part 1 - Introduction to RNNs Denny Britz 741 tutorial 166.78
19 Artificial Intelligence’s Next Big Step: Reinforcement Learning Mary Branscombe 857 resource 163.46
20 Recurrent Neural Network Tutorial, Part 4 – Implementing a GRU/LSTM RNN with Python and Theano Denny Britz 742 tutorial 162.20
21 The Future (and Present) of Artificial Intelligence AMA Various Authors 811 resource 162.00
22 Deconstruction with Discrete Embeddings R2RT 711 resource 161.00
23 A Comparative Analysis of ChatBots APIs Author Unknown 921 resource 158.73
24 Chatsbots with Machine Learning: Building Neural Conversational Agents Dmitry Persiyanov 999 resource 156.84
25 Deep Learning Achievements Over the Past Year Eduard Tyantov 711 resource 156.49
26 Negated bio-events: analysis and identification Raheel Nawaz, Paul Thompson, Sophia Ananiadou 999 paper 155.81
27 The Evolution and Core Concepts of Deep Learning & Neural Networks Guest Blog 711 tutorial 155.74
28 State-of-the-art neural coreference resolution for chatbots Thomas Wolf 756 tutorial 155.64
30 Transfer Learning - Machine Learnings Next Frontier Sebastian Ruder 978 tutorial 153.90
31 Rules of Machine Learning: Best Practices for ML Engineering Martin Zinkevich 711 resource 153.72
32 Diversity and network coherence as indicators of interdisciplinarity: case studies in bionanoscience Ismael Rafols, Martin Meyer 999 paper 153.49
33 Recurrent Neural Networks Tutorial, Part 2 - Implementing a RNN with Python, Numpy, and Theano Denny Britz 741 tutorial 153.41
34 The bAbI project Jason Weston, Antoine Bordes, Sumit Chopra, Alexander M. Rush, Bart van Merriënboer, Armand Joulin and Tomas Mikolov 755 tutorial 152.70
35 The bAbI project Facebook 134 resource 152.70
36 Neural Text Embeddings for IR Bhaskar Mitra, Nick Craswell 721 tutorial 152.52
37 Generative Models Andrej Karpathy, Pieter Abbeel, Greg Brockman, Peter Chen, Vicki Cheung, Rocky Duan, Ian Goodfellow, Durk Kingma, Jonathan Ho, Rein Houthooft, Tim Salimans, John Schulman, Ilya Sutskever, Wojciech Zar 756 resource 151.32
38 Implementing the DistBelief Deep Neural Network Training Framework with Akka Alex Minnaar 713 tutorial 151.04
39 ACL 2017 Report Yuta Kikuchi, Sosuke Kobayashi 711 resource 150.21
40 Prodigy: A new tool for radically efficient machine teaching Matthew Honnibal, Ines Montani 134 resource 150.03
41 ParlAI FAIR 753 library 147.24
42 Your TL;DR by an AI: A Deep Reinforced Model for Abstractive Summarization Romain Paulus, Caiming Xiong and Richard Socher 754 resource 146.73
43 parlai 0.1.0 Alexander H Miller 756 library 146.72
44 Deep Learning for Computer Vision - Introduction to Convolution Neural Networks Aarshay Jain 744 tutorial 146.47
45 Recurrent Neural Networks Tutorial, Part 3- Backpropagation Through Time and Vanishing Gradients Denny Britz 741 tutorial 146.07
46 Do Altmetrics Work? Twitter and Ten Other Social Web Services Mike Thelwall, Stefanie Haustein, Vincent Larivière, Cassidy R. Sugimoto 999 paper 145.48
47 Getting Ready for AI based gaming agents - Overview of Open Source Reinforcement Learning Patterns Faizan Shaikh 713 resource 144.99
48 Quora Duplicate Questions Corpus Quora 151 corpus 143.91
49 DeepPavlov deepmipt 811 library 143.69
50 Lexicalization and Generative Power in Ccg Marco Kuhlmann, Alexander Koller, Giorgio Satta 999 paper 143.62
51 A large-scale community structure analysis in Facebook Emilio Ferrara 999 paper 143.57
52 Introduction to Neural Machine Translation with GPUs (part 3) Kyunghyun Cho 753 tutorial 142.29
53 Literature Mining for the Discovery of Hidden Connections between Drugs, Genes and Diseases Frijters, Raoul AND van Vugt, Marianne AND Smeets, Ruben AND van Schaik, René AND de Vlieg, Jacob AND Alkema, Wynand 999 paper 141.88
54 Reinforcement Learning Florentin Woergoetter, Bernd Porr 857 paper 141.44
55 Reinforcement Learning and DQN, learning to play from pixels Ruben Fiszel 857 tutorial 140.99
56 News Article Wikipedia Dataset Author Unknown 999 library 140.74
57 Gensim integration with scikit-learn and Keras Chinmaya Pancholi 713 library 140.02
58 An Overview of Multi-Task Learning in Deep Neural Networks Sebastian Ruder 829 tutorial 139.93
59 Open Machine Learning Course. Topic 3. Classification, Decision Trees and k Nearest Neighbors Yury Kashnitskiy 711 resource 139.47
60 Awesome Python Vinta 131 resource 139.17
61 Demystifying Deep Reinforcement Learning Tambet Matiisen 857 tutorial 139.04
62 Bayesian Statistics explained to Beginners in Simple English NSS 102 tutorial 138.88
63 Learning to Communicate Pieter Abbeel, Igor Mordatch, Ryan Lowe, Jon Gauthier & Jack Clark 861 tutorial 138.38
64 Deep Learning for NLP, advancements and trends in 2017 Javier 711 resource 138.36
65 Machine Learning For Beginners Divyansh Dwivedi 711 resource 137.48
66 Maluuba Frames Datasets Maluuba Frames 999 corpus 137.33
67 The history and meaning of the journal impact factor Eugene Garfield 999 paper 136.59
68 A general framework for analysing diversity in science, technology and society Andy Stirling 999 paper 136.45
69 19 Data Science Tools for people who aren’t so good at Programming Aarshay Jain 107 tutorial 135.77
70 Under the Hood with Reinforcement Learning – Understanding Basic RL Models Bill Vorhies 857 resource 135.60
71 The NeuroEvolution of Augmenting Topologies (NEAT) Users Page Author Unknown 999 resource 135.60
72 40 Interview Questions asked at Startups in Machine Learning / Data Science ANALYTICS VIDHYA CONTENT TEAM 107 tutorial 135.04
73 A Dozen Times Artificial Intelligence Startled the World Sumeet Agrawal 811 resource 134.96
74 Ideas on interpreting machine learning Patrick Hall, Wen Phan, SriSatish Ambati 134 tutorial 134.94
75 Introduction to Semi-Supervised Learning Xiaojin Zhu and Andrew B. Goldberg 581 survey 134.71
76 The Unreasonable Effectiveness of Recurrent Neural Networks Andrej Karpathy 741 survey 134.48
77 A New Multi-Turn Multi-Domain Task-Oriented Dialogue Dataset Stanford 756 corpus 133.84
78 Supervised learning is great — it’s data collection that’s broken Ines Montani, Matthew Honnibal 134 tutorial 133.69
79 Gimli: open source and high-performance biomedical name recognition David Campos, Sergio Matos, Jose Oliveira 999 paper 133.67
80 Deep Reinforcement Learning: Pong from Pixels Andrej Karpathy 857 tutorial 133.25
81 A Practitioner's Guide to Natural Language Processing (Part I)?—?Processing & Understanding Text Dipanjan (DJ) Sarker 112 resource 132.98
82 Four deep learning trends from ACL 2017: Part 2 Abigail See 713 resource 132.68
83 Discriminative Syntax-based Word Ordering for Text Generation Yue Zhang, Stephen Clark 999 paper 132.56
84 The Building Blocks of Interpretability Chris Olah 614 resource 132.44
85 An index to quantify an individual’s scientific research output that takes into account the effect of multiple coauthorship J. E. Hirsch 999 paper 132.38
86 Natural Language Processing (NLP) for Computational Social Science Cristian Danescu-Niculescu-Mizil, Lillian Lee 133 tutorial 131.08
87 Artificial Intelligence Demystified Guest Blog 811 resource 130.97
88 A Beginner's Guide to Reinforcement Learning (for Java) Adam Gibson, Chris Nicholson, Josh Patterson 857 resource 130.52
89 Neural Nets for Generating Music Kyle McDonald 999 resource 130.41
90 A survey of transfer learning Karl Weiss, Taghi M. Khoshgoftaar and DingDing Wang 978 resource 130.27
91 Recurrent Neural Networks Stephen Grossberg 741 paper 129.95
92 Learning AI if You Suck at Math?—?P7?—?The Magic of Natural Language Processing Daniel Jeffries 133 tutorial 129.35
93 Reinforcement Learning Part 3 – Challenges & Considerations Bill Vorhies 857 resource 129.26
94 Under the Hood with Reinforcement Learning - Understanding Basic RL Models William Vorhies 857 resource 129.19