

TutorialBank: Using a Manually-Collected Corpus for Prerequisite Chains, Survey Extraction and Resource Recommendation


The field of Natural Language Processing (NLP) is growing rapidly, with new research published daily along with an abundance of tutorials, codebases and other online resources. To learn this dynamic field or stay up to date on the latest research, students, educators and researchers must constantly sift through multiple sources to find valuable, relevant information. To address this situation, we introduce TutorialBank, a new, publicly available dataset which aims to facilitate NLP education and research. We have manually collected and categorized over 5,600 resources on NLP as well as the related fields of Artificial Intelligence (AI), Machine Learning (ML) and Information Retrieval (IR). Notably, our dataset is the largest manually picked corpus of resources intended for NLP education that is not limited to academic papers. Additionally, we have created both a search engine and a command-line tool for the resources and have annotated the corpus to include lists of research topics, relevant resources for each topic, prerequisite relations among topics and relevant sub-parts of individual resources, among other annotations. We are releasing the dataset and present several avenues for further research.



Suggested Topics (up to Top 50)

Full Matches (the full topic name appears in the abstract)

Partial Matches (at least half of the words in the topic name appear in the abstract)
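
The two matching rules above can be illustrated with a minimal sketch. It is not the site's actual implementation: the function name `match_topics`, the lowercase word tokenization, and the choice to treat a full match as a contiguous phrase are all assumptions made for illustration.

```python
import re


def _words(text):
    # Assumed tokenization: lowercase alphanumeric runs.
    return re.findall(r"[a-z0-9]+", text.lower())


def match_topics(abstract, topics):
    """Split topics into (full, partial) matches against an abstract.

    Hypothetical helper: a full match means the whole topic name occurs
    as a contiguous phrase; a partial match means at least half of the
    topic's words occur anywhere in the abstract.
    """
    tokens = _words(abstract)
    padded = " " + " ".join(tokens) + " "
    vocabulary = set(tokens)
    full, partial = [], []
    for topic in topics:
        words = _words(topic)
        if not words:
            continue  # skip empty topic names
        if " " + " ".join(words) + " " in padded:
            full.append(topic)
        elif 2 * sum(w in vocabulary for w in words) >= len(words):
            partial.append(topic)
    return full, partial


abstract = "We collected resources on machine learning and information retrieval."
topics = ["Machine Learning", "Information Extraction", "Dependency Parsing"]
full, partial = match_topics(abstract, topics)
```

Here "Machine Learning" would count as a full match, "Information Extraction" as a partial match (one of its two words appears), and "Dependency Parsing" as neither.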

Suggested Resources

Uses abstract to search the content of resources available in Topics. Sorted by relevance.

| # | Title | Author | Topic | Medium | Score |
|---|-------|--------|-------|--------|-------|
| 1 | Opinion mining and sentiment analysis | Bo Pang and Lillian Lee | 381 | survey | 224.91 |
| 2 | Natural Language Processing for Intelligent Access to Scientific Information | Francesco Ronzano, Horacio Saggion | 232 | tutorial | 222.28 |
| 3 | Scholarly Data Mining: Making Sense of Scientific Literature | Horacio Saggion, Francesco Ronzano | 974 | tutorial | 219.97 |
| 4 | Scholarly Data Mining: Making Sense of Scientific Literature | Horacio Saggion & Francesco Ronzano | 974 | tutorial | 219.97 |
| 5 | Automatic Summarization | Ani Nenkova and Kathleen McKeown | 421 | survey | 216.73 |
| 6 | Extracting World and Linguistic Knowledge from Wikipedia | Simone Paolo Ponzetto, Michael Strube | 232 | tutorial | 213.35 |
| 7 | Sentiment Analysis and Opinion Mining | Bing Liu | 381 | survey | 212.98 |
| 8 | Word Sense Disambiguation: A Survey | Roberto Navigli | 391 | survey | 206.11 |
| 9 | Text Data Management and Analysis | M. Tamer Özsu | 342 | survey | 205.78 |
| 10 | Natural language question answering: the view from here | L. Hirschman, R. Gaizauskas | 411 | survey | 203.82 |
| 11 | Information Extraction | Sunita Sarawagi | 232 | survey | 203.69 |
| 12 | Natural Language Data Management and Interfaces | Yunyao Li, Davood Rafiei | 974 | tutorial | 203.55 |
| 13 | Question Answering Techniques for the World Wide Web | Jimmy Lin, Boris Katz | 411 | tutorial | 200.47 |
| 14 | Artificial Intelligence and Games | Georgios N. Yannakakis and Julian Togelius | 825 | survey | 198.54 |
| 15 | Computational Analysis of Affect and Emotion in Language | Saif M. Mohammad, Cecilia Ovesdotter Alm | 381 | tutorial | 197.21 |
| 16 | A Survey on Automatic Text Summarization | Dipanjan Das, Andre F.T. Martins | 411 | survey | 196.71 |
| 17 | Natural Language Processing with Python | Steven Bird, Ewan Klein, Edward Loper | 132 | survey | 191.35 |
| 18 | Spoken Content Retrieval: A Survey of Techniques and Technologies | Martha Larson and Gareth J. F. Jones | 962 | survey | 191.20 |
| 19 | An Introduction to Neural Information Retrieval | Bhaskar Mitra, Nick Craswell | 232 | survey | 190.16 |
| 20 | Open-Domain Question Answering | John Prager | 411 | survey | 189.90 |
| 21 | Sentiment Analysis and Subjectivity | Bing Liu | 381 | survey | 188.67 |
| 22 | The Creation and Analysis of a Website Privacy Policy Corpus | Shomir Wilson, Florian Schaub, Aswarth Abhilash Dara, Frederick Liu... | 999 | paper | 188.47 |
| 23 | Information Extraction | Katharina Kaiser and Silvia Miksch | 232 | survey | 187.10 |
| 24 | Applications of Social Media Text Analysis | Atefeh Farzindar, Diana Inkpen | 957 | tutorial | 186.27 |
| 25 | A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories | Nasrin Mostafazadeh, Nathanael Chambers, Xiaodong He, Devi Parikh, ... | 999 | paper | 184.57 |
| 26 | A Survey of Text Summarization Techniques | Ani Nenkova, Kathleen McKeown | 421 | survey | 184.25 |
| 27 | All Fingers are not Equal: Intensity of References in Scientific Articles | Tanmoy Chakraborty, Ramasuri Narayanam | 999 | paper | 184.16 |
| 28 | Tables as Semi-structured Knowledge for Question Answering | Sujay Kumar Jauhar, Peter Turney, Eduard Hovy | 999 | paper | 184.02 |
| 29 | Search Engines: Information Retrieval in Practice | W. Bruce Croft, Donald Metzler, Trevor Strohman | 612 | survey | 183.98 |
| 30 | Web Crawling | Christopher Olston and Marc Najork | 625 | survey | 183.21 |
| 31 | Cross Lingual Information Retrieval | Jian-Yun Nie | 611 | survey | 182.40 |
| 32 | Introduction to Text Summarization and Other Information Access Technologies | Horacio Saggion | 421 | tutorial | 182.27 |
| 33 | Neural Information Retrieval: At the End of the Early Years | Kezban Dilek Onal, Ye Zhang, Ismail Sengor Altingovde, Md Mustafizu... | 713 | resource | 181.99 |
| 34 | Automated Question Answering: Review of the Main Approaches | Andrea Andrenucci, Eriks Sneiders | 411 | survey | 181.51 |
| 35 | Tutorial on BioText Mining | Martin Krallinger | 973 | tutorial | 180.35 |
| 36 | Opinion Mining: Exploiting the Sentiment of the Crowd | Diana Maynard, Adam Funk, Kalina Bontcheva | 381 | tutorial | 180.22 |
| 37 | The Elements of Automatic Summarization | Daniel Jacob Gillick | 421 | survey | 180.02 |
| 38 | Named Entity Recognition and Classification | David Nadeau, Satoshi Sekine | 232 | survey | 177.98 |
| 39 | Open-Domain Question Answering | Mark Andrew Greenwood | 411 | survey | 177.86 |
| 40 | From Frequency to Meaning: Vector Space Models of Semantics | Peter D. Turney, Patrick Pantel | 322 | paper | 177.52 |
| 41 | Introduction to the Special Issue on Summarization | Dragomir R. Radev, Eduard Hovy, Kathleen McKeown | 421 | survey | 176.67 |
| 42 | Generating Natural Questions About an Image | Nasrin Mostafazadeh, Ishan Misra, Jacob Devlin, Margaret Mitchell, ... | 999 | paper | 176.39 |
| 43 | Semantic Role Labeling: An Introduction to the Special Issue | Lluis Marquez, Xavier Carreras, Kenneth C. Litkowski, Suzanne Stevenson | 367 | survey | 175.91 |
| 44 | SQuAD: 100,000+ Questions for Machine Comprehension of Text | Pranav Rajpurkar, Jian Zhang, Konstantin Lopyrev, Percy Liang | 999 | paper | 174.47 |
| 45 | A Survey of Paraphrasing and Textual Entailment Methods | Ion Androutsopoulos, Prodromos Malakasiotis | 351 | survey | 173.21 |
| 46 | Topics, Trends, and Resources in NLP | Mohit Bansal | 133 | tutorial | 173.14 |
| 47 | Authorship Attribution | Patrick Juola | 965 | survey | 172.56 |
| 48 | Multilingual Sentiment and Subjectivity Analysis | Rada Mihalcea, Carmen Banea, Janyce Wiebe | 381 | tutorial | 172.29 |
| 49 | Endangered Languages | Richard Littauer | | survey | 170.33 |
| 50 | A Survey of Automatic Query Expansion in Information Retrieval | Claudio Carpineto and Giovanni Romano | 232 | survey | 170.00 |