Paper: Google Web 1T 5-Grams Made Easy (but not for the computer)

ACL ID W10-1505
Title Google Web 1T 5-Grams Made Easy (but not for the computer)
Venue Proceedings of the NAACL HLT 2010 Sixth Web as Corpus Workshop
Year 2010
Authors Stefan Evert

This paper introduces Web1T5-Easy, a simple indexing solution that allows interactive searches of the Web 1T 5-gram database and a derived database of quasi-collocations. The latter is validated against co-occurrence data from the BNC and ukWaC on the automatic identification of non-compositional VPC.

@InProceedings{evert:2010:WAC6,
  author    = {Evert, Stefan},
  title     = {Google Web 1T 5-Grams Made Easy (but not for the computer)},
  booktitle = {Proceedings of the NAACL HLT 2010 Sixth Web as Corpus Workshop},
  month     = {June},
  year      = {2010},
  address   = {NAACL-HLT, Los Angeles},
  publisher = {Association for Computational Linguistics},
  pages     = {32--40},
  url       = {http://www.aclweb.org/anthology/W10-1505}
}