Paper: Building a Korean Web Corpus for Analyzing Learner Language

ACL ID W10-1502
Title Building a Korean Web Corpus for Analyzing Learner Language
Venue Proceedings of the NAACL HLT 2010 Sixth Web as Corpus Workshop
Year 2010
Authors Markus Dickinson, Ross Israel, Sun-Hee Lee

Post-positional particles are a significant source of errors for learners of Korean. Following methodology that has proven effective in handling English preposition errors, we are beginning the process of building a machine learner for particle error detection in L2 Korean writing. As a first step, however, we must acquire data, and thus we present a methodology for constructing large-scale corpora of Korean from the Web, exploring the feasibility of building corpora appropriate for a given topic and grammatical construction.

@InProceedings{dickinson-israel-lee:2010:WAC6,
  author    = {Dickinson, Markus  and  Israel, Ross  and  Lee, Sun-Hee},
  title     = {Building a Korean Web Corpus for Analyzing Learner Language},
  booktitle = {Proceedings of the NAACL HLT 2010 Sixth Web as Corpus Workshop},
  month     = {June},
  year      = {2010},
  address   = {NAACL-HLT, Los Angeles},
  publisher = {Association for Computational Linguistics},
  pages     = {8--16},
  url       = {http://www.aclweb.org/anthology/W10-1502}
}