Paper: k-NN For Local Probability Estimation In Generative Parsing Models

ACL ID W05-1528
Title k-NN For Local Probability Estimation In Generative Parsing Models
Venue Workshop On Parsing Technology
Session  
Year 2005
Authors

We describe a history-based generative parsing model which uses a k-nearest neighbour (k-NN) technique to estimate the model’s parameters. Taking the output of a base n-best parser we use our model to re-estimate the log probability of each parse tree in the n-best list for sentences from the Penn Wall Street Journal treebank. By further decomposing the local probability distributions of the base model, enriching the set of conditioning features used to estimate the model’s parameters, and using k-NN as opposed to the Witten-Bell estimation of the base model, we achieve an f-score of 89.2%, representing a 4% relative decrease in f-score error over the 1-best output of the base parser.

@InProceedings{hogan:2005:IWPT,
  author    = {Hogan, Deirdre},
  title     = {{k-NN} for Local Probability Estimation in Generative Parsing Models},
  booktitle = {Proceedings of the Ninth International Workshop on Parsing Technology},
  month     = {October},
  year      = {2005},
  address   = {Vancouver, British Columbia},
  publisher = {Association for Computational Linguistics},
  pages     = {202--203},
  url       = {http://www.aclweb.org/anthology/W/W05/W05-1528}
}