Authors
John Blitzer, Ryan McDonald, Fernando Pereira
Publication date
2006/7
Conference
Proceedings of the 2006 conference on empirical methods in natural language processing
Pages
120-128
Description
Discriminative learning methods are widely used in natural language processing. These methods work best when their training and test data are drawn from the same distribution. For many NLP tasks, however, we are confronted with new domains in which labeled data is scarce or non-existent. In such cases, we seek to adapt existing models from a resourcerich source domain to a resource-poor target domain. We introduce structural correspondence learning to automatically induce correspondences among features from different domains. We test our technique on part of speech tagging and show performance gains for varying amounts of source and target training data, as well as improvements in target domain parsing accuracy using our improved tagger.
Total citations
200720082009201020112012201320142015201620172018201920202021202220232024143046829910211412414614315315916914712912111625
Scholar articles
J Blitzer, R McDonald, F Pereira - Proceedings of the 2006 conference on empirical …, 2006