Authors
Fernando Pereira, Yves Schabes
Publication date
1992/6
Conference
30th Annual Meeting of the Association for Computational Linguistics
Pages
128-135
Description
The inside-outside algorithm for inferring the parameters of a stochastic context-free grammar is extended to take advantage of constituent information (constituent bracketing) in a partially parsed corpus. Experiments on formal and natural language parsed corpora show that the new algorithm can achieve faster convergence and better modeling of hierarchical structure than the original one. In particular, over 90% test set bracketing accuracy was achieved for grammars inferred by our algorithm from a training set of handparsed part-of-speech strings for sentences in the Air Travel Information System spoken language corpus. Finally, the new algorithm has better time complexity than the original one when sufficient bracketing is provided.
Total citations
199219931994199519961997199819992000200120022003200420052006200720082009201020112012201320142015201620172018201920202021202220232024420222824322523232619312223262717121720201513114243597141
Scholar articles
F Pereira, Y Schabes - 30th Annual Meeting of the Association for …, 1992