Authors
Michael Johnston, Srinivas Bangalore
Publication date
2000
Conference
COLING 2000 Volume 1: The 18th International Conference on Computational Linguistics
Description
Multimodal interfaces require effective parsing and understanding of utterances whose content is distributed across multiple input modes. Johnston 1998 presents an approach in which strategies for multimodal integration are stated declaratively using a unification-based grammar that is used by a multidimensional chart parser to compose inputs. This approach is highly expressive and supports a broad class of interfaces, but offers only limited potential for mutual compensation among the input modes, is subject to significant concerns in terms of computational complexity, and complicates selection among alternative multimodal interpretations of the input. In this paper, we present an alternative approach in which multimodal parsing and understanding are achieved using a weighted finite-state device which takes speech and gesture streams as inputs and outputs their joint interpretation. This approach is significantly more efficient, enables tight-coupling of multimodal understanding with speech recognition, and provides a general probabilistic framework for multimodal ambiguity resolution.
Total citations
20002001200220032004200520062007200820092010201120122013201420152016201720182019202020212022241351513191111181083653752111
Scholar articles
M Johnston, S Bangalore - COLING 2000 Volume 1: The 18th International …, 2000