Authors
Sharon Oviatt, Phil Cohen, Lizhong Wu, Lisbeth Duncan, Bernhard Suhm, Josh Bers, Thomas Holzman, Terry Winograd, James Landay, Jim Larson, David Ferro
Publication date
2000/12/1
Journal
Human-Computer Interaction
Volume
15
Issue
4
Pages
263-322
Publisher
Lawrence Erlbaum Associates, Inc.
Description
The growing interest in multimodal interface design is inspired in large part by the goals of supporting more transparent, flexible, efficient, and powerfully expressive means of human-computer interaction than in the past. Multimodal interfaces are expected to support a wider range of diverse applications, be usable by a broader spectrum of the average population, and function more reliably under realistic and challenging usage conditions. In this article, we summarize the emerging architectural approaches for interpreting speech and pen-based gestural input in a robust manner, including early and late fusion approaches, and the new hybrid symbolic-statistical approach. We also describe a diverse collection of state-of-the-art multimodal systems that process users' spoken and gestural input. These applications range from map-based and virtual reality systems for engaging in simulations and training, to field medic …
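As a rough illustration of the late (semantic-level) fusion approach mentioned in the abstract: each recognizer independently produces an n-best list of scored interpretations, and an integrator pairs cross-modal hypotheses whose semantic types are compatible, ranking joint interpretations by a combined score. The Python sketch below is a hypothetical illustration, not the architecture of any system described in the article; the Hypothesis structure, the compatible() rule, and the product-of-posteriors scoring are all assumptions made for the example.

from dataclasses import dataclass
from itertools import product

@dataclass
class Hypothesis:
    modality: str   # "speech" or "gesture"
    content: str    # symbolic interpretation, e.g. "create_platoon"
    slot: str       # hypothetical semantic type used for the compatibility check
    score: float    # recognizer posterior in [0, 1]

def late_fusion(speech_nbest, gesture_nbest, compatible):
    # Combine n-best lists from two recognizers after each has been
    # interpreted independently (late, semantic-level fusion). Pairs whose
    # semantic types are compatible are scored by the product of the
    # unimodal posteriors, then ranked best-first.
    joint = []
    for s, g in product(speech_nbest, gesture_nbest):
        if compatible(s, g):
            joint.append((s.score * g.score, s, g))
    return sorted(joint, key=lambda t: t[0], reverse=True)

# Illustrative compatibility rule: a spoken command that needs a location
# unifies with a pointing or area gesture.
def compatible(s, g):
    return s.slot == "needs_location" and g.slot == "location"

speech = [Hypothesis("speech", "create_platoon", "needs_location", 0.8),
          Hypothesis("speech", "delete_platoon", "needs_location", 0.1)]
gesture = [Hypothesis("gesture", "point(47.61, -122.33)", "location", 0.7),
           Hypothesis("gesture", "circled_area", "location", 0.2)]

for score, s, g in late_fusion(speech, gesture, compatible):
    print(f"{score:.2f}: {s.content} @ {g.content}")

Ranking joint interpretations rather than committing to each recognizer's single best hypothesis is what lets the modalities disambiguate one another, which is one motivation the abstract gives for fusion-based architectures.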
Total citations
Citations per year: 2000: 5, 2001: 15, 2002: 43, 2003: 41, 2004: 40, 2005: 41, 2006: 35, 2007: 36, 2008: 30, 2009: 32, 2010: 36, 2011: 17, 2012: 18, 2013: 20, 2014: 19, 2015: 25, 2016: 14, 2017: 21, 2018: 11, 2019: 15, 2020: 11, 2021: 16, 2022: 9, 2023: 15, 2024: 2