Authors
Simon Tucker, Nicos Kyprianou, Steve Whittaker
Publication date
2008
Conference
Machine Learning for Multimodal Interaction: 5th International Workshop, MLMI 2008, Utrecht, The Netherlands, September 8-10, 2008. Proceedings 5
Pages
226-235
Publisher
Springer Berlin Heidelberg
Description
A major problem for users exploiting speech archives is the laborious nature of speech access. Prior work has developed methods that allow users to efficiently identify and access the gist of an archive using textual transcripts of the conversational recording. Text processing techniques are applied to these transcripts to identify unimportant parts of the recording and to excise these, reducing the time taken to identify the main points of the recording. However our prior work has relied on human-generated as opposed to automatically generated transcripts. Our study compares excision methods applied to human-generated and automatically generated transcripts with state of the art word error rates (38%). We show that both excision techniques provide equivalent support for gist extraction. Furthermore, both techniques perform better than the standard speedup techniques used in current applications. This …
Total citations
20092010201120122013201420152016212211
Scholar articles
S Tucker, N Kyprianou, S Whittaker - Machine Learning for Multimodal Interaction: 5th …, 2008