View article

Time-compressing speech: ASR transcripts are an effective way to support gist extraction

Authors

Simon Tucker, Nicos Kyprianou, Steve Whittaker

Publication date

2008

Conference

Machine Learning for Multimodal Interaction: 5th International Workshop, MLMI 2008, Utrecht, The Netherlands, September 8-10, 2008. Proceedings 5

Pages

226-235

Publisher

Springer Berlin Heidelberg

Description

A major problem for users exploiting speech archives is the laborious nature of speech access. Prior work has developed methods that allow users to efficiently identify and access the gist of an archive using textual transcripts of the conversational recording. Text processing techniques are applied to these transcripts to identify unimportant parts of the recording and to excise these, reducing the time taken to identify the main points of the recording. However our prior work has relied on human-generated as opposed to automatically generated transcripts. Our study compares excision methods applied to human-generated and automatically generated transcripts with state of the art word error rates (38%). We show that both excision techniques provide equivalent support for gist extraction. Furthermore, both techniques perform better than the standard speedup techniques used in current applications. This …

Total citations

Cited by 9

200920102011201220132014201520162 1 2 2 1 1

Scholar articles

Time-compressing speech: ASR transcripts are an effective way to support gist extraction

S Tucker, N Kyprianou, S Whittaker - Machine Learning for Multimodal Interaction: 5th …, 2008