View article

Temporal compression of speech: An evaluation

Authors

Simon Tucker, Steve Whittaker

Publication date

2008/4/15

Journal

IEEE transactions on audio, speech, and language processing

Volume

Issue

Pages

790-796

Publisher

IEEE

Description

Efficient browsing of speech recordings is problematic. The linear nature of speech, coupled with the lack of abstraction that the medium affords, means that listeners have to listen to long segments of a recording to locate points of interest. We explore temporal compression algorithms that attempt to reduce the amount of time users require to listen to speech recordings, while retaining the important content. This paper implements two main approaches to temporal compression: artificial speech rate alteration (speed-up) and unimportant segment removal (excision). We evaluate the effectiveness of these approaches by having listeners rate comprehension and listening effort for different types of temporal compression. For different compression levels, we compare performance of various implementations of speed-up and excision as well as techniques based on semantic features and acoustic features. Our results …

Total citations

Cited by 19

20082009201020112012201320142015201620172018201920202021202220231 1 1 3 2 3 1 3 2 2

Scholar articles

Temporal compression of speech: An evaluation

S Tucker, S Whittaker - IEEE transactions on audio, speech, and language …, 2008