Creating and sharing knowledge for telecommunications

Query by Example Search with Segmented Dynamic Time Warping for Non-Exact Spoken Queries

Proença , J. ; Veiga, A. ; Perdigão, F.

Query by Example Search with Segmented Dynamic Time Warping for Non-Exact Spoken Queries, Proc European Signal Processing Conference EUSIPCO, Nice, France, Vol. -, pp. 1691 - 1695, August, 2015.

Digital Object Identifier: 0

 

Abstract
This paper presents an approach to the Query-by-Example task of finding spoken queries on speech databases when the intended match may be non-exact or slightly complex. The built system is low-resource as it tries to solve the problem where the language of queries and searched audio is unspecified. Our method is based on a modified Dynamic Time Warping (DTW) algorithm using posteriorgrams and extracting intricate paths to account for special cases of query match such as word re-ordering, lexical variations and filler con-tent. This system was evaluated on the MediaEval 2014 task of Query by Example Search on Speech (QUESST) where the spoken data is from different languages, unknown to the participant. We combined the results of five DTW modifications computed on the output of three phoneme recognizers of different languages. The combination of all systems provided the best perfor-mance overall and improved detection of complex case queries.