Title: Autoregressive clustering for HMM speech synthesis
Authors: Shannon, Matt
Byrne, William
Issue Date: 27-Sep-2010
Publisher: ISCA (International Speech Communication Association)
Citation: M. Shannon and W. Byrne, "Autoregressive clustering for HMM speech synthesis," in Proc. Interspeech 2010, 2010, pp 829-832, http://mi.eng.cam.ac.uk/~sms46/papers/shannon2010autoregressive.pdf
Abstract: The autoregressive HMM has been shown to provide efficient parameter estimation and high-quality synthesis, but in previous experiments decision trees derived from a non-autoregressive system were used. In this paper we investigate the use of autoregressive clustering for autoregressive HMM-based speech synthesis. We describe decision tree clustering for the autoregressive HMM and highlight differences to the standard clustering procedure. Subjective listening evaluation results suggest that autoregressive clustering improves the naturalness of the resulting speech. We find that the standard minimum description length (MDL) criterion for selecting model complexity is inappropriate for the autoregressive HMM. Investigating the effect of model complexity on naturalness, we find that a large degree of overfitting is tolerated without a substantial decrease in naturalness.
URI: http://mi.eng.cam.ac.uk/~sms46/papers/shannon2010autoregressive.pdf
http://www.dspace.cam.ac.uk/handle/1810/226374
http://www.isca-speech.org/archive/interspeech_2010/i10_0829.html
Appears in Collections:Scholarly works - Information Engineering

Files in This Item:

File Description SizeFormat
shannon2010autoregressive.pdf169.29 kBAdobe PDFThumbnail
View/Open
Additional resources for this item
search for alternative versions in eresources@cambridge
retrieve citation metadata in EndNote format

This item is licensed under a Creative Commons License
Creative Commons

This item has been accessed 902 times.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.