A hybrid text-to-speech based on sub-band approach

Takuma Inoue, Sunao Hara, Masanobu Abe

Research output

2 Citations (Scopus)

Abstract

This paper proposes a sub-band speech synthesis approach for developing high-quality Text-to-Speech (TTS). Hidden Markov Model (HMM)-based speech synthesis is used for the low-frequency band, and waveform-based speech synthesis is used for the high-frequency band. Both methods are widely known to perform well, and each has benefits and shortcomings from different points of view. One motivation is to apply the right speech synthesis method in the right frequency band. Experimental results show that the proposed approach performs better than waveform-based speech synthesis in terms of smoothness, and better than HMM-based speech synthesis in terms of clarity. Consequently, the proposed approach combines the inherent benefits of both waveform-based and HMM-based speech synthesis.
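The band-splitting idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state the crossover frequency, the filter design, or the sampling rate, so the 4000 Hz cutoff, the 16 kHz rate, the windowed-sinc filter, and all function names below are assumptions chosen only for illustration. The two inputs are stand-ins for the waveforms an HMM-based synthesizer and a waveform-based synthesizer would produce for the same utterance.

```python
import math


def lowpass_kernel(cutoff_hz, sample_rate, num_taps=101):
    """Hamming-windowed sinc low-pass FIR kernel, normalized to unit DC gain."""
    if num_taps % 2 == 0:
        raise ValueError("num_taps must be odd so the kernel has a center tap")
    fc = cutoff_hz / sample_rate  # cutoff in cycles per sample
    mid = (num_taps - 1) // 2
    kernel = []
    for n in range(num_taps):
        k = n - mid
        ideal = 2 * fc if k == 0 else math.sin(2 * math.pi * fc * k) / (math.pi * k)
        window = 0.54 - 0.46 * math.cos(2 * math.pi * n / (num_taps - 1))
        kernel.append(ideal * window)
    total = sum(kernel)
    return [h / total for h in kernel]


def fir_filter(signal, kernel):
    """Centered FIR filtering: output has the input's length and no delay.

    Samples outside the signal are treated as zero, so the first and last
    len(kernel) // 2 output samples are affected by edge effects.
    """
    mid = (len(kernel) - 1) // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, h in enumerate(kernel):
            idx = i + mid - j
            if 0 <= idx < len(signal):
                acc += h * signal[idx]
        out.append(acc)
    return out


def combine_subbands(low_source, high_source, cutoff_hz, sample_rate, num_taps=101):
    """Take the band below cutoff_hz from low_source and the rest from high_source.

    The high band is computed as x - lowpass(x), the complement of the
    low-pass filter, so feeding the same signal to both inputs returns it
    unchanged.
    """
    if len(low_source) != len(high_source):
        raise ValueError("both sources must have the same number of samples")
    kernel = lowpass_kernel(cutoff_hz, sample_rate, num_taps)
    low_band = fir_filter(low_source, kernel)
    high_lowpassed = fir_filter(high_source, kernel)
    high_band = [x - y for x, y in zip(high_source, high_lowpassed)]
    return [a + b for a, b in zip(low_band, high_band)]


if __name__ == "__main__":
    sr = 16000      # assumed sampling rate
    cutoff = 4000   # assumed crossover frequency, not taken from the paper
    # Placeholder signals standing in for the two synthesizers' outputs.
    hmm_like = [math.sin(2 * math.pi * 200 * t / sr) for t in range(800)]
    unit_like = [math.sin(2 * math.pi * 6000 * t / sr) for t in range(800)]
    hybrid = combine_subbands(hmm_like, unit_like, cutoff, sr)
    print(len(hybrid))
```

Using a complementary filter pair is one simple design choice: the two bands sum back to the original when both inputs are identical, so any audible difference in the hybrid comes from the two sources rather than from the split itself. A real system would also need the two synthesizers' outputs to be time-aligned, which this sketch takes for granted.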

Original language: English
Title of host publication: 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9786163618238
DOI
Publication status: Published - Feb 12 2014
Event: 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014 - Chiang Mai
Duration: Dec 9 2014 to Dec 12 2014

Publication series

Name: 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014

Other

Other: 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
Country/Territory: Thailand
City: Chiang Mai
Period: 12/9/14 to 12/12/14

ASJC Scopus subject areas

  • Signal Processing
  • Information Systems
