Perceptual Evaluation of Quality Deterioration Owing to Prosody Modification
http://hdl.handle.net/10061/8223
Item type | Conference Paper(1) | |||||
---|---|---|---|---|---|---|
Release date | 2012-08-22 | |||||
Title | ||||||
Title | Perceptual Evaluation of Quality Deterioration Owing to Prosody Modification | |||||
Language | ||||||
Language | eng | |||||
Keyword | ||||||
Subject scheme | Other | |||||
Subject | Speech synthesis | |||||
Keyword | ||||||
Subject scheme | Other | |||||
Subject | prosody modification | |||||
Keyword | ||||||
Subject scheme | Other | |||||
Subject | speech database | |||||
Keyword | ||||||
Subject scheme | Other | |||||
Subject | analysis-synthesis | |||||
Keyword | ||||||
Subject scheme | Other | |||||
Subject | perceptual evaluation | |||||
Resource type | ||||||
Resource type | conference paper | |||||
Access rights | ||||||
Access rights | open access | |||||
Author | Adachi, Kazuki; Toda, Tomoki; Kawanami, Hiromichi; Saruwatari, Hiroshi; Shikano, Kiyohiro | |||||
Abstract | ||||||
Description type | Abstract | |||||
Description | Our research goal is to construct a Japanese TTS (Text-to-Speech) system that can output various kinds of prosody. Since such synthetic speech is useful in practice, many TTS systems have implemented global prosodic control processing, but fundamentally they are designed to output speech with standard pitch and speech rate. We discuss a synthesis method for high-quality speech with extreme prosody (very high, low, fast, and slow) from the viewpoint of the speech database. As the speech synthesis method, we employ unit selection and concatenation. We also introduce an analysis-synthesis process to give precise target prosody to the output speech. Many studies have reported that speech quality degrades in proportion to the amount of prosody modification by analysis-synthesis or PSOLA. Following these reports, we take an approach that reduces the prosody modification applied to a speech segment. Nine Japanese speech databases with different prosodic characteristics are prepared. First, we confirm the relationship between speech quality deterioration and prosody modification using synthetic speech in objective and subjective tests. We also investigate the relationship between the speech deterioration tendency and each speech database. The results indicate that the tendencies depend on the prosodic features of the original speech. | |||||
Bibliographic information | p. 2159-2162, issue date 2004-05 | |||||
Conference information | ||||||
Conference name | LREC2004: the 4th International Conference on Language Resources and Evaluation | |||||
Conference dates | May 24-30, 2004 | |||||
Venue | Lisbon | |||||
Country | PRT | |||||
Author version flag | ||||||
Publication type | VoR |