WEKO3
アイテム
ZeST: A Zero-Resourced Speech-to-Speech Translation Approach for Unknown, Unpaired, and Untranscribed Languages
http://hdl.handle.net/10061/0002001294
http://hdl.handle.net/10061/000200129420ac9b4c-f91f-4b86-8a00-df6a6f95ead8
| アイテムタイプ | 学術雑誌論文 / Journal Article(1) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 公開日 | 2025-12-16 | |||||||||
| タイトル | ||||||||||
| タイトル | ZeST: A Zero-Resourced Speech-to-Speech Translation Approach for Unknown, Unpaired, and Untranscribed Languages | |||||||||
| 言語 | ||||||||||
| 言語 | eng | |||||||||
| キーワード | ||||||||||
| 主題Scheme | Other | |||||||||
| 主題 | Translation | |||||||||
| キーワード | ||||||||||
| 主題Scheme | Other | |||||||||
| 主題 | Speech processing | |||||||||
| キーワード | ||||||||||
| 主題Scheme | Other | |||||||||
| 主題 | Feature extraction | |||||||||
| キーワード | ||||||||||
| 主題Scheme | Other | |||||||||
| 主題 | Data models | |||||||||
| キーワード | ||||||||||
| 主題Scheme | Other | |||||||||
| 主題 | Decoding | |||||||||
| キーワード | ||||||||||
| 主題Scheme | Other | |||||||||
| 主題 | Training | |||||||||
| キーワード | ||||||||||
| 主題Scheme | Other | |||||||||
| 主題 | Transformers | |||||||||
| キーワード | ||||||||||
| 主題Scheme | Other | |||||||||
| 主題 | Noise reduction | |||||||||
| キーワード | ||||||||||
| 主題Scheme | Other | |||||||||
| 主題 | Visualization | |||||||||
| キーワード | ||||||||||
| 主題Scheme | Other | |||||||||
| 主題 | Text to speech | |||||||||
| キーワード | ||||||||||
| 主題Scheme | Other | |||||||||
| 主題 | Speech-to-speech translation | |||||||||
| キーワード | ||||||||||
| 主題Scheme | Other | |||||||||
| 主題 | self-supervised speech representation | |||||||||
| キーワード | ||||||||||
| 主題Scheme | Other | |||||||||
| 主題 | zero-resourced | |||||||||
| 資源タイプ | ||||||||||
| 資源タイプ | journal article | |||||||||
| アクセス権 | ||||||||||
| アクセス権 | open access | |||||||||
| 著者 |
Thanh, Nguyen Luan
× Thanh, Nguyen Luan
× Sakti, Sakriani
|
|||||||||
| 抄録 | ||||||||||
| 内容記述タイプ | Abstract | |||||||||
| 内容記述 | Speech-to-speech translation (S2ST) has emerged as a practical solution for overcoming linguistic barriers, enabling direct translation between spoken languages without relying on intermediate text representations. However, existing S2ST systems face significant challenges, including the requirement for extensive parallel speech data and the limitations of known written languages. This paper proposes ZeST, a novel zero-resourced approach to speech-to-speech translation that addresses the challenges of processing unknown, unpaired, and untranscribed languages. ZeST consists of two main phases: (1) Discovering semantically related speech pairs from unpaired data by leveraging self-supervised visually grounded speech (VGS) models and (2) Achieving textless speech-to-speech translation for untranscribed languages using discrete speech representations and sequence-to-sequence modeling. Experimental evaluations using three different data scenarios demonstrate that the ZeST system effectively performs direct speech-to-speech translation without relying on transcribed data or parallel corpora. The experimental results highlight the potential of ZeST in contributing to the field of zero-resourced speech processing and improving communication in multilingual societies. | |||||||||
| 書誌情報 |
en : IEEE Access 巻 13, p. 8638-8648, ページ数 11, 発行日 2025-01-08 |
|||||||||
| 出版者 | ||||||||||
| 出版者 | IEEE | |||||||||
| ISSN | ||||||||||
| 収録物識別子タイプ | EISSN | |||||||||
| 収録物識別子 | 2169-3536 | |||||||||
| 出版者版DOI | ||||||||||
| 関連タイプ | isReplacedBy | |||||||||
| 識別子タイプ | DOI | |||||||||
| 関連識別子 | https://doi.org/10.1109/ACCESS.2025.3527012 | |||||||||
| 出版者版URI | ||||||||||
| 関連タイプ | isReplacedBy | |||||||||
| 識別子タイプ | URI | |||||||||
| 関連識別子 | https://ieeexplore.ieee.org/abstract/document/10833610 | |||||||||
| 権利 | ||||||||||
| 権利情報Resource | https://creativecommons.org/licenses/by/4.0/ | |||||||||
| 権利情報 | © 2025 The Authors. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/ | |||||||||
| 著者版フラグ | ||||||||||
| 出版タイプ | NA | |||||||||
| 助成情報 | ||||||||||
| 助成機関名 | Japan Society for the Promotion of Science (JSPS) | |||||||||
| 研究課題番号 | JP21H05054 | |||||||||
| 研究課題番号URI | https://kaken.nii.ac.jp/grant/KAKENHI-PROJECT-21H05054/ | |||||||||
| 研究課題名 | 多元自動通訳システムと評価法に関する研究とその応用展開 | |||||||||
| 助成情報 | ||||||||||
| 助成機関名 | Japan Society for the Promotion of Science (JSPS) | |||||||||
| 研究課題番号 | JP23K21681 | |||||||||
| 研究課題番号URI | https://kaken.nii.ac.jp/grant/KAKENHI-PROJECT-23K21681/ | |||||||||
| 研究課題名 | 言語の壁を超える低資源多言語Machine Speech Chain技術の構築 | |||||||||