ログイン
Language:

WEKO3

  • トップ
  • ランキング
To
lat lon distance
To

Field does not validate



インデックスリンク

インデックスツリー

メールアドレスを入力してください。

WEKO

One fine body…

WEKO

One fine body…

アイテム

  1. 02 情報科学
  2. 02 国際会議論文

An Efficient Keyword Spotting Technique Using a Complementary Language for Filler Models Training

http://hdl.handle.net/10061/7970
http://hdl.handle.net/10061/7970
3e893b3e-c2ef-49d2-a7cf-c719206c67d0
名前 / ファイル ライセンス アクション
EUROSPEECH_2003_921.pdf fulltext (608.0 kB)
アイテムタイプ 会議発表論文 / Conference Paper(1)
公開日 2012-08-22
タイトル
タイトル An Efficient Keyword Spotting Technique Using a Complementary Language for Filler Models Training
言語
言語 eng
資源タイプ
資源タイプ conference paper
アクセス権
アクセス権 open access
著者 Heracleous, Panikos

× Heracleous, Panikos

WEKO 11609

en Heracleous, Panikos

Search repository
Shimizu, Tohru

× Shimizu, Tohru

WEKO 11610

en Shimizu, Tohru

Search repository
抄録
内容記述タイプ Abstract
内容記述 The task of keyword spotting is to detect a set of keywords in the input continuous speech. In a keyword spotter, not only the keywords, but also the non-keyword intervals must be modeled. For this purpose, filler (or garbage) models are used. To date, most of the keyword spotters have been based on hidden Markov models (HMM). More specifically, a set of HMM is used as garbage models. In this paper, a two-pass keyword spotting technique based on bilingual hidden Markov models is presented. In the first pass, our technique uses phonemic garbage models to represent the non-keyword intervals, and in the second stage the putative hits are verified using normalized scores. The main difference from similar approaches lies in the way the non-keyword intervals are modeled. In this work, the target language is Japanese, and English was chosen as the `garbage' language for training the phonemic garbage models. Experimental results on both clean and noisy telephone speech data showed higher performance compared with using a common set of acoustic models. Moreover, parameter tuning (e.g. word insertion penalty tuning) does not have a serious effect on the performance. For a vocabulary of 100 keywords and using clean telephone speech test data we achieved a 92.04% recognition rate with only a 7.96% false alarm rate, and without word insertion penalty tuning. Using noisy telephone speech test data we achieved a 87.29% recognition rate with only a 12.71% false alarm rate.
書誌情報
p. 921-924, 発行日 2003-09
会議情報
会議名 EUROSPEECH2003: 8th European Conference on Speech Communication and Technology
開催期間 September 1-4, 2003
開催地 Geneva
開催国 CHE
ISSN
収録物識別子タイプ ISSN
収録物識別子 1018-4074
権利
権利情報 Copyright 2003 ISCA
著者版フラグ
出版タイプ VoR
戻る
0
views
See details
Views

Versions

Ver.1 2023-07-25 14:00:53.657498
Show All versions

Share

Share
tweet

Cite as

Other

print

エクスポート

OAI-PMH
  • OAI-PMH JPCOAR 2.0
  • OAI-PMH JPCOAR 1.0
  • OAI-PMH DublinCore
  • OAI-PMH DDI
Other Formats
  • JSON
  • BIBTEX
  • ZIP

コミュニティ

確認

確認

確認


Powered by WEKO3


Powered by WEKO3