Study on Optimal Spoken Dialogue System for Robust Information Search in the Real World

国立国会図書館永続的識別子: info:ndljp/pid/10224574

資料種別: 博士論文

著者: 徐, 昕

出版者: Hokkaido University

出版年: 2016-09-26

資料形態: デジタル

ページ数・大きさ等: -

授与大学名・学位: 北海道大学,博士(工学)

すべて見る

国立国会図書館での利用に関する注記

本資料は、掲載誌(URI)等のリンク先にある学位授与機関のWebサイトやCiNii Dissertationsから、本文を自由に閲覧できる場合があります。

資料に関する注記

一般注記：: Recently, the spoken dialogue systems those enable users to intuitively and directly operate services and smartphones with voice commands and informat...

書店で探す

障害者向け資料で読む

障害者向け資料を見る（1種類）

全国の図書館の所蔵

国立国会図書館以外の全国の図書館の所蔵状況を表示します。

連携機関・データベースの一覧

所蔵のある図書館から取寄せることが可能かなど、資料の利用方法は、ご自身が利用されるお近くの図書館へご相談ください

その他

北海道大学学術成果コレクション
デジタル
連携先のサイトで、学術機関リポジトリデータベース（IRDB）（機関リポジトリ）が連携している機関・データベースの所蔵状況を確認できます。
北海道大学学術成果コレクションのサイトでこの本を確認

書店で探す

障害者向け資料で読む

他サービス
- テキストデータ国立国会図書館デジタルコレクションで確認する

書誌情報

この資料の詳細や典拠（同じ主題の資料を指すキーワード、著者名）等を確認できます。

デジタル

資料種別: 博士論文
タイトル: Study on Optimal Spoken Dialogue System for Robust Information Search in the Real World
著者・編者: 徐, 昕
著者標目: 徐, 昕
出版事項: Hokkaido University
出版年月日等: 2016-09-26
出版年（W3CDTF）: 2016-09-26
並列タイトル等: 実環境下におけるロバスト情報検索のための最適音声対話システムに関する研究
寄与者: 宮永, 喜一
齊藤, 晋聖
大鐘, 武雄
筒井, 弘
授与機関名: 北海道大学
授与年月日: 2016-09-26
授与年月日（W3CDTF）: 2016-09-26
報告番号: 甲第12405号
学位: 博士(工学)
博論授与番号: 甲第12405号
本文の言語コード: eng
NDC: 500
対象利用者: 一般
一般注記: Recently, the spoken dialogue systems those enable users to intuitively and directly operate services and smartphones with voice commands and information search become popular. However, there is still a remaining challenge that there are not many users with the habitual and continual use of the spoken dialogue systems for information search in the real world, though most of them have devices in which the spoken dialogue system is implemented. To solve this challenge, three researches in different aspects have been done in this thesis, to realize an optimal spoken dialogue system for robust information search in the real world.The first research practices human-centered design (HCD) to design a dialogue agent and a dialogue scenario promoting a daily use of the spoken dialogue interface, which is based on the cognitive science and the gamification theory. The author proposes a design concept of breeding a character, which is actually a dialogue agent, through taking care and having a dialogue in order to make users graduallyfeel that speaking to the dialogue agent is natural and fun. The real-world data prove the novelty of the proposed design, in which over 23% users keep speaking continually. More than 95% conversations from the dialogue agent are responded by the users.The second research improves the efficiency and robustness of the dialogue management for information search based on the information theory. The author proposes two strategies to optimize question selection for information search and to decrease failures in information search mainly caused by mistaken queries. Onestrategy applies optimal question selection in a knowledge-based spontaneous dialogue system, which has been verified to be effective to assist the users’ operation for information search. The other strategy applies a robust and fast search method based on phoneme strings matching. It decreases the failures caused by the queriescontaining incorrect parts. Experimental results show that the proposed search method increases search accuracy by 4.4% and reduces processing time by at least 86.2%.The third research practices signal processing technologies to emphasize the usability of spoken dialogue systems. The author proposes a novel pitch detection method applying an adaptive filtering algorithm to restore the amplitude spectra of speech corrupted by additive noises. The periodic structures in the amplitude spectra are kept against noise distortion. Experimental results verify that the proposed pitch detection method achieved the highest robustness in a variety of noise type and noise level. With the high-accuracy pitch information, emotion recognition isgoing to be established in the next step of this research. Understanding speaker’s emotion helps to generate the appropriate dialogue actions to present superiority and differentiation to other modalities.Furthermore, based on the above researches, this thesis proposes a dialogue structure to build a personalized dialogue system applying emotion recognition and multidevice interface for further real-world use in the future.
(主査) 教授宮永喜一, 教授齊藤晋聖, 教授大鐘武雄, 准教授筒井弘
情報科学研究科（メディアネットワーク専攻）
DOI: 10.14943/doctoral.k12405
https://doi.org/10.14943/doctoral.k12405
国立国会図書館永続的識別子: info:ndljp/pid/10224574
https://dl.ndl.go.jp/pid/10224574
コレクション（共通）: 障害者向け資料
コレクション（障害者向け資料：レベル1）: テキストデータ
コレクション（個別）: 国立国会図書館デジタルコレクション > デジタル化資料 > 博士論文
https://dl.ndl.go.jp/collections/A00014
収集根拠: 博士論文（自動収集）
受理日（W3CDTF）: 2016-12-01T22:39:34+09:00
作成日（W3CDTF）: 2016-08-22
記録形式（IMT）: PDF
オンライン閲覧公開範囲: 国立国会図書館内限定公開
デジタル化資料送信: 図書館・個人送信対象外
遠隔複写可否（NDL）: 可
掲載誌（URI）: http://dx.doi.org/10.14943/doctoral.k12405
http://hdl.handle.net/2115/63374
参照（URI）: http://hdl.handle.net/2115/63361
連携機関・データベース: 国立国会図書館 : 国立国会図書館デジタルコレクション
https://dl.ndl.go.jp