Incorporating Multi-granularity Linguistic Units in Character-based Word Segmentation

デジタルデータあり（東京工業大学リサーチリポジトリ）

学術機関リポジトリデータベース（IRDB）（機関リポジトリ）

Incorporating Multi-granularity Linguistic Units in Character-based Word Segmentation

資料種別: 博士論文

著者: CHAY-INTR, Thodsapornほか

出版者: -

出版年: 2023-09

資料形態: デジタル

ページ数・大きさ等: -

授与大学名・学位: 東京工業大学,博士(工学)

すべて見る

資料に関する注記

一般注記：: A character sequence tends to comprise segmentation alternatives, leading to segmentation ambiguity. Properly handling this ambiguity using multi-gran...

書店で探す

全国の図書館の所蔵

国立国会図書館以外の全国の図書館の所蔵状況を表示します。

連携機関・データベースの一覧

所蔵のある図書館から取寄せることが可能かなど、資料の利用方法は、ご自身が利用されるお近くの図書館へご相談ください

その他

東京工業大学リサーチリポジトリ
デジタル
連携先のサイトで、学術機関リポジトリデータベース（IRDB）（機関リポジトリ）が連携している機関・データベースの所蔵状況を確認できます。
東京工業大学リサーチリポジトリのサイトでこの本を確認

書店で探す

書誌情報

この資料の詳細や典拠（同じ主題の資料を指すキーワード、著者名）等を確認できます。

デジタル

資料種別: 博士論文
タイトル: Incorporating Multi-granularity Linguistic Units in Character-based Word Segmentation
著者・編者: CHAY-INTR, Thodsaporn
Chay-intr, Thodsaporn
著者標目: CHAY-INTR, Thodsaporn
Chay-intr, Thodsaporn
出版年月日等: 2023-09
出版年（W3CDTF）: 2023-09
授与機関名: 東京工業大学
授与年月日: 2023-09-22
報告番号: 甲第12542号
学位: 博士(工学)
本文の言語コード: eng
件名標目: Word segmentation
Representation learning
Linguistic units
対象利用者: 一般
一般注記: A character sequence tends to comprise segmentation alternatives, leading to segmentation ambiguity. Properly handling this ambiguity using multi-granularity linguistic units, such as character clusters, subwords, and words, can improve word segmentation performance and lessen ambiguous boundary decisions. We conduct a study to investigate the potential of using various linguistic units and leveraging segmentation alternatives for character-based word segmentation. Our experimental results demonstrated improvements in segmentation performance, outperforming previous work on the BCCWJ, CTB6, and BEST2010 datasets in Japanese, Chinese, and Thai, respectively.
identifier:oai:t2r2.star.titech.ac.jp:50672947
一次資料へのリンクURL: http://t2r2.star.titech.ac.jp/rrws/file/CTT100902372/ATD100000413/19D10554_CHAY-INTR-Thodsaporn_thesis.pdf （fulltext）
http://t2r2.star.titech.ac.jp/rrws/file/CTT100902372/ATD100000413/19D10554_CHAY-INTR-Thodsaporn_thesis.pdf
オンライン閲覧公開範囲: インターネット公開
連携機関・データベース: 国立情報学研究所 : 学術機関リポジトリデータベース（IRDB）（機関リポジトリ）
https://irdb.nii.ac.jp
提供元機関・データベース: 東京工業大学 : 東京工業大学リサーチリポジトリ