A Research on Learned Image/Video Restoration and Compression for Solving Real-World Degradation

表紙は所蔵館によって異なることがあります

国立国会図書館館内限定公開

収録元データベースで確認する

国立国会図書館デジタルコレクション

デジタルデータあり

公開元のウェブサイトで確認する

DOI[10.15002/00025235]のデータに遷移します

A Research on Learned Image/Video Restoration and Compression for Solving Real-World Degradation

国立国会図書館永続的識別子: info:ndljp/pid/12302976

資料種別: 博士論文

著者: HO, Minh Man

出版者: -

出版年: 2022-03-24

資料形態: デジタル

ページ数・大きさ等: -

授与大学名・学位: 法政大学 (Hosei University),博士(工学)

すべて見る

国立国会図書館での利用に関する注記

本資料は、掲載誌(URI)等のリンク先にある学位授与機関のWebサイトやCiNii Dissertationsから、本文を自由に閲覧できる場合があります。

資料に関する注記

一般注記：: type:ThesisThe adage, "A picture is worth a thousand words", has proved the effectiveness of image and video in delivering information. Hence, the Int...

書店で探す

障害者向け資料で読む

障害者向け資料を見る（1種類）

書店で探す

障害者向け資料で読む

他サービス
- テキストデータ国立国会図書館デジタルコレクションで確認する

書誌情報

この資料の詳細や典拠（同じ主題の資料を指すキーワード、著者名）等を確認できます。

デジタル

資料種別: 博士論文
タイトル: A Research on Learned Image/Video Restoration and Compression for Solving Real-World Degradation
著者・編者: HO, Minh Man
著者標目: HO, Minh Man
出版年月日等: 2022-03-24
出版年（W3CDTF）: 2022-03-24
授与機関名: 法政大学 (Hosei University)
授与年月日: 2022-03-24
授与年月日（W3CDTF）: 2022-03-24
報告番号: 甲第542号
学位: 博士(工学)
博論授与番号: 甲第542号
本文の言語コード: eng
件名標目: Deep Learning
Super-Resolution
Video Compression
Skip Connection
Colorization
対象利用者: 一般
一般注記: type:Thesis
The adage, "A picture is worth a thousand words", has proved the effectiveness of image and video in delivering information. Hence, the Internet becomes wonderful when we can share image/video media with people worldwide in this digital era. It must be more incredible if image/video media can precisely show what we see in real life with our eyes. Unfortunately, due to natural causes (e.g., shooting devices and environments) or artificial causes (e.g., image/video compression sacrificing information to achieve better transmission), the image/video media is not always in the best visual quality which human expects to see (ground-truth), reducing user experience in receiving the information. The loss of an image compared with its ground-truth is called degradation, and the act of solving degradation is called restoration. Even though many advanced techniques have been proposed to restore degraded images/videos, the real-world degradation remains unsolved. Hence, this thesis will dive into and solve specific types of real-world degradation, including (1) artificial degradation in image/video compression and (2) naturally affected degradation in smartphone photo scanning.Regarding (1), we leverage deep learning techniques to solve compression degradation and recover other missing information caused by our effort in reducing compression complexity. Concretely, we sacrifice numerous pixels by down-sampling and color information. It creates a new challenge in compensating for the massively missing information through down-sampling, color removal, and compression. By adopting advanced techniques in computer vision, we propose a specific deep neural network, named restoration-reconstruction deep neural network (RR-DnCNN), to solve Super-Resolution with compression degradation. Furthermore, we also introduce a scheme to compensate for color information with Color Learning and enhance image quality with Deep Motion Compensation for P-frame coding. As a result, our works outperform the standard codec and the previous works in the field.Regarding (2), one solution is to train a supervised deep neural network on many digital images and smartphone-scanned versions. However, it requires a high labor cost, leading to limited training data. Previous works create training pairs by simulating degradation using low-level image processing techniques. Their synthetic images are then formed with perfectly scanned photos in latent space. Even so, the real-world degradation in smartphone photo scanning remains unsolved since it is more complicated due to lens defocus, low-cost cameras, losing details via printing. Besides, locally structural misalignment still occurs in data due to distorted shapes captured in a 3-D world, reducing restoration performance and the reliability of the quantitative evaluation. To address these problems, we propose a semi-supervised Deep Photo Scan (DPScan). First, we present a way to produce real-world degradation and provide the DIV2K-SCAN dataset for smartphone-scanned photo restoration. Also, Local Alignment is proposed to reduce the minor misalignment remaining in data. Second, we simulate many different variants of the real-world degradation using low-level image transformation to gain a generalization in smartphone-scanned image properties, then train a degradation network to learn how to degrade unscanned images as if a smartphone scanned them. Finally, we propose a Semi-Supervised Learning that allows our restoration network to be trained on both scanned and unscanned images, diversifying training image content. As a result, the proposed DPScan quantitatively and qualitatively outperforms its baseline architecture, state-of-the-art academic research, and industrial products in the field.
DOI: 10.15002/00025235
https://doi.org/10.15002/00025235
国立国会図書館永続的識別子: info:ndljp/pid/12302976
https://dl.ndl.go.jp/pid/12302976
コレクション（共通）: 障害者向け資料
コレクション（障害者向け資料：レベル1）: テキストデータ
コレクション（個別）: 国立国会図書館デジタルコレクション > デジタル化資料 > 博士論文
https://dl.ndl.go.jp/collections/A00014
収集根拠: 博士論文（自動収集）
受理日（W3CDTF）: 2022-07-05T02:30:21+09:00
作成日（W3CDTF）: 2022-06-16
記録形式（IMT）: PDF
application/pdf
オンライン閲覧公開範囲: 国立国会図書館内限定公開
デジタル化資料送信: 図書館・個人送信対象外
遠隔複写可否（NDL）: 可
掲載誌（URI）: https://doi.org/10.15002/00025235
http://hdl.handle.net/10114/00025235
連携機関・データベース: 国立国会図書館 : 国立国会図書館デジタルコレクション
https://dl.ndl.go.jp

A Research on Learned Image/Video Restoration and Compression for Solving Real-World Degradation

書店で探す

障害者向け資料で読む

目次

書店で探す

障害者向け資料で読む

書誌情報

デジタル