Pythonで学ぶ強化学習 = Reinforcement Learning by Python : 入門から実践まで

Revised 2nd edition

(機械学習スタートアップシリーズ)

Call No. (NDL): M159-M194
Bibliographic ID of National Diet Library: 029951878

Material type: Book

Author: 久保隆宏

Publisher: 講談社

Publication date: 2019.9

Material Format: Paper

Capacity, size, etc.: 297p ; 21cm

NDC: 007.13

Detailed bibliographic record

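Summary, etc.: A revised edition of the introductory book that was well received as "You can implement reinforcement learning in Python!" It incorporates requests and corrections from readers. (Provided by: 出版情報登録センター (JPRO))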

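Author introduction: 久保隆宏 belongs to the Strategic Technology Center of TIS株式会社. Twitter: @icoxfog417. He currently works on creating summaries and diagrams from small amounts of training data, with the goal of "summarization for people." He also runs arXivTimes, which shares summaries of papers, 『直感 Deep Learning』 (オライリージャパン, 2018)...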

Provided by：出版情報登録センター（JPRO）

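Day 1: Understanding where reinforcement learning stands
  The relationship between reinforcement learning and related keywords
  Advantages and disadvantages of reinforcement learning
  Problem formulation in reinforcement learning: Markov Decision Process

Day 2: Solving reinforcement learning (1): planning from the environment
  Defining and computing value: Bellman Equation
  Learning state evaluation with dynamic programming: Value Iteration
  Learning a policy with dynamic programming: Policy Iteration
  The difference between model-based and model-free

Day 3: Solving reinforcement learning (2): planning from experience
  Balancing the accumulation and exploitation of experience: Epsilon-Greedy method
  Revising the plan from actual results or from predictions: Monte Carlo vs Temporal Difference
  Using experience to update the value estimate or the policy: Value-based vs Policy-based

Day 4: Applying neural networks to reinforcement learning
  Applying neural networks to reinforcement learning
  Implementing value evaluation as a parameterized function: Value Function Approximation
  Applying deep learning to value evaluation: Deep Q-Network
  Implementing the policy as a parameterized function: Policy Gradient
  Applying deep learning to the policy: Advantage Actor Critic (A2C)
  Value evaluation or policy?

Day 5: Weaknesses of reinforcement learning
  Poor sample efficiency
  Frequent convergence to locally optimal behavior and overfitting
  Low reproducibility
  Countermeasures that take the weaknesses as given

Day 6: Methods for overcoming the weaknesses of reinforcement learning
  Addressing poor sample efficiency: combination with model-based methods / representation learning
  Addressing low reproducibility: evolution strategies
  Addressing locally optimal behavior / overfitting: imitation learning / inverse reinforcement learning

Day 7: Application areas of reinforcement learning
  Optimizing behavior
  Optimizing learning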

Holdings of Libraries in Japan

This page shows libraries in Japan other than the National Diet Library that hold the material.

List of Cooperating Institutions and Databases

Please contact your local library for information on how to use materials or whether it is possible to request materials from the holding libraries.

Northern Japan

札幌市中央図書館
Material format: Recording Media
Call No.: 007.13//
Book Registration Number: 9510113211

Kanto

さいたま市立中央図書館
Material format: Paper
Call No.: 007.13 ｸﾎﾞ
Book Registration Number: 22100299332
横浜市立図書館
Material format: Paper
Call No.: 007.1
Book Registration Number: 2071599690
川崎市立図書館
Material format: Paper
Call No.: 007.1 ﾊﾟｲ
Book Registration Number: 430018444132

Kinki

大阪府立中央図書館
Material format: Paper
Call No.: 007.1/13NX/(2)
Book Registration Number: 1119194213

Shikoku

徳島県立図書館
Material format: Paper
Call No.: 007.1-ｸﾎ
Book Registration Number: 00112110017
高知県立図書館
Material format: Paper
Call No.: 007.13-ｸﾎ
Book Registration Number: 1109665776

Kyushu

佐賀県立図書館
Material format: Paper
Call No.: /007.1/KU11
Book Registration Number: 116140161
長崎県立長崎図書館
Material format: Paper
Call No.: 007.1/ｸ-21/
Book Registration Number: 1212210828

Other

Agriculture Library Information system - WebOPAC
Search Service
Paper
Holdings of the institutions and databases linked with Agriculture Library Information system - WebOPAC can be checked on the Agriculture Library Information system - WebOPAC site.
CiNii Research
Search Service
Paper
Holdings of the institutions and databases linked with CiNii Research can be checked on the CiNii Research site.

Search by Bookstore

Find a bookstore where you can purchase the book through Books, the publication bibliographic database

Books is a database of the publishing industry with information provided by publishers. You can search for currently available paperbacks and eBooks.

Bibliographic Record

This section gives the details of this material, including its authority data (controlled keywords that refer to materials on the same subject, author names, and so on).

Material Type: Book
ISBN: 978-4-06-517251-3
Title: Pythonで学ぶ強化学習 : 入門から実践まで
Title Transcription: パイソンデマナブキョウカガクシュウ
Author/Editor: 久保隆宏 (author)
Edition: Revised 2nd edition
Series Title: 機械学習スタートアップシリーズ
Author Heading: 久保, 隆宏 (クボ, タカヒロ) (001300203)
Publication, Distribution, etc.: Tokyo : 講談社
Publication Date: 2019.9
Publication Date (W3CDTF): 2019
Extent: 297p
Size: 21cm
Alternative Title: Reinforcement Learning by Python ニュウモンカラジッセンマデ
Reinforcement Learning by Python
Place of Publication (Country Code): JP
Text Language Code: jpn
Subject Heading: プログラミング (コンピュータ) [Programming (computers)] (00569223)
機械学習 (キカイガクシュウ) [Machine learning] (001210569)
NDC 10th ed.: 007.13 : Information science. Informatics
NDLC: M159
Target Audience: General
Note (Bibliography): Includes bibliographical references and index
Price: 2800 yen
Holding library: National Diet Library
Call No.: M159-M194
Data Provider (Database): National Diet Library : National Diet Library holdings
https://ndlsearch.ndl.go.jp
Bibliographic ID (NDL): 029951878
http://id.ndl.go.jp/bib/029951878
National Bibliography No. (JPNO): 23276069
TOHAN MARC No.: 33975504
Cataloging Rule: Nippon Cataloguing Rules 1987 Revised Edition
Bibliographic Record Category (NDL): 111
