000 04666nam a22006135i 4500
001 978-981-15-4095-0
003 DE-He213
005 20240423125109.0
007 cr nn 008mamaa
008 200629s2020 si | s |||| 0|eng d
020 _a9789811540950
_9978-981-15-4095-0
024 7 _a10.1007/978-981-15-4095-0
_2doi
050 4 _aQ325.5-.7
072 7 _aUYQM
_2bicssc
072 7 _aMAT029000
_2bisacsh
072 7 _aUYQM
_2thema
082 0 4 _a006.31
_223
245 1 0 _aDeep Reinforcement Learning
_h[electronic resource] :
_bFundamentals, Research and Applications /
_cedited by Hao Dong, Zihan Ding, Shanghang Zhang.
250 _a1st ed. 2020.
264 1 _aSingapore :
_bSpringer Nature Singapore :
_bImprint: Springer,
_c2020.
300 _aXXVII, 514 p. 489 illus., 208 illus. in color.
_bonline resource.
336 _atext
_btxt
_2rdacontent
337 _acomputer
_bc
_2rdamedia
338 _aonline resource
_bcr
_2rdacarrier
347 _atext file
_bPDF
_2rda
505 0 _aPreface -- Contributors -- Acknowledgements -- Mathematical Notation -- Acronyms -- Introduction -- Part 1: Foundamentals -- Chapter 1: Introduction to Deep Learning -- Chapter 2: Introduction to Reinforcement Learning -- Chapter 3: Taxonomy of Reinforcement Learning Algorithms -- Chapter 4: Deep Q-Networks -- Chapter 5: Policy Gradient -- Chapter 6: Combine Deep Q-Networks with Actor-Critic -- Part II: Research -- Chapter 7: Challenges of Reinforcement Learning -- Chapter 8: Imitation Learning -- Chapter 9: Integrating Learning and Planning -- Chapter 10: Hierarchical Reinforcement Learning -- Chapter 11: Multi-Agent Reinforcement Learning -- Chapter 12: Parallel Computing -- Part III: Applications -- Chapter 13: Learning to Run -- Chapter 14: Robust Image Enhancement -- Chapter 15: AlphaZero -- Chapter 16: Robot Learning in Simulation -- Chapter 17: Arena Platform for Multi-Agent Reinforcement Learning -- Chapter 18: Tricks of Implementation -- Part IV: Summary -- Chapter 19: Algorithm Table -- Chapter 20: Algorithm Cheatsheet.
520 _aDeep reinforcement learning (DRL) is the combination of reinforcement learning (RL) and deep learning. It has been able to solve a wide range of complex decision-making tasks that were previously out of reach for a machine, and famously contributed to the success of AlphaGo. Furthermore, it opens up numerous new applications in domains such as healthcare, robotics, smart grids and finance. Divided into three main parts, this book provides a comprehensive and self-contained introduction to DRL. The first part introduces the foundations of deep learning, reinforcement learning (RL) and widely used deep RL methods and discusses their implementation. The second part covers selected DRL research topics, which are useful for those wanting to specialize in DRL research. To help readers gain a deep understanding of DRL and quickly apply the techniques in practice, the third part presents mass applications, such as the intelligent transportation system and learning to run, with detailed explanations. The book is intended for computer science students, both undergraduate and postgraduate, who would like to learn DRL from scratch, practice its implementation, and explore the research topics. It also appeals to engineers and practitioners who do not have strong machine learning background, but want to quickly understand how DRL works and use the techniques in their applications.
650 0 _aMachine learning.
650 0 _aData mining.
650 0 _aComputer vision.
650 0 _aRobotics.
650 0 _aComputer programming.
650 0 _aNatural language processing (Computer science).
650 1 4 _aMachine Learning.
650 2 4 _aData Mining and Knowledge Discovery.
650 2 4 _aComputer Vision.
650 2 4 _aRobotics.
650 2 4 _aProgramming Techniques.
650 2 4 _aNatural Language Processing (NLP).
700 1 _aDong, Hao.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
700 1 _aDing, Zihan.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
700 1 _aZhang, Shanghang.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
710 2 _aSpringerLink (Online service)
773 0 _tSpringer Nature eBook
776 0 8 _iPrinted edition:
_z9789811540943
776 0 8 _iPrinted edition:
_z9789811540967
776 0 8 _iPrinted edition:
_z9789811540974
856 4 0 _uhttps://doi.org/10.1007/978-981-15-4095-0
912 _aZDB-2-SMA
912 _aZDB-2-SXMS
942 _cSPRINGER
999 _c174210
_d174210