000 | 04666nam a22006135i 4500 | ||
---|---|---|---|
001 | 978-981-15-4095-0 | ||
003 | DE-He213 | ||
005 | 20240423125109.0 | ||
007 | cr nn 008mamaa | ||
008 | 200629s2020 si | s |||| 0|eng d | ||
020 |
_a9789811540950 _9978-981-15-4095-0 |
||
024 | 7 |
_a10.1007/978-981-15-4095-0 _2doi |
|
050 | 4 | _aQ325.5-.7 | |
072 | 7 |
_aUYQM _2bicssc |
|
072 | 7 |
_aMAT029000 _2bisacsh |
|
072 | 7 |
_aUYQM _2thema |
|
082 | 0 | 4 |
_a006.31 _223 |
245 | 1 | 0 |
_aDeep Reinforcement Learning _h[electronic resource] : _bFundamentals, Research and Applications / _cedited by Hao Dong, Zihan Ding, Shanghang Zhang. |
250 | _a1st ed. 2020. | ||
264 | 1 |
_aSingapore : _bSpringer Nature Singapore : _bImprint: Springer, _c2020. |
|
300 |
_aXXVII, 514 p. 489 illus., 208 illus. in color. _bonline resource. |
||
336 |
_atext _btxt _2rdacontent |
||
337 |
_acomputer _bc _2rdamedia |
||
338 |
_aonline resource _bcr _2rdacarrier |
||
347 |
_atext file _bPDF _2rda |
||
505 | 0 | _aPreface -- Contributors -- Acknowledgements -- Mathematical Notation -- Acronyms -- Introduction -- Part 1: Foundamentals -- Chapter 1: Introduction to Deep Learning -- Chapter 2: Introduction to Reinforcement Learning -- Chapter 3: Taxonomy of Reinforcement Learning Algorithms -- Chapter 4: Deep Q-Networks -- Chapter 5: Policy Gradient -- Chapter 6: Combine Deep Q-Networks with Actor-Critic -- Part II: Research -- Chapter 7: Challenges of Reinforcement Learning -- Chapter 8: Imitation Learning -- Chapter 9: Integrating Learning and Planning -- Chapter 10: Hierarchical Reinforcement Learning -- Chapter 11: Multi-Agent Reinforcement Learning -- Chapter 12: Parallel Computing -- Part III: Applications -- Chapter 13: Learning to Run -- Chapter 14: Robust Image Enhancement -- Chapter 15: AlphaZero -- Chapter 16: Robot Learning in Simulation -- Chapter 17: Arena Platform for Multi-Agent Reinforcement Learning -- Chapter 18: Tricks of Implementation -- Part IV: Summary -- Chapter 19: Algorithm Table -- Chapter 20: Algorithm Cheatsheet. | |
520 | _aDeep reinforcement learning (DRL) is the combination of reinforcement learning (RL) and deep learning. It has been able to solve a wide range of complex decision-making tasks that were previously out of reach for a machine, and famously contributed to the success of AlphaGo. Furthermore, it opens up numerous new applications in domains such as healthcare, robotics, smart grids and finance. Divided into three main parts, this book provides a comprehensive and self-contained introduction to DRL. The first part introduces the foundations of deep learning, reinforcement learning (RL) and widely used deep RL methods and discusses their implementation. The second part covers selected DRL research topics, which are useful for those wanting to specialize in DRL research. To help readers gain a deep understanding of DRL and quickly apply the techniques in practice, the third part presents mass applications, such as the intelligent transportation system and learning to run, with detailed explanations. The book is intended for computer science students, both undergraduate and postgraduate, who would like to learn DRL from scratch, practice its implementation, and explore the research topics. It also appeals to engineers and practitioners who do not have strong machine learning background, but want to quickly understand how DRL works and use the techniques in their applications. | ||
650 | 0 | _aMachine learning. | |
650 | 0 | _aData mining. | |
650 | 0 | _aComputer vision. | |
650 | 0 | _aRobotics. | |
650 | 0 | _aComputer programming. | |
650 | 0 | _aNatural language processing (Computer science). | |
650 | 1 | 4 | _aMachine Learning. |
650 | 2 | 4 | _aData Mining and Knowledge Discovery. |
650 | 2 | 4 | _aComputer Vision. |
650 | 2 | 4 | _aRobotics. |
650 | 2 | 4 | _aProgramming Techniques. |
650 | 2 | 4 | _aNatural Language Processing (NLP). |
700 | 1 |
_aDong, Hao. _eeditor. _4edt _4http://id.loc.gov/vocabulary/relators/edt |
|
700 | 1 |
_aDing, Zihan. _eeditor. _4edt _4http://id.loc.gov/vocabulary/relators/edt |
|
700 | 1 |
_aZhang, Shanghang. _eeditor. _4edt _4http://id.loc.gov/vocabulary/relators/edt |
|
710 | 2 | _aSpringerLink (Online service) | |
773 | 0 | _tSpringer Nature eBook | |
776 | 0 | 8 |
_iPrinted edition: _z9789811540943 |
776 | 0 | 8 |
_iPrinted edition: _z9789811540967 |
776 | 0 | 8 |
_iPrinted edition: _z9789811540974 |
856 | 4 | 0 | _uhttps://doi.org/10.1007/978-981-15-4095-0 |
912 | _aZDB-2-SMA | ||
912 | _aZDB-2-SXMS | ||
942 | _cSPRINGER | ||
999 |
_c174210 _d174210 |