000 05431nam a22006255i 4500
001 978-981-19-7784-8
003 DE-He213
005 20240207153549.0
007 cr nn 008mamaa
008 230405s2023 si | s |||| 0|eng d
020 _a9789811977848
_9978-981-19-7784-8
050 4 _aQ325.5-.7
072 7 _aUYQM
_2bicssc
072 7 _aCOM004000
_2bisacsh
072 7 _aUYQM
_2thema
082 0 4 _a006.31
_223
100 1 _aLi, Shengbo Eben.
_eauthor.
_4aut
_4http://id.loc.gov/vocabulary/relators/aut
245 1 0 _aReinforcement Learning for Sequential Decision and Optimal Control
_h[electronic resource] /
_cby Shengbo Eben Li.
250 _a1st ed. 2023.
264 1 _aSingapore :
_bSpringer Nature Singapore :
_bImprint: Springer,
_c2023.
300 _aXXX, 462 p. 217 illus., 213 illus. in color.
_bonline resource.
336 _atext
_btxt
_2rdacontent
337 _acomputer
_bc
_2rdamedia
338 _aonline resource
_bcr
_2rdacarrier
347 _atext file
_bPDF
_2rda
500 _aAcceso multiusuario
505 0 _aChapter 1 Introduction of Reinforcement Learning -- Chapter 2 Principles of RL Problems -- Chapter 3 Model-free Indirect RL: Monte Carlo -- Chapter 4 Model-Free Indirect RL: Temporal-Difference -- Chapter 5 Model-based Indirect RL: Dynamic Programming -- Chapter 6 Indirect RL with Function Approximation -- Chapter 7 Direct RL with Policy Gradient -- Chapter 8 Infinite Horizon Approximate Dynamic Programming -- Chapter 9 Finite Horizon ADP and State Constraints -- Chapter 10 Deep Reinforcement Learning -- Chapter 11 Advanced RL Topics.
520 _aHave you ever wondered how AlphaZero learns to defeat the top human Go players? Do you have any clues about how an autonomous driving system can gradually develop self-driving skills beyond normal drivers? What is the key that enables AlphaStar to make decisions in Starcraft, a notoriously difficult strategy game that has partial information and complex rules? The core mechanism underlying those recent technical breakthroughs is reinforcement learning (RL), a theory that can help an agent to develop the self-evolution ability through continuing environment interactions. In the past few years, the AI community has witnessed phenomenal success of reinforcement learning in various fields, including chess games, computer games and robotic control. RL is also considered to be a promising and powerful tool to create general artificial intelligence in the future. As an interdisciplinary field of trial-and-error learning and optimal control, RL resembles how humans reinforce their intelligence by interacting with the environment and provides a principled solution for sequential decision making and optimal control in large-scale and complex problems. Since RL contains a wide range of new concepts and theories, scholars may be plagued by a number of questions: What is the inherent mechanism of reinforcement learning? What is the internal connection between RL and optimal control? How has RL evolved in the past few decades, and what are the milestones? How do we choose and implement practical and effective RL algorithms for real-world scenarios? What are the key challenges that RL faces today, and how can we solve them? What is the current trend of RL research? You can find answers to all those questions in this book. The purpose of the book is to help researchers and practitioners take a comprehensive view of RL and understand the in-depth connection between RL and optimal control. The book includes not only systematic and thorough explanations of theoretical basics but also methodical guidance of practical algorithm implementations. The book intends to provide a comprehensive coverage of both classic theories and recent achievements, and the content is carefully and logically organized, including basic topics such as the main concepts and terminologies of RL, Markov decision process (MDP), Bellman's optimality condition, Monte Carlo learning, temporal difference learning, stochastic dynamic programming, function approximation, policy gradient methods, approximate dynamic programming, and deep RL, as well as the latest advances in action and state constraints, safety guarantee, reference harmonization, robust RL, partially observable MDP, multiagent RL, inverse RL, offline RL, and so on.
541 _fUABC ;
_cPerpetuidad
650 0 _aMachine learning.
650 0 _aComputational intelligence.
650 0 _aSystem theory.
650 0 _aControl theory.
650 0 _aEngineering mathematics.
650 0 _aControl engineering.
650 0 _aRobotics.
650 0 _aAutomation.
650 1 4 _aMachine Learning.
650 2 4 _aComputational Intelligence.
650 2 4 _aSystems Theory, Control .
650 2 4 _aEngineering Mathematics.
650 2 4 _aControl, Robotics, Automation.
710 2 _aSpringerLink (Online service)
773 0 _tSpringer Nature eBook
776 0 8 _iPrinted edition:
_z9789811977831
776 0 8 _iPrinted edition:
_z9789811977855
776 0 8 _iPrinted edition:
_z9789811977862
856 4 0 _zLibro electrónico
_uhttp://libcon.rec.uabc.mx:2048/login?url=https://doi.org/10.1007/978-981-19-7784-8
912 _aZDB-2-SCS
912 _aZDB-2-SXCS
942 _cLIBRO_ELEC
999 _c261398
_d261397