Reinforcement Learning for Sequential Decision and Optimal Control (Registro nro. 261398)

MARC details
000 -LIDER
fixed length control field	05431nam a22006255i 4500
001 - CONTROL NUMBER
control field	978-981-19-7784-8
003 - CONTROL NUMBER IDENTIFIER
control field	DE-He213
005 - DATE AND TIME OF LATEST TRANSACTION
control field	20240207153549.0
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION
fixed length control field	cr nn 008mamaa
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field	230405s2023 si \| s \|\|\|\| 0\|eng d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number	9789811977848
--	978-981-19-7784-8
050 #4 - LIBRARY OF CONGRESS CALL NUMBER
Classification number	Q325.5-.7
072 #7 - SUBJECT CATEGORY CODE
Subject category code	UYQM
Source	bicssc
072 #7 - SUBJECT CATEGORY CODE
Subject category code	COM004000
Source	bisacsh
072 #7 - SUBJECT CATEGORY CODE
Subject category code	UYQM
Source	thema
082 04 - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number	006.31
Edition number	23
100 1# - MAIN ENTRY--PERSONAL NAME
Personal name	Li, Shengbo Eben.
Relator term	author.
Relator code	aut
--	http://id.loc.gov/vocabulary/relators/aut
245 10 - TITLE STATEMENT
Title	Reinforcement Learning for Sequential Decision and Optimal Control
Medium	[electronic resource] /
Statement of responsibility, etc.	by Shengbo Eben Li.
250 ## - EDITION STATEMENT
Edition statement	1st ed. 2023.
264 #1 -
--	Singapore :
--	Springer Nature Singapore :
--	Imprint: Springer,
--	2023.
300 ## - PHYSICAL DESCRIPTION
Extent	XXX, 462 p. 217 illus., 213 illus. in color.
Other physical details	online resource.
336 ## -
--	text
--	txt
--	rdacontent
337 ## -
--	computer
--	c
--	rdamedia
338 ## -
--	online resource
--	cr
--	rdacarrier
347 ## -
--	text file
--	PDF
--	rda
500 ## - GENERAL NOTE
General note	Acceso multiusuario
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note	Chapter 1 Introduction of Reinforcement Learning -- Chapter 2 Principles of RL Problems -- Chapter 3 Model-free Indirect RL: Monte Carlo -- Chapter 4 Model-Free Indirect RL: Temporal-Difference -- Chapter 5 Model-based Indirect RL: Dynamic Programming -- Chapter 6 Indirect RL with Function Approximation -- Chapter 7 Direct RL with Policy Gradient -- Chapter 8 Infinite Horizon Approximate Dynamic Programming -- Chapter 9 Finite Horizon ADP and State Constraints -- Chapter 10 Deep Reinforcement Learning -- Chapter 11 Advanced RL Topics.
520 ## - SUMMARY, ETC.
Summary, etc.	Have you ever wondered how AlphaZero learns to defeat the top human Go players? Do you have any clues about how an autonomous driving system can gradually develop self-driving skills beyond normal drivers? What is the key that enables AlphaStar to make decisions in Starcraft, a notoriously difficult strategy game that has partial information and complex rules? The core mechanism underlying those recent technical breakthroughs is reinforcement learning (RL), a theory that can help an agent to develop the self-evolution ability through continuing environment interactions. In the past few years, the AI community has witnessed phenomenal success of reinforcement learning in various fields, including chess games, computer games and robotic control. RL is also considered to be a promising and powerful tool to create general artificial intelligence in the future. As an interdisciplinary field of trial-and-error learning and optimal control, RL resembles how humans reinforce their intelligence by interacting with the environment and provides a principled solution for sequential decision making and optimal control in large-scale and complex problems. Since RL contains a wide range of new concepts and theories, scholars may be plagued by a number of questions: What is the inherent mechanism of reinforcement learning? What is the internal connection between RL and optimal control? How has RL evolved in the past few decades, and what are the milestones? How do we choose and implement practical and effective RL algorithms for real-world scenarios? What are the key challenges that RL faces today, and how can we solve them? What is the current trend of RL research? You can find answers to all those questions in this book. The purpose of the book is to help researchers and practitioners take a comprehensive view of RL and understand the in-depth connection between RL and optimal control. The book includes not only systematic and thorough explanations of theoretical basics but also methodical guidance of practical algorithm implementations. The book intends to provide a comprehensive coverage of both classic theories and recent achievements, and the content is carefully and logically organized, including basic topics such as the main concepts and terminologies of RL, Markov decision process (MDP), Bellman's optimality condition, Monte Carlo learning, temporal difference learning, stochastic dynamic programming, function approximation, policy gradient methods, approximate dynamic programming, and deep RL, as well as the latest advances in action and state constraints, safety guarantee, reference harmonization, robust RL, partially observable MDP, multiagent RL, inverse RL, offline RL, and so on.
541 ## - IMMEDIATE SOURCE OF ACQUISITION NOTE
Owner	UABC ;
Method of acquisition	Perpetuidad
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Término temático o nombre geográfico como elemento de entrada	Machine learning.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Término temático o nombre geográfico como elemento de entrada	Computational intelligence.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Término temático o nombre geográfico como elemento de entrada	System theory.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Término temático o nombre geográfico como elemento de entrada	Control theory.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Término temático o nombre geográfico como elemento de entrada	Engineering mathematics.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Término temático o nombre geográfico como elemento de entrada	Control engineering.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Término temático o nombre geográfico como elemento de entrada	Robotics.
650 #0 - SUBJECT ADDED ENTRY--TOPICAL TERM
Término temático o nombre geográfico como elemento de entrada	Automation.
650 14 - SUBJECT ADDED ENTRY--TOPICAL TERM
Término temático o nombre geográfico como elemento de entrada	Machine Learning.
650 24 - SUBJECT ADDED ENTRY--TOPICAL TERM
Término temático o nombre geográfico como elemento de entrada	Computational Intelligence.
650 24 - SUBJECT ADDED ENTRY--TOPICAL TERM
Término temático o nombre geográfico como elemento de entrada	Systems Theory, Control .
650 24 - SUBJECT ADDED ENTRY--TOPICAL TERM
Término temático o nombre geográfico como elemento de entrada	Engineering Mathematics.
650 24 - SUBJECT ADDED ENTRY--TOPICAL TERM
Término temático o nombre geográfico como elemento de entrada	Control, Robotics, Automation.
710 2# - ADDED ENTRY--CORPORATE NAME
Corporate name or jurisdiction name as entry element	SpringerLink (Online service)
773 0# - HOST ITEM ENTRY
Title	Springer Nature eBook
776 08 - ADDITIONAL PHYSICAL FORM ENTRY
Relationship information	Printed edition:
International Standard Book Number	9789811977831
776 08 - ADDITIONAL PHYSICAL FORM ENTRY
Relationship information	Printed edition:
International Standard Book Number	9789811977855
776 08 - ADDITIONAL PHYSICAL FORM ENTRY
Relationship information	Printed edition:
International Standard Book Number	9789811977862
856 40 - ELECTRONIC LOCATION AND ACCESS
Public note	Libro electrónico
Uniform Resource Identifier	http://libcon.rec.uabc.mx:2048/login?url=https://doi.org/10.1007/978-981-19-7784-8
912 ## -
--	ZDB-2-SCS
912 ## -
--	ZDB-2-SXCS
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Koha item type	Libro Electrónico

Existencias
Estado de retiro	Colección	Ubicación permanente	Ubicación actual	Fecha de ingreso	Total Checkouts	Date last seen	Número de copia	Tipo de material
	Colección de Libros Electrónicos	Biblioteca Electrónica	Biblioteca Electrónica	07/02/2024		07/02/2024	1	Libro Electrónico

Imprimir
Sugerir para compra
Guardar registro
BIBTEX Dublin Core MARCXML MARC (no-Unicode/MARC-8) MARC (Unicode/UTF-8) MARC (Unicode/UTF-8, estándar) MODS (XML) RIS
Más búsquedas

Buscar este título en:
Other Libraries (WorldCat) Other Databases (Google Scholar) Online Stores (Bookfinder.com) Open Library (openlibrary.org)