Recent Advances in Reinforcement Learning

9th European Workshop, EWRL 2011, Athens, Greece, September 9-11, 2011, Revised and Selected Papers

Scott Sanner, Marcus Hutter (Herausgeber)

Buch | Softcover
XIII, 345 Seiten
2012 | 2012
Springer Berlin (Verlag)
978-3-642-29945-2 (ISBN)
53,49 inkl. MwSt
This book constitutes revised and selected papers of the 9th European Workshop on Reinforcement Learning, EWRL 2011, which took place in Athens, Greece in September 2011. The papers presented were carefully reviewed and selected from 40 submissions. The papers are organized in topical sections online reinforcement learning, learning and exploring MDPs, function approximation methods for reinforcement learning, macro-actions in reinforcement learning, policy search and bounds, multi-task and transfer reinforcement learning, multi-agent reinforcement learning, apprenticeship and inverse reinforcement learning and real-world reinforcement learning.

Marcus Hutter received his masters in computer sciences in 1992 at the Technical University in Munich, Germany. After his PhD in theoretical particle physics he developed algorithms in a medical software company for 5 years. For four years he has been working as a researcher at the AI institute IDSIA in Lugano, Switzerland. His current interests are centered around reinforcement learning, algorithmic information theory and statistics, universal induction schemes, adaptive control theory, and related areas.

Invited Talk Abstracts.-Invited Talk: UCRL and Autonomous Exploration.-Invited Talk: Increasing Representational Power and Scaling Inference in Reinforcement Learning.-Invited Talk: PRISM - Practical RL: Representation, Interaction, Synthesis, and Mortality.-Invited Talk: Towards Robust Reinforcement Learning Algorithms.-Online Reinforcement Learning Automatic Discovery of Ranking Formulas for Playing with Multi-armed Bandits.-Goal-Directed Online Learning of Predictive Models.-Gradient Based Algorithms with Loss Functions and Kernels for Improved On-Policy Control.-Learning and Exploring MDPs -Active Learning of MDP Models.-Handling Ambiguous Effects in Action Learning.-Feature Reinforcement Learning in Practice.-Function Approximation Methods for Reinforcement Learning Reinforcement Learning with a Bilinear Q Function.-1-Penalized Projected Bellman Residual.-Regularized Least Squares Temporal Difference Learning with Nested 2 and 1 Penalization.-Recursive Least-Squares Learning with Eligibility Traces.-Value Function Approximation through Sparse Bayesian Modeling.-Automatic Construction of Temporally Extended Actions for MDPs Using Bisimulation Metrics.-Unified Inter and Intra Options Learning Using Policy Gradient Methods.-Options with Exceptions.-Policy Search and Bounds.-Robust Bayesian Reinforcement Learning through Tight Lower Bounds.-Optimized Look-ahead Tree Search Policies.-A Framework for Computing Bounds for the Return of a Policy.-Multi-Task and Transfer Reinforcement Learning.-Transferring Evolved Reservoir Features in Reinforcement Learning Task.-Transfer Learning via Multiple Inter-task Mappings.-Multi-Task Reinforcement Learning: Shaping and Feature Selection.-Multi-Agent Reinforcement Learning.-Transfer Learning in Multi-Agent Reinforcement Learning Domains.-An Extension of a Hierarchical Reinforcement Learning Algorithm for Multiagent Settings.-Apprenticeship and Inverse Reinforcement Learning Bayesian Multitask Inverse ReinforcementLearning.-Batch, Off-Policy and Model-Free Apprenticeship Learning.-Real-World Reinforcement Learning Introduction of Fixed Mode States into Online Profit Sharing and Its Application to Waist Trajectory Generation of Biped Robot.-MapReduce for Parallel Reinforcement Learning.-Compound Reinforcement Learning: Theory and an Application to Finance.-Proposal and Evaluation of the Active Course Classification Support System with Exploitation-Oriented Learning.

Erscheint lt. Verlag 22.5.2012
Reihe/Serie Lecture Notes in Artificial Intelligence
Lecture Notes in Computer Science
Zusatzinfo XIII, 345 p. 98 illus.
Verlagsort Berlin
Sprache englisch
Gewicht 545 g
Themenwelt Mathematik / Informatik Informatik Datenbanken
Informatik Software Entwicklung User Interfaces (HCI)
Informatik Theorie / Studium Algorithmen
Informatik Theorie / Studium Künstliche Intelligenz / Robotik
Schlagworte Algorithm analysis and problem complexity • Bayesian inference • multitask learning • predictive state representation • real-time control • Recommender System
ISBN-10 3-642-29945-8 / 3642299458
ISBN-13 978-3-642-29945-2 / 9783642299452
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
Aus- und Weiterbildung nach iSAQB-Standard zum Certified Professional …

von Mahbouba Gharbi; Arne Koschel; Andreas Rausch; Gernot Starke

Buch | Hardcover (2023)
dpunkt Verlag
34,90
Lean UX und Design Thinking: Teambasierte Entwicklung …

von Toni Steimle; Dieter Wallach

Buch | Hardcover (2022)
dpunkt (Verlag)
34,90
Wissensverarbeitung - Neuronale Netze

von Uwe Lämmel; Jürgen Cleve

Buch | Hardcover (2023)
Carl Hanser (Verlag)
34,99