Reinforcement Learning

Reinforcement learning is the learning of a mapping from situations to actions so as to maximize a scalar reward or reinforcement signal. The learner is not told which action to take, as in most forms of machine learning, but instead must discover which actions yield the highest reward by trying the...


Bibliographic Details
Other Authors: Sutton, Richard S. (Editor)
Format: eBook
Language: English
Published: New York, NY: Springer US, 1992
Edition: 1st ed. 1992
Series: The Springer International Series in Engineering and Computer Science
Collection: Springer Book Archives -2004 - Collection details see MPG.ReNa
LEADER 03142nmm a2200313 u 4500
001 EB000625181
003 EBX01000000000000000478263
005 00000000000000.0
007 cr|||||||||||||||||||||
008 140122 ||| eng
020 |a 9781461536185 
100 1 |a Sutton, Richard S.  |e [editor] 
245 0 0 |a Reinforcement Learning  |h Electronic resource  |c edited by Richard S. Sutton 
250 |a 1st ed. 1992 
260 |a New York, NY  |b Springer US  |c 1992, 1992 
300 |a 172 p  |b online resource 
505 0 |a Reinforcement Learning -- A Special Issue of Machine Learning on Reinforcement Learning -- Introduction: The Challenge of Reinforcement Learning -- Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning -- Practical Issues in Temporal Difference Learning -- Technical Note: Q-Learning -- Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching -- Transfer of Learning by Composing Solutions of Elemental Sequential Tasks -- The Convergence of TD(λ) for General λ -- A Reinforcement Connectionist Approach to Robot Path Finding in Non-Maze Like Environments 
653 |a Statistical Physics and Dynamical Systems 
653 |a Statistical physics 
653 |a Artificial Intelligence 
653 |a Complex Systems 
653 |a Artificial intelligence 
653 |a Dynamical systems 
041 0 7 |a eng  |2 ISO 639-2 
989 |b SBA  |a Springer Book Archives -2004 
490 0 |a The Springer International Series in Engineering and Computer Science 
856 4 0 |u https://doi.org/10.1007/978-1-4615-3618-5?nosfx=y  |x Publisher  |3 Full text 
082 0 |a 006.3 
520 |a Reinforcement learning is the learning of a mapping from situations to actions so as to maximize a scalar reward or reinforcement signal. The learner is not told which action to take, as in most forms of machine learning, but instead must discover which actions yield the highest reward by trying them. In the most interesting and challenging cases, actions may affect not only the immediate reward, but also the next situation, and through that all subsequent rewards. These two characteristics -- trial-and-error search and delayed reward -- are the most important distinguishing features of reinforcement learning. Reinforcement learning is both a new and a very old topic in AI. The term appears to have been coined by Minsky (1961), and independently in control theory by Waltz and Fu (1965). The earliest machine learning research now viewed as directly relevant was Samuel's (1959) checker player, which used temporal-difference learning to manage delayed reward much as it is used today. Of course, learning and reinforcement have been studied in psychology for almost a century, and that work has had a very strong impact on the AI/engineering work. One could in fact consider all of reinforcement learning to be simply the reverse engineering of certain psychological learning processes (e.g. operant conditioning and secondary reinforcement). Reinforcement Learning is an edited volume of original research, comprising seven invited contributions by leading researchers