Markov Decision Processes and the Belief-Desire-Intention Model Bridging the Gap for Autonomous Agents

In this work, we provide a treatment of the relationship between two models that have been widely used in the implementation of autonomous agents: the Belief DesireIntention (BDI) model and Markov Decision Processes (MDPs). We start with an informal description of the relationship, identifying the c...

Full description

Bibliographic Details
Main Authors: Simari, Gerardo I., Parsons, Simon D. (Author)
Format: eBook
Language:English
Published: New York, NY Springer New York 2011, 2011
Edition:1st ed. 2011
Series:SpringerBriefs in Computer Science
Subjects:
Online Access:
Collection: Springer eBooks 2005- - Collection details see MPG.ReNa
LEADER 02214nmm a2200301 u 4500
001 EB000363913
003 EBX01000000000000000216965
005 00000000000000.0
007 cr|||||||||||||||||||||
008 130626 ||| eng
020 |a 9781461414728 
100 1 |a Simari, Gerardo I. 
245 0 0 |a Markov Decision Processes and the Belief-Desire-Intention Model  |h Elektronische Ressource  |b Bridging the Gap for Autonomous Agents  |c by Gerardo I. Simari, Simon D. Parsons 
250 |a 1st ed. 2011 
260 |a New York, NY  |b Springer New York  |c 2011, 2011 
300 |a VIII, 63 p  |b online resource 
505 0 |a Introduction -- Preliminary Concepts -- An Empirical Comparison of Models -- A Theoretical Comparison of Models -- Related Work. Conclusions, Limitations, and Future Directions 
653 |a Computer simulation 
653 |a Artificial Intelligence 
653 |a Artificial intelligence 
653 |a Simulation and Modeling 
700 1 |a Parsons, Simon D.  |e [author] 
041 0 7 |a eng  |2 ISO 639-2 
989 |b Springer  |a Springer eBooks 2005- 
490 0 |a SpringerBriefs in Computer Science 
856 4 0 |u https://doi.org/10.1007/978-1-4614-1472-8?nosfx=y  |x Verlag  |3 Volltext 
082 0 |a 006.3 
520 |a In this work, we provide a treatment of the relationship between two models that have been widely used in the implementation of autonomous agents: the Belief DesireIntention (BDI) model and Markov Decision Processes (MDPs). We start with an informal description of the relationship, identifying the common features of the two approaches and the differences between them. Then we hone our understanding of these differences through an empirical analysis of the performance of both models on the TileWorld testbed. This allows us to show that even though the MDP model displays consistently better behavior than the BDI model for small worlds, this is not the case when the world becomes large and the MDP model cannot be solved exactly. Finally we present a theoretical analysis of the relationship between the two approaches, identifying mappings that allow us to extract a set of intentions from a policy (a solution to an MDP), and to extract a policy from a set of intentions