My Account
Log in New to Bookswagon?
Sign up
- Your Account
- Personal Settings
- Your Orders
- Your Wishlist
- Your Gift Certificate
- Your Addresses
- Change Password
- Currency AEDAED
Log out
0
0

My Account

Home

Account

Wishlist

Cart

Markov Decision Processes in Artificial Intelligence

Name: Markov Decision Processes in Artificial Intelligence
Brand: John Wiley & Sons Inc
SKU: 1118620100
Availability: OutOfStock
ISBN: 9781118620106

(Digital (delivered electronically)) | Released: 04 Mar 2013

By: Olivier Sigaud (Edited) , Olivier Buffet (Edited) | Publisher: John Wiley & Sons Inc | Publisher Imprint: Wiley-ISTE

Write Reviews

AED0

Out of Stock

Notify me when this book is in stock

Markov Decision Processes in Artificial Intelligence

Format: Digital (delivered electronically)

About the Book

Markov Decision Processes (MDPs) are a mathematical framework for modeling sequential decision problems under uncertainty as well as reinforcement learning problems.

Written by experts in the field, this book provides a global view of current research using MDPs in artificial intelligence. It starts with an introductory presentation of the fundamental aspects of MDPs (planning in MDPs, reinforcement learning, partially observable MDPs, Markov games and the use of non-classical criteria). It then presents more advanced research trends in the field and gives some concrete examples using illustrative real life applications.

Table of Contents:

Preface xvii

List of Authors xix

PART 1. MDPS: MODELS AND METHODS 1

Chapter 1. Markov Decision Processes 3
Frédérick GARCIA and Emmanuel RACHELSON

1.1. Introduction 3

1.2. Markov decision problems 4

1.3. Value functions 9

1.4. Markov policies 12

1.5. Characterization of optimal policies 14

1.6. Optimization algorithms for MDPs 28

1.7. Conclusion and outlook 37

1.8. Bibliography 37

Chapter 2. Reinforcement Learning 39
Olivier SIGAUD and Frédérick GARCIA

2.1. Introduction 39

2.2. Reinforcement learning: a global view 40

2.3. Monte Carlo methods 45

2.4. From Monte Carlo to temporal difference methods 45

2.5. Temporal difference methods 46

2.6. Model-based methods: learning a model 59

2.7. Conclusion 63

2.8. Bibliography 63

Chapter 3. Approximate Dynamic Programming 67
Rémi MUNOS

3.1. Introduction 68

3.2. Approximate value iteration (AVI) 70

3.3. Approximate policy iteration (API) 77

3.4. Direct minimization of the Bellman residual 87

3.5. Towards an analysis of dynamic programming in Lp-norm 88

3.6. Conclusions 93

3.7. Bibliography 93

Chapter 4. Factored Markov Decision Processes 99
Thomas DEGRIS and Olivier SIGAUD

4.1. Introduction 99

4.2. Modeling a problem with an FMDP 100

4.3. Planning with FMDPs 108

4.4. Perspectives and conclusion 122

4.5. Bibliography 123

Chapter 5. Policy-Gradient Algorithms 127
Olivier BUFFET

5.1. Reminder about the notion of gradient 128

5.2. Optimizing a parameterized policy with a gradient algorithm 130

5.3. Actor-critic methods 143

5.4. Complements 147

5.5. Conclusion 150

5.6. Bibliography 150

Chapter 6. Online Resolution Techniques 153
Laurent PÉRET and Frédérick GARCIA

6.1. Introduction 153

6.2. Online algorithms for solving an MDP 155

6.3. Controlling the search 167

6.4. Conclusion 180

6.5. Bibliography 180

PART 2. BEYOND MDPS 185

Chapter 7. Partially Observable Markov Decision Processes 187
Alain DUTECH and Bruno SCHERRER

7.1. Formal definitions for POMDPs 188

7.2. Non-Markovian problems: incomplete information 196

7.3. Computation of an exact policy on information states 202

7.4. Exact value iteration algorithms 207

7.5. Policy iteration algorithms 222

7.6. Conclusion and perspectives 223

7.7. Bibliography 225

Chapter 8. Stochastic Games 229
Andriy BURKOV, Laëtitia MATIGNON and Brahim CHAIB-DRAA

8.1. Introduction 229

8.2. Background on game theory 230

8.3. Stochastic games 245

8.4. Conclusion and outlook 269

8.5. Bibliography 270

Chapter 9. DEC-MDP/POMDP 277
Aurélie BEYNIER, François CHARPILLET, Daniel SZER and Abdel-Illah MOUADDIB

9.1. Introduction 277

9.2. Preliminaries 278

9.3. Multi agent Markov decision processes 279

9.4. Decentralized control and local observability 280

9.5. Sub-classes of DEC-POMDPs 285

9.6. Algorithms for solving DEC-POMDPs 295

9.7. Applicative scenario: multirobot exploration 310

9.8. Conclusion and outlook . . . 312

9.9. Bibliography 313

Chapter 10. Non-Standard Criteria 319
Matthieu BOUSSARD, Maroua BOUZID, Abdel-Illah MOUADDIB, Régis SABBADIN and Paul WENG

10.1. Introduction 319

10.2. Multicriteria approaches 320

10.3. Robustness in MDPs 327

10.4. Possibilistic MDPs 329

10.5. Algebraic MDPs 342

10.6. Conclusion 354

10.7. Bibliography 355

PART 3. APPLICATIONS 361

Chapter 11. Online Learning for Micro-Object Manipulation 363
Guillaume LAURENT

11.1. Introduction 363

11.2. Manipulation device 364

11.3. Choice of the reinforcement learning algorithm 367

11.4. Experimental results 370

11.5. Conclusion 373

11.6. Bibliography 373

Chapter 12. Conservation of Biodiversity 375
Iadine CHADÈS

12.1. Introduction 375

12.2. When to protect, survey or surrender cryptic endangered species 376

12.3. Can sea otters and abalone co-exist? 381

12.4. Other applications in conservation biology and discussions 391

12.5. Bibliography 392

Chapter 13. Autonomous Helicopter Searching for a Landing Area in an Uncertain Environment 395
Patrick FABIANI and Florent TEICHTEIL-KÖNIGSBUCH

13.1. Introduction 395

13.2. Exploration scenario 397

13.3. Embedded control and decision architecture 401

13.4. Incremental stochastic dynamic programming 404

13.5. Flight tests and return on experience 407

13.6. Conclusion 410

13.7. Bibliography 410

Chapter 14. Resource Consumption Control for an Autonomous Robot 413
Simon LE GLOANNEC and Abdel-Illah MOUADDIB

14.1. The rover’s mission 414

14.2. Progressive processing formalism 415

14.3. MDP/PRU model 416

14.4. Policy calculation 418

14.5. How to model a real mission 419

14.6. Extensions 422

14.7. Conclusion 423

14.8. Bibliography 423

Chapter 15. Operations Planning 425
Sylvie THIÉBAUX and Olivier BUFFET

15.1. Operations planning 425

15.2. MDP value function approaches 433

15.3. Reinforcement learning: FPG 442

15.4. Experiments 446

15.5. Conclusion and outlook 448

15.6. Bibliography 450

Index 453

About the Author :

Olivier Sigaud is a Professor of Computer Science at the University of Paris 6 (UPMC). He is the Head of the "Motion" Group in the Institute of Intelligent Systems and Robotics (ISIR).
Olivier Buffet has been an INRIA researcher in the Autonomous Intelligent Machines (MAIA) team of theLORIA laboratory, since November 2007.

Review :
"As an overall conclusion, this book is an extensive presentation of MDPs and their applications in modeling uncertain decision problems and in reinforcement learning." (Zentralblatt MATH, 2011)

"The range of subjects covered is fascinating, however, from game-theoretical applications to reinforcement learning, conservation of biodiversity and operations planning. Oriented towards advanced students and researchers in the fields of both artificial intelligence and the study of algorithms as well as discrete mathematics." (Book News, September 2010)

Best Sellers
See All

Quick View

Too Good To Be True Prajakta Koli

4.3

AED46

Quick View

Thank You for Leaving Rithvik Singh

2.0

AED44

Quick View

Atomic Habits (EXP) James Clear

0.0

AED99

Quick View

My First Library

4.1

AED64

Quick View

Dopamine Detox Thibaut Meurisse

0.0

AED43

Quick View

Money Myths and Mantras Devina Mehra

4.7

AED46

Quick View

Meditations Marcus Aurelius

4.3

AED42

Quick View

Harry Potter Box Set: The Complete Collection (Children’s Paperback) J.K. Rowling

4.3

AED238

Quick View

Atomic Habits James Clear

4.6

AED60

Quick View

The Art of Being Alone Renuka Gavrani

5.0

AED44

Quick View

Animals Tales From Panchtantra

4.5

AED46

Quick View

My First Book of Patterns Pencil Control

4.6

AED20

Product Details

ISBN-13: 9781118620106
Publisher: John Wiley & Sons Inc
Publisher Imprint: Wiley-ISTE
Language: English

ISBN-10: 1118620100
Publisher Date: 04 Mar 2013
Binding: Digital (delivered electronically)
No of Pages: 480

Related Categories

Similar Products

Markov Decision Processes in Artificial Intelligence

Quick View

Markov Decision Processes...Olivier Sigaud

No Review Yet

AED 0

Quick View

Markov Decision Processes...Olivier Sigaud

No Review Yet

AED 0

Markov Decision Processes & Artificial Intelligence

Quick View

Markov Decision Processes...O Sigaud

No Review Yet

AED 573

Quick View

Markov Decision Processes...O Sigaud

No Review Yet

AED 548

Quick View

Markov Decision ProcessesPaul Thie

No Review Yet

AED 34

Quick View

Markov Decision ProcessesM.L. Puterman

No Review Yet

AED 371

Quick View

Markov Decision ProcessesMartin L Puterman (Univ. of British Columbia)

No Review Yet

AED 0

Quick View

Markov Decision ProcessesD. J White

No Review Yet

AED 940

Quick View

Markov Decision ProcessesD. J. White

No Review Yet

AED 166

Quick View

Markov Decision ProcessesMartin L. Puterman

No Review Yet

AED 0

Quick View

Markov Decision ProcessesMartin L. Puterman

No Review Yet

AED 144

Quick View

Markov Decision ProcessesD. J. White

No Review Yet

AED 897

Quick View

Markov Decision ProcessesMartin L. Puterman

4.3

(4)

AED 646

Markov Decision Processes with Their Applications

Quick View

Markov Decision Processes...Qiying Hu

No Review Yet

AED 579

Quick View

Examples in Markov Decisi...A B Piunovskiy

No Review Yet

AED 443

Quick View

Competitive Markov Decisi...Koos Vrieze

No Review Yet

AED 78

Quick View

Examples in Markov Decisi...A B Piunovskiy

No Review Yet

AED 272

Quick View

Markov Decision Processes...Qiying Hu

No Review Yet

AED 22

Quick View

Examples in Markov Decisi...A B Piunovskiy

No Review Yet

AED 0

Quick View

Constrained Markov Decisi...Eitan Altman

No Review Yet

AED 748

Quick View

Markov Decision Processes...

No Review Yet

AED 0

Add Photo

Caption

Add Photo

Customer Reviews

REVIEWS 0
Click Here To Be The First to Review this Product

John Wiley & Sons Inc -
Markov Decision Processes in Artificial Intelligence

Writing guidlines
We want to publish your review, so please:

keep your review on the product. Review's that defame author's character will be rejected.
Keep your review focused on the product.
Avoid writing about customer service. contact us instead if you have issue requiring immediate attention.
Refrain from mentioning competitors or the specific price you paid for the product.
Do not include any personally identifiable information, such as full names.

Markov Decision Processes in Artificial Intelligence

Required fields are marked with *

Overall Rating
Please select Star.

Review Title*

Review

Add Photo Add up to 6 photos

Would you recommend this product to a friend?

Tag this Book Read more

Does your review contain spoilers?

Required: Does your review contain spoilers?

What type of reader best describes you?

User Name

Location

I agree to the terms & conditions

Required: Agreements

You may receive emails regarding this submission. Any emails will include the ability to opt-out of future communications.

CUSTOMER RATINGS AND REVIEWS AND QUESTIONS AND ANSWERS TERMS OF USE

These Terms of Use govern your conduct associated with the Customer Ratings and Reviews and/or Questions and Answers service offered by Bookswagon (the "CRR Service").

By submitting any content to Bookswagon, you guarantee that:

You are the sole author and owner of the intellectual property rights in the content;
All "moral rights" that you may have in such content have been voluntarily waived by you;
All content that you post is accurate;
You are at least 13 years old;
Use of the content you supply does not violate these Terms of Use and will not cause injury to any person or entity.

You further agree that you may not submit any content:

That is known by you to be false, inaccurate or misleading;
That infringes any third party's copyright, patent, trademark, trade secret or other proprietary rights or rights of publicity or privacy;
That violates any law, statute, ordinance or regulation (including, but not limited to, those governing, consumer protection, unfair competition, anti-discrimination or false advertising);
That is, or may reasonably be considered to be, defamatory, libelous, hateful, racially or religiously biased or offensive, unlawfully threatening or unlawfully harassing to any individual, partnership or corporation;
For which you were compensated or granted any consideration by any unapproved third party;
That includes any information that references other websites, addresses, email addresses, contact information or phone numbers;
That contains any computer viruses, worms or other potentially damaging computer programs or files.

You agree to indemnify and hold Bookswagon (and its officers, directors, agents, subsidiaries, joint ventures, employees and third-party service providers, including but not limited to Bazaarvoice, Inc.), harmless from all claims, demands, and damages (actual and consequential) of every kind and nature, known and unknown including reasonable attorneys' fees, arising out of a breach of your representations and warranties set forth above, or your violation of any law or the rights of a third party.

For any content that you submit, you grant Bookswagon a perpetual, irrevocable, royalty-free, transferable right and license to use, copy, modify, delete in its entirety, adapt, publish, translate, create derivative works from and/or sell, transfer, and/or distribute such content and/or incorporate such content into any form, medium or technology throughout the world without compensation to you. Additionally, Bookswagon may transfer or share any personal information that you submit with its third-party service providers, including but not limited to Bazaarvoice, Inc. in accordance with Privacy Policy.

All content that you submit may be used at Bookswagon's sole discretion. Bookswagon reserves the right to change, condense, withhold publication, remove or delete any content on Bookswagon's website that Bookswagon deems, in its sole discretion, to violate the content guidelines or any other provision of these Terms of Use. Bookswagon does not guarantee that you will have any recourse through Bookswagon to edit or delete any content you have submitted. Ratings and written comments are generally posted within two to four business days. However, Bookswagon reserves the right to remove or to refuse to post any submission to the extent authorized by law. You acknowledge that you, not Bookswagon, are responsible for the contents of your submission. None of the content that you submit shall be subject to any obligation of confidence on the part of Bookswagon, its agents, subsidiaries, affiliates, partners or third party service providers (including but not limited to Bazaarvoice, Inc.)and their respective directors, officers and employees.

Fresh on the Shelf
See All

Quick View

The Penguin History of Early India Thapar, Romila

4.2

AED61

Quick View

Poor Economics Abhijit Banerjee

4.9

AED52

Quick View

The Loom Of Time Kalidasa

4.8

AED42

Quick View

The Clash of Civilisations And the Making of the New Order Samuel P. Huntington

4.3

AED61

Quick View

The Economics of Small Things Sudipta Sarangi

4.4

AED62

Quick View

Jesus Lived In India Holger Kersten

4.0

AED48

Quick View

Ultimate Goal Vikram Sood

4.9

AED35

Quick View

A Case Of Exploding Mangoes Mohammed Hanif

4.1

AED50

Quick View

Imagining India Nandan Nilekani

4.5

AED64

Quick View

The Best Thing About You Is You! (Revised Edition) Anupam Kher

4.5

AED47

Quick View

The Comrades and the Mullahs Ananth Krishnan

4.5

AED59

Inspired by your browsing history

Quick View

Markov Decision Processes in Artificial Intelligence Olivier Sigaud

No Review Yet

AED0

Quick View

Reprogram Your Mind for Better Relationships Haris Nik

No Review Yet

AED72

Quick View

Réfutation De L'écrit De M. Le Duc De Rovigo: Avec Pièces Justificatives Et Des Observations Sur Les Explications De M. Le Comte Hullin; Suivie De L'éloge De M. Le Duc D'enghien Antoine F. Maquart

No Review Yet

AED96

Quick View

Non-Accelerator Astroparticle Physics - Proceedings of the 7th School, Ictp, Trieste, Italy 26 July - 6 August 2004 R A Carrigan

No Review Yet

AED0

Quick View

Making Progress in Writing MS Eve Bearne

No Review Yet

AED0

Quick View

Targeting Maths 2 Blake Publishing - Judy Tertini

No Review Yet

AED0

Quick View

Scheherazade Anthony O'Neill

No Review Yet

AED0

Quick View

Historic Fields and Mansions of Middlesex Samuel Adams 1833-1905 Drake

No Review Yet

AED63

Quick View

Welsh Baritones

No Review Yet

AED52

Quick View

Cairn Terrier Guide Cairn Terrier Guide Includes Blake Rees

No Review Yet

AED28

Quick View

Les Comores

No Review Yet

AED97

Quick View

Progress in Optics Emil Wolf

No Review Yet

AED923

Quick View

Bw Booklets Cat Bound 2nd Ed Geoffrey Kellow

No Review Yet

AED80

Quick View

Square Dance Mom Artee's Square Dance Publishing

No Review Yet

AED30

Your review has been submitted!

You've already reviewed this product!

Markov Decision Processes in Artificial Intelligence

Markov Decision Processes in Artificial Intelligence Format: Digital (delivered electronically)

Best Sellers See All

Similar Products

Customer Reviews

Markov Decision Processes in Artificial Intelligence

Fresh on the Shelf See All

Inspired by your browsing history

Markov Decision Processes in Artificial Intelligence

Format: Digital (delivered electronically)

Best Sellers
See All

Fresh on the Shelf
See All