Markov Decision Processes in Practice

This book presents classical Markov Decision Processes (MDP) for real-life applications and optimization. MDP allows users to develop and formally support approximate and simple decision rules, and this book showcases state-of-the-art applications in which MDP was key to the solution approach.  The book is divided into six parts. Part 1 is devoted to the state-of-the-art theoretical foundation of MDP, including approximate methods such as policy improvement, successive approximation and infinite state spaces as well as an instructive chapter on Approximate Dynamic Programming. It then continues with five parts of specific and non-exhaustive application areas. Part 2 covers MDP healthcare applications, which includes different screening procedures, appointment scheduling, ambulance scheduling and blood management. Part 3 explores MDP modeling within transportation.  This ranges from public to private transportation, from airports and traffic lights to car parking or charging your electric car . Part 4 contains three chapters that illustrates the structure of approximate policies for  production or manufacturing structures. In Part 5, communications is highlighted as an important application area for MDP. It includes Gittins indices, down-to-earth call centers and wireless sensor networks. Finally Part 6 is dedicated to financial modeling, offering an instructive review to account for financial portfolios and derivatives under proportional transactional costs. The MDP applications in this book illustrate a variety of both standard and non-standard aspects of MDP modeling and its practical use. This book should appeal to readers for practitioning, academic research and educational purposes, with a background in, among others, operations research, mathematics, computer science, and industrial engineering.



Richard Boucherie received M.Sc. degrees in 1988 in applied mathematics and theoretical physics from the Universiteit Leiden, and received the Ph.D. degree in econometrics in 1992 from the Vrije Universiteit, Amsterdam. Since 2000 he is with the department of Applied Mathematics of the University of Twente, where he was appointed in 2003 as full professor of Stochastic Operations Research.

His research interests are in queueing theory, Petri nets and random walks with application areas including wireless and sensor networks, healthcare, road traffic, and network intrusion detection and prevention. Richard is co-founder of the University of Twente Center for Healthcare Operations Improvement and Research (CHOIR) in the area of healthcare logistics, and chair of the Postdoctorate programme in healthcare logistics. In 2014 he co-founded the spin-off company Rhythm, that carries out actual implementations of healthcare logistics solutions in healthcare organisations.  <

Nico M. van Dijk  has been active in the area of Stochastic Operations Research for over 30 years. He has always been stimulated by real-life stochastics , as reflected by his Ph.D. on Controlled Markov Processes: Time-discretization (1983), by his Wiley book on Queueing Networks and Product Forms : A systems approach (1993) and by practical application papers such as on communications, call centers, railways and ICU (Intensive care unit) systems. For a decade he works in close cooperation with the Dutch Blood banks. For joint work on blood supply, the OR team that he guided, became finalist for the EURO 2009 Excellence in OR Practice Award and received the INFORMS 2011 First Prize Interactive Poster Award. He is affiliated with the Stochastic Operations Research group at the University of Twente.

Verwandte Artikel