LEARNING TO MAKE DECISIONS WITH INCOMPLETE INFORMATION: REINFORCEMENT LEARNING, INFORMATION GEOMETRY, AND REAL-LIFE APPLICATIONS
COM2 Level 4
Executive Classroom, COM2-04-02
closeAbstract:
Intelligence is the ability of accumulating information, processing these information in the form of general constructions, and learning these constructions to adapt to the environment. The components of intelligence- information accumulation, processing and learning, lead to efficient and effective decision making. While imbibing these components in a reinforcement learning algorithm, the reward function or the underlying dynamics of the decision making process is not often known a priori. Thus, the problems of learning by exploration, optimising the decision by exploitation, and balancing exploration-exploitaition take the central stage in designing a reinforcement learning algorithm.
We address these issues mathematically, algorithmically, and experimentally through various problem models, solution methodologies, and real-life applications respectively. We investigate multi-armed bandits, and Markov decision processes as the problem models. We use online functional approximation and optimisation, and information geometry as the solution methodologies. We apply the developed methodologies to address the real-life problems such as automated database tuning, energy- and performance-efficient live virtual machine migration in Clouds, speed optimisation of ships, and online scheduling of jobs arriving in queues to servers.
We also propose extension of the information geometric framework that addresses information accumulation, processing, and learning, for Markov decision processes.