
Reinforcement Learning for Autonomous Mobile Robots

Autonomous mobile robots, such as Automated Guided Vehicles or soccer robots, are powered by smart embedded software. This software lets the robot sense the world, share its beliefs about the world state with other robots, and select the best action to perform next based on those beliefs. The robot's behavior emerges from the action selection policy coded in the embedded software. A well-designed, rational policy makes the robot maximize its expected utility.

Traditionally, action selection policies for robots are modelled and implemented using hierarchical state machines or behavior trees. Although these methods are very good at describing the hierarchical decomposition of actions into smaller actions, they are quite rigid in specifying how to choose among actions at runtime. The rigidity comes from the fact that actions are selected using hardcoded conditions on the world state. These conditions are generally inflexible and suboptimal.
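To make the rigidity concrete, here is a minimal sketch of what hardcoded action selection can look like. The `WorldState` fields, the action names and the distance thresholds are all hypothetical and chosen for illustration only; they are not taken from the speakers' robots.

```python
from dataclasses import dataclass


@dataclass
class WorldState:
    """Hypothetical, heavily simplified world state of a soccer robot."""
    has_ball: bool
    ball_distance: float  # metres (assumed unit)
    goal_distance: float  # metres (assumed unit)


def select_action(state: WorldState) -> str:
    """Pick an action using fixed, hand-written conditions.

    Every threshold below is an arbitrary design-time guess, which is
    exactly the kind of inflexibility described above.
    """
    if state.has_ball and state.goal_distance < 3.0:
        return "shoot"
    if state.has_ball:
        return "dribble"
    if state.ball_distance < 5.0:
        return "approach_ball"
    return "hold_position"


if __name__ == "__main__":
    print(select_action(WorldState(has_ball=True, ball_distance=0.0, goal_distance=2.5)))
```

Whether 3.0 metres is the right shooting distance depends on the opponent, the floor and the robot itself, yet the rule cannot adapt once it has been coded.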

Our research deals with replacing hardcoded action selection with a more flexible mechanism. Rather than specifying action selection conditions, we want our robot to learn a good policy through training. To attain this goal we use “reinforcement learning”. We make the robot learn a non-linear policy function that maps a world state onto action utilities. A model of the robot explores a simulated but realistic environment, where it gets feedback on its actions in the form of rewards. It gradually improves its policy function by maximizing the rewards it collects.
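The learning loop described above can be sketched with tabular Q-learning, a basic reinforcement learning algorithm. This is an illustrative stand-in, not the speakers' implementation: a lookup table replaces the non-linear policy function, and the simulated environment is a hypothetical six-cell corridor with a goal at the far end. The reward values and learning parameters are assumptions.

```python
import random

N_CELLS = 6            # hypothetical corridor: cells 0..5
GOAL = N_CELLS - 1     # reaching the last cell ends an episode
ACTIONS = (-1, +1)     # action 0 = move left, action 1 = move right


def step(state, action_index):
    """Simulated environment: returns (next_state, reward, done)."""
    next_state = min(max(state + ACTIONS[action_index], 0), GOAL)
    done = next_state == GOAL
    reward = 1.0 if done else -0.01  # small cost per step (assumed values)
    return next_state, reward, done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Learn a table q[state][action] of estimated action utilities."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_CELLS)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy: mostly exploit, occasionally explore.
            if rng.random() < epsilon:
                action = rng.randrange(2)
            else:
                action = max(range(2), key=lambda i: q[state][i])
            next_state, reward, done = step(state, action)
            # Q-learning update: move the estimate towards the reward
            # plus the discounted best utility of the next state.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best action index for every non-goal cell."""
    return [max(range(2), key=lambda i: q[s][i]) for s in range(GOAL)]


if __name__ == "__main__":
    print(greedy_policy(train()))
```

After training, the greedy policy should select "move right" in every cell, without that rule ever having been written down. For realistic world states a table becomes impractical, which is where a non-linear function approximator such as a neural network would take its place.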

Tags: AI


Eric Dortmans and Peter Lambooij, Lecturer / Researcher, FHICT

About Eric

Eric Dortmans has an MSc degree in Electrical Engineering from the TU/e. He worked as a researcher at the TU/e from 1978 to 1983 and at Océ-Technologies BV R&D from 1983 to 2008. From 2004 to 2008 he also acted as lector of Embedded Systems Architecture at Fontys Hogeschool ICT. Since 2008 he has been a senior lecturer in ICT & Technology and a researcher in the High Tech Embedded Software lectorate as well as in Mechatronics & Robotics.

About Peter

Peter Lambooij has a PhD in Physics. He is a lecturer in ICT and Technology. He is also involved with Fontys’ Minor Applied Data Science as well as the Human Technology Interaction group at Eindhoven University of Technology. He is a researcher in the High Tech Embedded Software lectorate.