Reinforcement Learning For Adaptive Dialogue Systems by Verena Rieser & Oliver Lemon