Learning Representation And Control In Markov Decision Processes by Sridhar Mahadevan