Handbook Of Learning And Approximate Dynamic Progr Amming by Andrew G. Barto, Jennie Si & Andy Barto