Barto, A. G.; Bradtke, S. J.; and Singh, S. P. 1995. Learning to Act Using Real-Time Dynamic Programming. Artificial Intelligence 72(1): 81-138.
Referenced by page: A Single-Plan Approach