×
+ All Categories
Log in
English
Français
Español
Deutsch
The top documents tagged [waytarget policy]
Home >
waytarget policy
Off-Policy Temporal-Difference Learning with Function Approximation Doina Precup McGill University Rich Sutton Sanjoy Dasgupta AT&T Labs.
218 views