Post on 28-Mar-2020
transcript
4/6/17
1
CS6140:MachineLearningSpring2017
Instructor:LuWangCollegeofComputerandInformaAonScience
NortheasternUniversityWebpage:www.ccs.neu.edu/home/luwang
Email:luwang@ccs.neu.edu
LogisAcs• GradesforA2isout.
• Nextweek:courseprojectpresentaAon.
• Thefinalreportisdueon4/24.Allassignmentshavetobeinby4/29.
• 4/20:finalexam
• AddiAonalofficehours:– 4.17,4-5pm,(Lu,448WVH)– 4.18,11am-12pm,(TA,166WVH)– 4.19,4-5pm,(Lu,448WVH)
WhatwelearnedlastAme
• IntroducAontoReinforcementLearning• TheReinforcementLearningProblem• MarkovDecisionProcess
4/6/17
2
4/6/17
3
4/6/17
4
4/6/17
5
Today’sOutline
• PlanningbyDynamicProgramming– PolicyevaluaAonandpolicyimprovement– ValueiteraAon
[SlidestakenfromDavidSilver’sreinforcementlearningcourse]
4/6/17
6
4/6/17
7
4/6/17
8
4/6/17
9
4/6/17
10