College of Computer and Information Science - cs6140 lec11 · 2017-04-06 · Today’s Outline •...

Post on 28-Mar-2020

1 views 0 download

transcript

4/6/17

1

CS6140:MachineLearningSpring2017

Instructor:LuWangCollegeofComputerandInformaAonScience

NortheasternUniversityWebpage:www.ccs.neu.edu/home/luwang

Email:luwang@ccs.neu.edu

LogisAcs•  GradesforA2isout.

•  Nextweek:courseprojectpresentaAon.

•  Thefinalreportisdueon4/24.Allassignmentshavetobeinby4/29.

•  4/20:finalexam

•  AddiAonalofficehours:–  4.17,4-5pm,(Lu,448WVH)–  4.18,11am-12pm,(TA,166WVH)–  4.19,4-5pm,(Lu,448WVH)

WhatwelearnedlastAme

•  IntroducAontoReinforcementLearning•  TheReinforcementLearningProblem•  MarkovDecisionProcess

4/6/17

2

4/6/17

3

4/6/17

4

4/6/17

5

Today’sOutline

•  PlanningbyDynamicProgramming– PolicyevaluaAonandpolicyimprovement– ValueiteraAon

[SlidestakenfromDavidSilver’sreinforcementlearningcourse]

4/6/17

6

4/6/17

7

4/6/17

8

4/6/17

9

4/6/17

10