web.stanford.eduweb.stanford.edu/class/cs234/past_projects/2017/... · Deep Reinforcement Learning...

Date post:	27-Apr-2020
Category:	Documents
Upload:	others
View:	4 times
Download:	0 times

Download Report this document

Share this document with a friend

Embed Size (px):

Transcript

Page 1: web.stanford.eduweb.stanford.edu/class/cs234/past_projects/2017/... · Deep Reinforcement Learning based Chinese Chess Player Chengshu Li Kedao wangl Zihua Liu 2. Related Work Early

Page 2: web.stanford.eduweb.stanford.edu/class/cs234/past_projects/2017/... · Deep Reinforcement Learning based Chinese Chess Player Chengshu Li Kedao wangl Zihua Liu 2. Related Work Early

Page 3: web.stanford.eduweb.stanford.edu/class/cs234/past_projects/2017/... · Deep Reinforcement Learning based Chinese Chess Player Chengshu Li Kedao wangl Zihua Liu 2. Related Work Early

Page 4: web.stanford.eduweb.stanford.edu/class/cs234/past_projects/2017/... · Deep Reinforcement Learning based Chinese Chess Player Chengshu Li Kedao wangl Zihua Liu 2. Related Work Early

Page 5: web.stanford.eduweb.stanford.edu/class/cs234/past_projects/2017/... · Deep Reinforcement Learning based Chinese Chess Player Chengshu Li Kedao wangl Zihua Liu 2. Related Work Early

Page 6: web.stanford.eduweb.stanford.edu/class/cs234/past_projects/2017/... · Deep Reinforcement Learning based Chinese Chess Player Chengshu Li Kedao wangl Zihua Liu 2. Related Work Early

Page 7: web.stanford.eduweb.stanford.edu/class/cs234/past_projects/2017/... · Deep Reinforcement Learning based Chinese Chess Player Chengshu Li Kedao wangl Zihua Liu 2. Related Work Early

Page 8: web.stanford.eduweb.stanford.edu/class/cs234/past_projects/2017/... · Deep Reinforcement Learning based Chinese Chess Player Chengshu Li Kedao wangl Zihua Liu 2. Related Work Early

1 Final Report 100 / 100

+ 0 pts Correct

+ 12 pts Clear description of the problem, and having it be clearly related to reinforcement learning

+ 8 pts Why is the problem important / significant / hard

+ 12 pts If the proposal is to tackle a new domain: why will the new domain be harder than prior work? Why

choose this?

+ 12 pts If the proposal is a new algorithm (plus potentially a new domain): what are the limitations of prior

approaches?

+ 12 pts If doing a replication study: why choose to replicate this particular algorithm, and why choose the

domains that you did?

+ 60 pts Provide a clear description of what was done and accomplished

+ 8 pts what are the next steps / open issues

+ 50 pts Description of work completed was a bit sparse in places

+ 4 pts Good but not detailed description of next steps

+ 55 pts description of work completed could've been further described in some places

+ 0 pts Click here to replace this description.

+ 100 Point adjustment

Nice work!

Much of�the paper I was wondering if you'd try using MCTS on top of the RL agent. I think that could

substantially further improve the results and can leverage the fact that the agent doesn't need to learn the

dynamics and reward. It would�

be interesting to hear what happens if you do try this!

Page 9

Documents

Presents PARADESmissouliantech.com/wonder/past_projects/2009/worldofWonders111… · SOURCES: World Book Encyclopedia, World Book Inc.; Decorate your own float 1. Use crepe paper

Documents

VIPER - Technical Manualspacegrant.colorado.edu/COSGC_Projects/Past_Projects/dino/Software/...Technical Manual Product Information ... Whilst Arcom’s sales team is always available

Documents

A collagenous protective coat enables Metarhizium ...A collagenous protective coat enables Metarhizium anisopliaeto evade insect immune responses Chengshu Wang and Raymond J. St. Leger*

Documents

Shenzhen Chengshu Technology Co.,LtdAdd：B, NO.21 liJing Center, Center City Longgang District, Shenzhen, Guangdong, China Tel：0755-28920996 84228306 Fax：0755-28904637 Http:

Documents

Developmental and Transcriptional Responses to Host and ... · Cuticles by the Speciﬁc Locust Pathogen Metarhizium anisopliae var. acridum† Chengshu Wang and Raymond J. St. Leger*

Documents

Structural Analysis of Tie-Back Retaining WallBack ...richardson.eng.ua.edu/.../Past_Projects/Ridgecrest_Retaining_Wall.pdf · Structural Analysis of Tie-Back Retaining WallBack Retaining

Documents

Mastering the game of Go from scratch - Stanford Universityweb.stanford.edu/class/cs234/CS234Win2019/past_projects/... · 2020-01-02 · Mastering the game of Go from scratch dimensional,

Documents

of GPUs? · Category: Astronomy & Astrophysics poster As01 contact name Long Wang: [email protected] Weighted essentially non-oscillatory (WENO) is a high order finite-difference method,

Documents

An In tegrated Solution for Secure Group Comm …sprout.ics.uci.edu/past_projects/cliques/paper/actt01.pdf · unicati on system to pro ... tal results obtained with a protot yp e

Documents

Adversarially Robust Policy Learning through Active ...web.stanford.edu/class/cs234/past_projects/2017/... · Ajay Mandlekar, Yuke Zhu, Animesh Garg, Li Fei-Fei, Silvio Savarese1

Documents

Introduction Results Algorithm - Stanford Universityweb.stanford.edu/class/cs234/past_projects/2017/2017_Sachidanan… · Introduction Results Algorithm Discussion & Future Work Online

Documents

Mastering the game of Go from scratch - Stanford Universityweb.stanford.edu/class/cs234/past_projects/2017/... · Mastering the game of Go from scratch Michael Painter *1Luke Johnston

Documents

Linbing Wang, Ph.D., P.E., Professor, [email protected] ...Gene Boyd-Vulcan Materials Scholarship, 1997, Georgia Tech First Award for Rational Proposals, 1990, Anhui Electric Power Design

Documents

FINAL REPORT Occupational health and safety experience of ...depts.washington.edu/nwcohs/documents/past_projects/0506Seixas.pdfDay laborers commonly reported receiving some training

Documents

herosupermarket.co.idherosupermarket.co.id/wp-content/uploads/2017/03/FA-FULL-LWok-24... · Sari Wangl Coffee 15x40Gr,' 15x25Gr,' Yoghurt 125Gr (All Variant) 15x56Gr,' 17»x40Gr (All

Documents

Risk Assessment and Loss Prevention of LNG Carriers_KS Wangl

Documents

ELEPHANTS - missouliantech.commissouliantech.com/wonder/past_projects/2010/elephants.pdf · Elephants chew by moving the jaw from side to side in a grinding motion. Elephants are

Documents