Projectivities and continuous action spaces.

Date post: 03-Jun-2015
Upload: r-uribe
Projectivities and continuous action spaces, feat. Lena. Reinaldo Uribe M, Sept 30, 2013
Transcript
Page 1: Projectivities and continuous action spaces.

Projectivities and continuous action spaces
feat. Lena

Reinaldo Uribe M

Sept 30, 2013

Page 2: Projectivities and continuous action spaces.

w − l space as a projective transformation from policy value/cost space

[Figure: policy value plotted against episode length; axis marks 1, D and −D.]

Page 3: Projectivities and continuous action spaces.

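w − l space as a projective transformation from policy value/cost space

[Figure, two panels. First panel: policy value plotted against episode length; axis marks 1, D and −D. Second panel: the w − l plane, axes w and l; axis marks D and −D.]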
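As a reminder of what a projective transformation of the plane does, here is a minimal sketch in Python with NumPy. It is not the map used on these slides: the transcript does not give the transformation into w − l space, so the matrix `H_example` and the function name `apply_projectivity` are hypothetical and chosen only for illustration. Points are lifted to homogeneous coordinates, multiplied by an invertible 3×3 matrix, and divided by the last coordinate.

```python
import numpy as np

def apply_projectivity(H, points):
    """Apply a 3x3 projective transformation H to an (n, 2) array of points."""
    pts = np.asarray(points, dtype=float)
    # Lift (x, y) to homogeneous coordinates (x, y, 1).
    homogeneous = np.hstack([pts, np.ones((pts.shape[0], 1))])
    mapped = homogeneous @ np.asarray(H, dtype=float).T
    # Divide by the last coordinate to return to the plane.
    return mapped[:, :2] / mapped[:, 2:3]

# Hypothetical matrix, NOT taken from the slides:
# it sends (x, y) to (x / (y + 1), y / (y + 1)).
H_example = np.array([[1.0, 0.0, 0.0],
                      [0.0, 1.0, 0.0],
                      [0.0, 1.0, 1.0]])

# Three collinear points on the line y = x + 1.
line_points = np.array([[0.0, 1.0], [1.0, 2.0], [2.0, 3.0]])
mapped_points = apply_projectivity(H_example, line_points)
```

The map is not linear in the plane, but it sends straight lines to straight lines, so the three images in `mapped_points` are again collinear.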

Page 4: Projectivities and continuous action spaces.

w − l space as a projective transformation from policy value/cost space

[Figure: policy value plotted against episode length; axis marks 1, D and −D.]

Page 5: Projectivities and continuous action spaces.

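w − l space as a projective transformation from policy value/cost space

[Figure, two panels. First panel: policy value plotted against episode length; axis marks 1, D and −D. Second panel: the w − l plane, axes w and l; axis marks D and −D.]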

Page 6: Projectivities and continuous action spaces.

Extension to continuous spaces
Sample task: two states, continuous actions

State $s_1$: $a_1 \in [0, 1]$, $r_1 = 1 + (a_1 - 0.5)^2$, $c_1 = 1 + a_1$

State $s_2$: $a_2 \in [0, 1]$, $r_2 = 1 + a_2$, $c_2 = 1 + (a_2 - 0.5)^2$

Page 7: Projectivities and continuous action spaces.

Extension to continuous spaces
Sample task: two states, continuous actions

State $s_1$: $a_1 \in [0, 1]$, $r_1 = 1 + (a_1 - 0.5)^2$, $c_1 = 1 + a_1$

State $s_2$: $a_2 \in [0, 1]$, $r_2 = 1 + a_2$, $c_2 = 1 + (a_2 - 0.5)^2$

Page 8: Projectivities and continuous action spaces.

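Extension to continuous spaces
Sample task: two states, continuous actions

Policy Space (Actions)

[Figure: the unit square of policies, a1 from 0 to 1 on the horizontal axis and a2 from 0 to 1 on the vertical axis.]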

Page 9: Projectivities and continuous action spaces.

Extension to continuous spaces
Sample task: two states, continuous actions

Policy Values and Costs

[Figure: policy value plotted against policy cost; both axes marked at 4.]
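The sample task's rewards and costs can be evaluated over the whole policy space with a few lines of Python. This is only a sketch under stated assumptions: the trailing 2 in the reward and cost expressions is read as a square, and, because the transition diagram did not survive in the transcript, an episode is assumed to visit each state exactly once, so that a policy's value and cost are the sums r1 + r2 and c1 + c2. The function names and the grid resolution are hypothetical.

```python
import numpy as np

# Per-state rewards and costs from the sample task.
def r1(a1):
    return 1.0 + (a1 - 0.5) ** 2

def c1(a1):
    return 1.0 + a1

def r2(a2):
    return 1.0 + a2

def c2(a2):
    return 1.0 + (a2 - 0.5) ** 2

def policy_value_and_cost(a1, a2):
    """Value and cost of the policy (a1, a2).

    ASSUMPTION (not stated in the slides): each state is visited once
    per episode, so value and cost are plain sums over the two states.
    """
    return r1(a1) + r2(a2), c1(a1) + c2(a2)

# Sample the policy space [0, 1] x [0, 1] on a regular grid.
grid = np.linspace(0.0, 1.0, 21)
A1, A2 = np.meshgrid(grid, grid)
V, C = policy_value_and_cost(A1, A2)
```

Under this assumption every sampled policy has value and cost between 2 and 3.25. Plotting `V` against `C` gives the image of the unit square of policies in value/cost space.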

Page 10: Projectivities and continuous action spaces.

Extension to continuous spaces
Sample task: two states, continuous actions

Policy Manifold in w − l

[Figure: the policy manifold in the w − l plane, axes l and w; both axes marked at D/2.]

