+ All Categories
Home > Documents > MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank...

MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank...

Date post: 12-May-2020
Category:
Upload: others
View: 2 times
Download: 0 times
Share this document with a friend
63
hex knowledge mohex mohex 2.0 MoHex 2.0: pattern-based MCTS huang arneson hayward m¨ uller pawlewicz computing UAlberta [email protected] CG2013 aug 13 huang arneson hayward m¨ uller pawlewicz MoHex 2.0: pattern-based MCTS
Transcript
Page 1: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

MoHex 2.0: pattern-based MCTS

huang arneson hayward muller pawlewicz

computing UAlberta [email protected]

CG2013 aug 13

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 2: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

thank you

Natural Sciences and Engineering Research Council of Canada

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 3: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

1 hex

2 knowledge

3 mohex

4 mohex 2.0

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 4: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

HexShannon machine

1942 Hex

rules

black v white, alternate moves

win: connect sides

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 5: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

HexShannon machine

1942 Hex

rules

black v white, alternate moves

win: connect sides

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 6: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

HexShannon machine

properties

properties

no draw

n-by-n: 1st-player win

n-by-(n+k): longer-side win

Pspace-complete

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 7: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

HexShannon machine

Shannon’s birdcage machine

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 8: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

HexShannon machine

switching network

play on any graph

two marked vertices

black move: ‘short’ any vertex (make nbrs clique)

white move: ‘cut’ any vertex (delete)

black wins iff two marked vertices are shorted (connected)

generalizes Hex

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 9: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

HexShannon machine

switching network

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 10: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

HexShannon machine

switching network

T

T

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 11: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

HexShannon machine

switching network

T

T

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 12: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

knowledge

virtual connections

inferior cells

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 13: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

a virtual connection

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 14: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

a virtual connection

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 15: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

combining rule: and (full)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 16: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

combining rule: and (full)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 17: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

combining rule: and (full)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 18: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

combining rule: and (full)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 19: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

combining rule: and (full)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 20: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

combining rule: and (full)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 21: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

combining rule: and (semi)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 22: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

combining rule: and (semi)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 23: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

combining rule: and (semi)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 24: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

combining rule: or

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 25: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

combining rule: or

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 26: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

combining rule: or

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 27: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

combining rule: or

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 28: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

where must white play?

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 29: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

where must white play?

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 30: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

where must white play?

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 31: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

where must white play?

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 32: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

dead

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 33: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

black-dominated (dot superior)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 34: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

black-captured

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 35: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

black-dominated (dot superior)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 36: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

black-capture-reversible (to white dot)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 37: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

black fill decomposition

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 38: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

star decomposition

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 39: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

black star decomp domination

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 40: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

virtual connectionsinferior cells

modify H-search

and/or combining rules + capture

+ =

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 41: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

mohex framework

while time remains:traverse tree (repeat: select child, move to child)expand: leaf → nodeevaluate node: simulationupdate info: traverse from node back to root

select most-visited root-child as move

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 42: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

mohex simulation pattern

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 43: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

mohex simulation pattern

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 44: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

mohex simulation pattern

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 45: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

all moves as first

use RAVE, an AMAF heuristic

set exploration multiplier to 0 (so not UCT)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 46: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

ice/vce pruning

during traversal:if node becomes heavy

apply ICE/VCEprune inferior cellsprune non-mustplay

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 47: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

ice pruning

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 48: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

ice pruning

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 49: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

mohex flaws

weak without VCE, ICE

weak playouts

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 50: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

improvements

extend on unstable search

lazy delete obsolete subtrees

improved RAVE formulapatterns

estimate prior knowledgeprogressive biasprobabilistic simulations

experiments

future work

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 51: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

lazy delete obsolete subtree

move becomes obsolete ?

1) mark child obsolete

2) in traversal, before moving to a child, checkwhether obsolete: yes ? mark as proven loss

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 52: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

improved rave formula

U : UCT mean (wins/visits)

R: RAVE mean (wins/visits)n: parent visit countnj : node visit count

cb: constantw : RAVE term weight (decays ∼1 to 0 with nj)

E : UCT exploration formula cb ×

ln nnj

score(j) = (1− w)× (U + E ) + w × R

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 53: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

patterns

supervised learning minorization-maximization

15 000 11x11 mohex-wolve games (ignore 1st move)

20 000 13x13 strong little golem games

consider 6- 12- 18-cell patterns

65 900 global 6-,12-patterns (30 600 prunable)

11 600 local 6-,12-patterns (3 700 prunable)

prunable dead/captured, dominated: γ → 1e-5, 1e-4

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 54: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

patterns

(γ, p, a) = (886, 439, 479) (754,179,194)

(754,179,194) (321,48,64) (213,52,65)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 55: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

patterns

(194,2247,3259) (100,86,182) (98,94,191)

(.04,0,10190) (.05,3,14270) (.05,6,17351)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 56: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

estimating prior knowledge

check pattern of every available move

prunable ? move not considered

non-prunable ? ρ← relative global+local γ sum

unvisited node: RAVE score,count ← .5, 8

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 57: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

progressive bias

following Mango, . . .

Score(j) = (1− w)× (U + E ) + w × R + PB

following Castro, . . .

PB = cpb × ρ/√

nj + 1

from CLOP

cpb = 2.47

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 58: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

probabilistic simulations

use weights, generate moves stochastically via softmax

cap global γ max ← .157, by CLOP

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 59: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

probabilistic simulations

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 60: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

experiments

all openings

each player: 4 cores, 1.5Gb, 1-3-5 min/game

3000 13×13 games, each player 3-min/gameM-W (.587±.008) M2-W (.854±.006) 245 Elo

1000 games M2-M:

time/playerboard size 1 min 3 min 5 min

11×11 .811 ± .01013×13 .853 ± .006 .852 ± .006 .856 ± .010

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 61: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

failures

hand-crafted patternssavebridge + breakbridge + ladderwin rate .6/10K .5/100K

degrade RAVE by distance to last move

move criticality

. . .

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 62: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

future worka

a

b

b

c

c

d

d

e

e

f

f

g

g

h

h

i

i

j

j

k

k

1 1

2 2

3 3

4 4

5 5

6 6

7 7

8 8

9 9

10 10

11 11

1S

3

4

5

6

7

8

9

10

11

12

13

14

1516

17

18

1920

2122

23

24

25 26

W:Panoramex B:MoHex (2011 Olympiad)

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS

Page 63: MoHex 2.0: pattern-based MCTShayward/talks/hex.m2icga.pdf · hex knowledge mohex mohex 2.0 thank you Natural Sciences and Engineering Research Council of Canada huang arneson hayward

hexknowledge

mohexmohex 2.0

thank you

Natural Sciences and Engineering Research Council of Canada

huang arneson hayward muller pawlewicz MoHex 2.0: pattern-based MCTS


Recommended