Post on 31-Dec-2015
Dynamic Programming
Dynamic Programming is applicable when sub-problems are not independent, i.e. they overlap!
In the case of Divide and Conquer the sub-problems are independent.
Often used in optimization problems
Fib(5)
├── Fib(4)
│   ├── Fib(3)
│   └── Fib(2)
└── Fib(3)
    ├── Fib(2)
    └── Fib(1)
Development of a DP algorithm
1. Characterize the structure of an optimal solution
2. Recursively define the value of an optimal solution
3. Compute the value of an optimal solution
4. Construct an optimal solution from computed information
Note: Dynamic programming works bottom up, while a technique called Memoization is very similar but is a recursive top-down method.
Memoization
Memoization is one way to deal with overlapping subproblems.
After computing the solution to a subproblem, store it in a table. Subsequent calls just do a table lookup.
Can modify the recursive algorithm to use memoization:
Let's do this for the Fibonacci sequence:
Fib(n) = Fib(n-1) + Fib(n-2)
Fib(2) = Fib(1) = 1
Fibonacci Memoization
Let's write a recursive algorithm to calculate Fib(n) and create an array to store subproblem results so that they will not need to be recalculated. In memoization we recurse down and fill the array bottom up.
int fibArray[24] = {0, 1, 1};   // fibArray[n] > 0 means Fib(n) is already known

int Fib(int n) {
    if (fibArray[n] > 0)
        return fibArray[n];                          // table lookup
    else
        return fibArray[n] = Fib(n-1) + Fib(n-2);    // compute once and store
}
Fibonacci Tree Traversal
[Figure: the recursion tree for Fib(5). With memoization the repeated subtrees disappear: whenever some Fib(k) is requested a second time it is already known, so the call is just a table lookup.]
The DP solution to Fibonacci
int Fib(int n)
{ // Note that this is not recursive
    int fibArray[24] = {0, 1, 1};   // remaining entries default to 0
    for (int i = 3; i <= n; i++)
        fibArray[i] = fibArray[i-1] + fibArray[i-2];
return fibArray[n];
}
Next Problem: Matrix Multiplication
Matrix-Multiply(A, B)
1. if columns[A] != rows[B]
2. then error “ incompatible dimensions”
3. else for i = 1 to rows[A]
4. do for j = 1 to columns[B]
5. do C[i, j] = 0
6. for k=1 to columns[A]
7. do C[i, j] = C[i, j]+A[i, k]*B[k, j]
8. return C
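For concreteness, the pseudocode above can be sketched in C; the fixed bound MAXD and the function name are my own choices, not from the slides:

```c
#define MAXD 10   /* maximum dimension; illustrative only */

/* Multiply the (ra x ca) matrix A by the (rb x cb) matrix B into C.
   Returns 0 on success, -1 on incompatible dimensions (line 2 above). */
int matrix_multiply(int ra, int ca, int rb, int cb,
                    int A[MAXD][MAXD], int B[MAXD][MAXD], int C[MAXD][MAXD])
{
    if (ca != rb)
        return -1;                  /* "incompatible dimensions" */
    for (int i = 0; i < ra; i++)
        for (int j = 0; j < cb; j++) {
            C[i][j] = 0;
            for (int k = 0; k < ca; k++)
                C[i][j] += A[i][k] * B[k][j];
        }
    return 0;
}
```

Multiplying a p x q matrix by a q x r matrix this way costs p·q·r scalar multiplications, which is exactly the quantity the chain problem below minimizes.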
Matrix-chain Multiplication problem
Problem: Given a chain <A1, A2, A3, . . ., An> of n matrices, where for i = 1, 2, ..., n, matrix Ai has dimension p_{i-1} x p_i, fully parenthesize the product A1A2...An in a way that minimizes the number of scalar multiplications.
Example: Suppose we have matrices of the following sizes:
10 x 20, 20 x 40, 40 x 5
Then ((10x20 * 20x40) * 40x5) yields 10*20*40 + 10*40*5 = 10,000 mults,
while (10x20 * (20x40 * 40x5)) yields 20*40*5 + 10*20*5 = 5,000 mults.
Clearly parenthesizing makes a big difference!
How many parenthesizations are there?
Exhaustively checking all possible parenthesizations is inefficient!
Let P(n) denote the number of alternative parenthesizations of a sequence of n matrices. Then P(1) = 1, and for n ≥ 2:

    P(n) = Σ_{k=1}^{n-1} P(k) · P(n-k)

The complexity of P(n) in this case is Ω(2^n).
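As a sanity check, the recurrence can be evaluated bottom up; the values it produces are the Catalan numbers 1, 1, 2, 5, 14, 42, ... A small sketch (the function name is illustrative):

```c
/* P(n): number of ways to fully parenthesize a chain of n matrices,
   filled bottom up from the recurrence above. */
long parens(int n)
{
    long P[32] = {0};   /* P[m] accumulates the recurrence for chain length m */
    if (n < 1 || n > 31)
        return 0;
    P[1] = 1;
    for (int m = 2; m <= n; m++)
        for (int k = 1; k <= m - 1; k++)
            P[m] += P[k] * P[m - k];
    return P[n];
}
```

Even for modest n the counts explode, which is why exhaustive checking is hopeless.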
Optimal Parenthesization
Major Observation: An optimal solution to an instance of the matrix chain multiplication problem contains within it optimal solutions to subproblems.
Suppose that an optimal parenthesization of Ai...Aj splits the product between Ak and Ak+1. Then the parenthesization of the prefix subchain Ai . . Ak must also be optimal! The same is true for the postfix subchain. Why is this true?
(A1 A2 ... Ak)(Ak+1 ... An)
   Optimal        Optimal
Cost(A1.. An) = Cost(A1.. Ak) + Cost(A k+1.. An) + Cost of final product
Intermediate Storage in Table
As usual in DP we will store intermediate results in a table, say M. Let M[i,j] be the minimum # of scalar multiplications needed to compute A_{i..j}.
If the optimal solution splits A_{i..j} at k, then
M[i,j] = M[i,k] + M[k+1,j] + p_{i-1}·p_k·p_j, where A_{i..j} has dimension p_{i-1} x p_j.
Since we do not know the value of k in advance, we must check the j-i possible values of k. Hence
    M[i,j] = 0                                                    if i = j
    M[i,j] = min_{i ≤ k < j} { M[i,k] + M[k+1,j] + p_{i-1}·p_k·p_j }   if i < j
Finding the Optimal Solution
    M[i,j] = 0                                                    if i = j
    M[i,j] = min_{i ≤ k < j} { M[i,k] + M[k+1,j] + p_{i-1}·p_k·p_j }   if i < j
Each time we find the minimizing k above, let's store it in an array s[i,j]. Hence s[i,j] is the value of k at which we split the product A_i A_{i+1} ... A_j to obtain an optimal parenthesization.
The Algorithm
Matrix-Chain-Order(p)
  n ← length[p] - 1
  for i ← 1 to n
      do m[i,i] ← 0
  for l ← 2 to n                      // l is the chain length
      do for i ← 1 to n - l + 1
             do j ← i + l - 1
                m[i,j] ← ∞
                for k ← i to j - 1
                    do q ← m[i,k] + m[k+1,j] + p_{i-1}·p_k·p_j
                       if q < m[i,j]
                           then m[i,j] ← q; s[i,j] ← k
  return m and s
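A C sketch of the same algorithm, 1-indexed like the pseudocode (the array bounds and names are my own choices). The test uses the classic six-matrix instance p = <30, 35, 15, 5, 10, 20, 25>, for which the minimum cost is m[1,6] = 15125:

```c
#include <limits.h>

#define N 6   /* number of matrices in the chain */

/* Fill m[i][j] (min scalar mults for A_i..A_j) and s[i][j] (optimal split).
   Indices run 1..N; matrix A_i is p[i-1] x p[i]. */
void matrix_chain_order(const int p[N + 1],
                        int m[N + 1][N + 1], int s[N + 1][N + 1])
{
    for (int i = 1; i <= N; i++)
        m[i][i] = 0;
    for (int len = 2; len <= N; len++)            /* chain length */
        for (int i = 1; i <= N - len + 1; i++) {
            int j = i + len - 1;
            m[i][j] = INT_MAX;
            for (int k = i; k <= j - 1; k++) {
                int q = m[i][k] + m[k + 1][j] + p[i - 1] * p[k] * p[j];
                if (q < m[i][j]) {
                    m[i][j] = q;
                    s[i][j] = k;
                }
            }
        }
}
```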
The m and s tables for n=6
m table (minimum scalar multiplications for A_i..A_j):

        j=1    j=2    j=3    j=4    j=5    j=6
i=1       0  15750   7875   9375  11875  15125
i=2              0   2625   4375   7125  10500
i=3                     0    750   2500   5375
i=4                            0   1000   3500
i=5                                   0   5000
i=6                                          0

s table (optimal split points):

        j=2  j=3  j=4  j=5  j=6
i=1       1    1    3    3    3
i=2            2    3    3    3
i=3                 3    3    3
i=4                      4    5
i=5                           5

matrix  dimension
A1      30 x 35
A2      35 x 15
A3      15 x 5
A4      5 x 10
A5      10 x 20
A6      20 x 25

Sample entry:

m[2,5] = min { m[2,2] + m[3,5] + p1·p2·p5 = 0 + 2500 + 35·15·20 = 13000,
               m[2,3] + m[4,5] + p1·p3·p5 = 2625 + 1000 + 35·5·20 = 7125,
               m[2,4] + m[5,5] + p1·p4·p5 = 4375 + 0 + 35·10·20 = 11375 }
       = 7125
Observations
Note that we build this array from bottom up. Once a cell is filled we never recalculate it.
Complexity is Θ(n³). Why?
What is the space complexity?
Printing the Parenthesization
PrintOptimalParens(s, i, j)
  if i = j
      then print "A"_i
      else print "("
           PrintOptimalParens(s, i, s[i, j])
           PrintOptimalParens(s, s[i, j] + 1, j)
           print ")"
The call
PrintOptimalParens(s, 1, 6)
prints the solution
((A1(A2A3))((A4A5)A6))
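A C version of PrintOptimalParens; to make the output checkable it appends to a buffer instead of printing, and the s table is hard-coded to the optimal splits for the six-matrix example (an assumption taken from the s table shown earlier):

```c
#include <stdio.h>
#include <string.h>

/* Optimal split points s[i][j] for the six-matrix example
   (row/column 0 unused). */
static const int s[7][7] = {
    {0},
    {0, 0, 1, 1, 3, 3, 3},
    {0, 0, 0, 2, 3, 3, 3},
    {0, 0, 0, 0, 3, 3, 3},
    {0, 0, 0, 0, 0, 4, 5},
    {0, 0, 0, 0, 0, 0, 5},
};

/* Append the optimal parenthesization of A_i..A_j to buf. */
void print_optimal_parens(char *buf, int i, int j)
{
    if (i == j) {
        sprintf(buf + strlen(buf), "A%d", i);
    } else {
        strcat(buf, "(");
        print_optimal_parens(buf, i, s[i][j]);
        print_optimal_parens(buf, s[i][j] + 1, j);
        strcat(buf, ")");
    }
}
```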
Optimal Binary Search Trees
The next problem is another optimization problem.
You are required to find a BST that minimizes the expected cost of accessing the tree, given the probability of access for each item. Here are examples; assume equal probabilities of 1/7.
[Figure: two BSTs on the keys do, if, while; with equal probabilities of 1/7 their expected costs are 15/7 and 13/7.]
More Examples
[Figure: three BSTs on the keys do, if, while; under the probabilities below their costs are 2.65, 2.05, and 1.9. The third tree has if at the root with do and while as its children.]
Assuming that probabilities of access are p(do)=.5, p(if)=.1, p(while)=.05 and q(0)=.15, q(1)=.1, q(2)=.05, q(3)=.05, where the q's are the gaps in the ordering
q(0) : do : q(1) : if : q(2) : while : q(3)
Cost(third tree)= 1(.1)+2(.5)+2(.05)+2(.15)+2(.1)+2(.05)+2(.05) = 1.9
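This cost calculation is easy to mechanize: each identifier contributes its level times its probability, and each failure class E_i contributes (level - 1) times its probability, as formalized on the next slides. A minimal sketch (the function name and array layout are mine):

```c
/* Expected search cost of a BST: identifier a_i at level plevel[i]
   contributes p[i] * plevel[i] probes; failure class E_i, whose external
   node sits at level qlevel[i], contributes q[i] * (qlevel[i] - 1). */
double bst_cost(int n, const double p[], const int plevel[],
                const double q[], const int qlevel[])
{
    double cost = 0.0;
    for (int i = 0; i < n; i++)
        cost += p[i] * plevel[i];
    for (int i = 0; i <= n; i++)
        cost += q[i] * (qlevel[i] - 1);
    return cost;
}
```

For the third tree (if at level 1, do and while at level 2, all four external nodes at level 3) this reproduces the cost 1.9 computed above.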
A DP Solution for the Optimal BST
Let the set of nodes in a tree be {a1 < a2 < a3 < ... < an} and define E_i, 0 ≤ i ≤ n. The class E_i contains all identifiers x such that a_i < x < a_{i+1}. Therefore the expected cost of a tree is

    Cost = Σ_{i=1}^{n} p(i)·level(a_i) + Σ_{i=0}^{n} q(i)·(level(E_i) - 1)
[Figure: a BST on a1, a2, a3 with external nodes e0, e1, e2, e3, each node labeled with its level; an external node at level 3 contributes level(e3) - 1 = 2 probes.]
Optimal Subproblems!
[Figure: a tree rooted at "if" with an optimal left subtree and an optimal right subtree.]
This problem exhibits optimal substructure. For if the above tree is optimal, then the left and right subtrees must be optimal!
Optimal substructure often implies a Dynamic Programming solution. The following algorithm is based on this observation.
Optimal Subtree Calculation
    Cost(left_tree) = Σ_{i=1}^{k-1} p(i)·level(a_i) + Σ_{i=0}^{k-1} q(i)·(level(E_i) - 1)

    Cost(right_tree) = Σ_{i=k+1}^{n} p(i)·level(a_i) + Σ_{i=k}^{n} q(i)·(level(E_i) - 1)

[Figure: tree rooted at a_k with optimal left and right subtrees.]
Note: The level of the root of each subtree is 1 in each case, i.e. levels are measured within the subtree.
Optimal Subtree
[Figure: tree rooted at a_k with optimal left and right subtrees.]
The cost of the above entire tree in terms of the optimal subtrees is

    p(k) + cost(left_tree) + w(0,k-1) + cost(right_tree) + w(k,n)

where

    w(i,j) = q(i) + Σ_{r=i+1}^{j} (q(r) + p(r))

w(0,k-1) is a fix-up for the left tree, and w(k,n) for the right (explained below).
[Figure: a BST with root a3 whose left subtree contains a1, a2 and the external nodes e0, e1, e2.]
left subtree
Assuming p(i) is the probability of a_i, the cost of the left subtree by itself is
1·p(1) + 2·p(2) + 1·q(0) + 2·q(1) + 2·q(2)
and w(0,2) in this case is
p(1) + p(2) + q(0) + q(1) + q(2)
Can you see what w(0,2) does in this case?
It adds 1 probe count to each term on the left side to account for the fact that the subtree is one level deeper!
Minimizing over the roots
Recall that the cost of the entire tree, with a_k as the root, is
p(k) + cost(left_tree) + w(0,k-1) + cost(right_tree) + w(k,n)
All we need to do then is find the value of k for which the above is minimum. So we look at each tree with a_k as the root.
Minimization formula
If we use c(i,j) to represent the cost of an optimal binary search tree t_ij containing a_{i+1},...,a_j and E_i,...,E_j, then for the tree to be optimal k must be chosen so that

    p(k) + c(left_tree) + w(0,k-1) + c(right_tree) + w(k,n)

is minimum. So

    c(0,n) = min_{1 ≤ k ≤ n} { c(0,k-1) + c(k,n) + p(k) + w(0,k-1) + w(k,n) }

and in general

    c(i,j) = min_{i < k ≤ j} { c(i,k-1) + c(k,j) + p(k) + w(i,k-1) + w(k,j) }

which simplifies to

    c(i,j) = min_{i < k ≤ j} { c(i,k-1) + c(k,j) } + w(i,j)
The Process
    c(i,j) = min_{i < k ≤ j} { c(i,k-1) + c(k,j) } + w(i,j)

The above equation can be solved bottom up: we first compute all c(i,j) with j-i=1, then those with j-i=2, and so on.
While we do this we record the root r(i,j) of each optimal tree. This allows us to reconstruct the complete optimal tree at the end.
An Example
Let n=4 and {a1,a2,a3,a4} be the set of nodes in a tree. Also let p(1:4) = (3,3,1,1) and q(0:4) = (2,3,1,1,1). Here we have multiplied the p's and the q's by 16 for simplicity. Note that w(i,i) = q(i) and c(i,i) = r(i,i) = 0.
w(0,1) = p(1) + q(1) + w(0,0) = 8
c(0,1) = w(0,1) + min{c(0,0)+c(1,1)} = 8
r(0,1) = 1
w(1,2) = p(2) + q(2) + w(1,1) = 7
c(1,2) = w(1,2) + min{c(1,1)+c(2,2)} = 7
r(1,2) = 2
w(2,3) = p(3) + q(3) + w(2,2) = 3
c(2,3) = w(2,3) + min{c(2,2)+c(3,3)} = 3
r(2,3) = 3
w(3,4) = p(4) + q(4) + w(3,3) = 3
c(3,4) = w(3,4) + min{c(3,3)+c(4,4)} = 3
r(3,4) = 4
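The whole table can be generated by a short C program (0-based i,j indices as in the slides; the function name and the tie-breaking rule, preferring the smaller k, are my own choices):

```c
#define N 4   /* number of keys a1..a4 */

/* Fill w, c, r diagonal by diagonal using
   w(i,j) = w(i,j-1) + p(j) + q(j) and
   c(i,j) = w(i,j) + min over i < k <= j of { c(i,k-1) + c(k,j) }. */
void obst(const int p[N + 1], const int q[N + 1],
          int w[N + 1][N + 1], int c[N + 1][N + 1], int r[N + 1][N + 1])
{
    for (int i = 0; i <= N; i++) {
        w[i][i] = q[i];
        c[i][i] = 0;
        r[i][i] = 0;
    }
    for (int d = 1; d <= N; d++)               /* d = j - i */
        for (int i = 0; i + d <= N; i++) {
            int j = i + d;
            w[i][j] = w[i][j - 1] + p[j] + q[j];
            c[i][j] = -1;                      /* "not yet set" marker */
            for (int k = i + 1; k <= j; k++) {
                int t = c[i][k - 1] + c[k][j];
                if (c[i][j] < 0 || t < c[i][j]) {
                    c[i][j] = t;
                    r[i][j] = k;
                }
            }
            c[i][j] += w[i][j];
        }
}
```

Breaking ties toward the smaller k reproduces r(2,4) = 3 from the table below.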
Computation Table
j-i=0:  w(0,0)=2   c(0,0)=0   r(0,0)=0 | w(1,1)=3   c(1,1)=0   r(1,1)=0 | w(2,2)=1  c(2,2)=0  r(2,2)=0 | w(3,3)=1  c(3,3)=0  r(3,3)=0 | w(4,4)=1  c(4,4)=0  r(4,4)=0
j-i=1:  w(0,1)=8   c(0,1)=8   r(0,1)=1 | w(1,2)=7   c(1,2)=7   r(1,2)=2 | w(2,3)=3  c(2,3)=3  r(2,3)=3 | w(3,4)=3  c(3,4)=3  r(3,4)=4
j-i=2:  w(0,2)=12  c(0,2)=19  r(0,2)=1 | w(1,3)=9   c(1,3)=12  r(1,3)=2 | w(2,4)=5  c(2,4)=8  r(2,4)=3
j-i=3:  w(0,3)=14  c(0,3)=25  r(0,3)=2 | w(1,4)=11  c(1,4)=19  r(1,4)=2
j-i=4:  w(0,4)=16  c(0,4)=32  r(0,4)=2

Recall that t_ij = {a_{i+1},...,a_j}. Reading the roots back: r(0,4)=2, so t04 has root a2 with subtrees t01 and t24; r(2,4)=3, so t24 has root a3 with subtrees t22 and t34; r(3,4)=4, so t34 is just a4. The optimal tree:

      a2
     /  \
   a1    a3
           \
            a4
Complexity of OBST Algorithm
We compute c(i,j) for j-i = 1, 2, 3, ..., n, in that order.
When j-i = m there are n-m+1 values c(i,j) to compute, each of which requires finding the minimum of m quantities. Hence we do m(n-m+1) = nm - m² + m operations for the m-th diagonal, so the total time is

    Σ_{m=1}^{n} (nm - m² + m) = O(n³)

D.E. Knuth has shown that if you limit the search to the range r(i,j-1) ≤ k ≤ r(i+1,j), the computing time becomes O(n²).