
Mapping Multiple Multivariate Gaussian Random Number Generators on an FPGA

Chalermpol Saiprasert, Christos-S. Bouganis and George A. Constantinides


Outline

• Monte Carlo Simulation

• Multivariate Gaussian Random Number Generator (MVGRNG)

• Objective

• Optimization algorithm

• Proposed framework – Hardware architecture

• Experimental Results

• Conclusions


Introduction

• Monte Carlo simulation

» Mathematical technique

» Repeated random sampling

» Evaluate non-deterministic processes

• Prerequisite for MC simulation: random numbers

• Multivariate Gaussian distribution captures many correlated variables

• Acceleration of MC using FPGA

» Speed up simulations

» Optimization of MVGRNG


Objective

• Existing approaches focus only on single-distribution MVGRNGs

• This work: mapping of multiple multivariate Gaussian distributions

• Example: optimization of many financial portfolios

» Represented by many multivariate Gaussian distributions

• An MVGRNG is usually part of a larger application

» Resource usage is CRUCIAL

• Efficient resource sharing


Generating Multivariate Gaussian Random Numbers

• Mean (m) and Covariance matrix (Σ)

• OBJECTIVE: approximate Σ

• Eigenvalue decomposition using SVD

• Using any number K of decomposition levels

$\Sigma = U \Lambda U^T = U \Lambda^{1/2} \Lambda^{1/2} U^T$

$x = U \Lambda^{1/2} z + m, \quad z \sim N(0, I)$

$x \approx \sum_{i=1}^{K} c_i z_i + m$
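As an illustration of this construction, here is a minimal NumPy sketch (a software model, not the paper's hardware; the function name and the eigenvalue clipping are additions for robustness):

```python
import numpy as np

def mvg_samples(mean, cov, n_samples, rng=None):
    """Draw multivariate Gaussian samples via eigenvalue decomposition:
    x = U @ Lambda^{1/2} @ z + m, with z ~ N(0, I)."""
    rng = np.random.default_rng() if rng is None else rng
    lam, U = np.linalg.eigh(cov)            # cov = U @ diag(lam) @ U.T
    lam = np.clip(lam, 0.0, None)           # guard against tiny negative eigenvalues
    A = U @ np.diag(np.sqrt(lam))           # A = U Lambda^{1/2}
    z = rng.standard_normal((cov.shape[0], n_samples))
    return A @ z + mean[:, None]            # one sample per column

# Example: 3x3 correlation matrix, 100,000 vectors
m = np.zeros(3)
S = np.array([[1.0, 0.5, 0.2],
              [0.5, 1.0, 0.3],
              [0.2, 0.3, 1.0]])
x = mvg_samples(m, S, 100_000)
print(np.round(np.cov(x), 2))               # should be close to S
```

Truncating $A = U\Lambda^{1/2}$ to its K dominant columns corresponds to using K levels of decomposition.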

Proposed Algorithm

[Flowchart: the proposed algorithm]

Input Matrices Σ1, Σ2, …, Σm
→ per distribution: Approximation Optimization → Approximation Error Calculation
→ Calculate Overall Approximation Error
→ Calculate Remainder of Target Matrices
→ Check for Termination Constraint: No → iterate; Yes → output Vector Coefficients c

Annotations:

• Approximate Σ for each distribution

• Target redundancies between ALL input distributions

• Exploit similarities in PRECISION REQUIREMENTS

• Select the appropriate precision to minimize the approximation error for all distributions

• Distinct coefficients for each distribution

• The algorithm takes any number M of distributions
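As a schematic sketch of this loop, the following Python assumes that each iteration extracts the dominant eigen-component of every remainder matrix and that precision selection is a simple search over fixed-point word lengths; `quantize` and `approximate_all` are illustrative names, and the real algorithm additionally shares coefficients and hardware across distributions:

```python
import numpy as np

def quantize(v, bits):
    """Illustrative fixed-point model: round to `bits` fractional bits."""
    scale = 2.0 ** bits
    return np.round(v * scale) / scale

def approximate_all(covs, precisions, max_levels, tol):
    """Schematic version of the loop: per level, extract the dominant
    eigen-component of each remainder matrix, pick the precision that
    minimizes the resulting error, and subtract the contribution."""
    remainders = [S.astype(float).copy() for S in covs]
    coeffs = [[] for _ in covs]
    for _ in range(max_levels):
        for j, R in enumerate(remainders):
            lam, U = np.linalg.eigh(R)
            c = np.sqrt(max(lam[-1], 0.0)) * U[:, -1]     # dominant component
            # Approximation Optimization: choose precision with lowest error
            best = min(precisions, key=lambda b: np.linalg.norm(
                R - np.outer(quantize(c, b), quantize(c, b))))
            cq = quantize(c, best)
            coeffs[j].append(cq)                          # vector coefficient
            remainders[j] = R - np.outer(cq, cq)          # remainder of target
        overall = sum(np.linalg.norm(R, 'fro') ** 2 for R in remainders)
        if overall < tol:                                 # termination constraint
            break
    return coeffs
```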

Error Estimation Model

• Mean square error

• Approximation error for each distribution

$\text{Error} = \frac{1}{N^2} \left\| \Sigma - \hat{\Sigma} \right\|^2$

where $\Sigma$ is the actual matrix, $\hat{\Sigma}$ the approximated matrix, and $N$ the matrix order.
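In code, this metric is a one-liner (assuming the Frobenius norm, consistent with the mean-square-error description above):

```python
import numpy as np

def approximation_error(actual, approx):
    """Per-distribution mean square error, normalized by matrix order N
    (Frobenius norm assumed)."""
    N = actual.shape[0]
    return np.linalg.norm(actual - approx, 'fro') ** 2 / N ** 2
```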

[Figure: MVGRNG datapath. A GRNG feeds a chain of computational blocks CB1, CB2, CB3, each a multiply-add stage with its own coefficients c_i per distribution; Gaussian inputs z1/z2 produce output vectors x1/x2.]

Hardware Architecture

Constructed from K computational blocks (CBs), where K = number of decomposition levels

Mixed precisions in the datapath

LUT-based implementation

Precision of the adder path = maximum precision over all CBs


Hardware Architecture

Two multivariate Gaussian distributions (3x3 correlation matrices)

Using 3 levels of decomposition (K = 3)

GRNG with a different seed for each input distribution (completely independent streams)

x1 produced after K clock cycles; x2 produced after 2K clock cycles (see the sketch below)

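A minimal software model of this time-multiplexed datapath is sketched below; the round-robin interleaving of the two distributions is an assumption inferred from the stated K-cycle and 2K-cycle latencies, and `interleaved_mvgrng` is an illustrative name:

```python
import numpy as np

def interleaved_mvgrng(coeffs_per_dist, n_vectors, rng=None):
    """Software model of the shared datapath: each output vector is the
    accumulation of c_i * z_i over K cycles (one CB per level), with the
    distributions served in round-robin order. Not RTL; the interleaving
    is assumed, consistent with the K- and 2K-cycle latencies."""
    rng = np.random.default_rng() if rng is None else rng
    outputs = {d: [] for d in range(len(coeffs_per_dist))}
    for n in range(n_vectors):
        d = n % len(coeffs_per_dist)        # alternate x1, x2, x1, ...
        acc = np.zeros_like(coeffs_per_dist[d][0], dtype=float)
        for c in coeffs_per_dist[d]:        # K decomposition levels
            z = rng.standard_normal()       # GRNG sample (separate seed per dist in HW)
            acc = acc + c * z               # multiply-add, as in each CB
        outputs[d].append(acc)
    return outputs
```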

Experiment I

Accuracy of the Error and Resource Estimation Models

Accuracy of the Error Estimation Model

[Plot: empirical vs. estimated approximation error of the correlation matrices; both axes log-scale, roughly 10^-15 to 10^0.]

Accuracy of the Resource Estimation Model

[Plot: empirical vs. estimated resource utilization (LUTs), roughly 0 to 3500 LUTs.]

Experiment II

Comparison with Existing Approaches

Experimental Setup

• Approaches under consideration

» [Thomas and Luk 2008]

» Our previous work [Saiprasert et al 2009]

• Throughput of the existing approaches adjusted to the same level

» For a fair comparison

• For [Saiprasert et al 2009], M consecutive levels are forced to use the same CB

» M = number of input distributions


[Figure: forced sharing in [Saiprasert et al 2009]: coefficient pairs (a1, a2) and (a3, a4) mapped onto shared CBs (CB1, CB2, CB3) to produce x.]

Comparison of All Approaches

                              [Thomas08]    [Saiprasert09]    This work
Architecture                  DSP           LUTs              LUTs
Precision                     Fixed         Mixed             Mixed
Optimization across all
input distributions           No            No                Yes

Hardware sharing:

» [Thomas08]: reuse the same hardware for all input matrices

» [Saiprasert09]: force M consecutive decomposition levels to share the same hardware

» This work: optimized precisions and coefficients for all input distributions

Experimental Setup

• 4 sets of input correlation matrices

» Set I: Four 2x2 matrices

» Set II: Four 4x4 matrices

» Set III: Four 6x6 matrices

» Set IV: Two 2x2 and two 4x4 matrices

• One MVGRNG optimized for each set

• 100,000 vectors generated for each set

Set I Matrices (2x2)

[Plot: approximation error of the correlation matrix vs. resource utilization (LUTs) for Set I, comparing the Proposed Approach, the extension of our previous work, and [Thomas and Luk 08] with 18-bit and floating-point GRNGs. Marked design points: 18-bit upstream with double-precision hardware; double-precision upstream with double-precision hardware; 18-bit upstream with 18-bit hardware; 18-bit upstream with mixed-precision hardware.]

Set II Matrices (4x4)

[Plot: approximation error of the correlation matrix vs. resource utilization (LUTs) for Set II, comparing the Proposed Approach, the extension of our previous work, and [Thomas and Luk 08] with 18-bit and floating-point GRNGs.]

Set III Matrices (6x6)

[Plot: approximation error of the correlation matrix vs. resource utilization (LUTs) for Set III, same four approaches; annotations of 50% and 38%.]

Set IV Matrices (Mixed Matrix Orders)

[Plot: approximation error of the correlation matrix vs. resource utilization (LUTs) for Set IV, same four approaches.]

Conclusions

• A novel approach to MVGRNG design for multiple distributions

• One generator optimized for all input distributions

• Effective resource-sharing algorithm

• Exploits similarities in precision requirements

• Up to 50% reduction in resource usage

• No penalty on the quality of the generated data


THANK YOU FOR YOUR ATTENTION