+ All Categories
Home > Documents > Gaussian Elimination

Gaussian Elimination

Date post: 12-Feb-2016
Category:
Upload: cruz
View: 36 times
Download: 0 times
Share this document with a friend
Description:
Gaussian Elimination. By Yequn Zhang, Yu Zhang. Contents. Introduction Problem Analysis Proposed Algorithm Evaluation. Contents. Introduction Problem Analysis Proposed Algorithm Evaluation. Gaus sian Elimination. Forward Elimination Back Substitution. Contents. Introduction - PowerPoint PPT Presentation
Popular Tags:
27
Gaussian Elimination By Yequn Zhang, Yu Zhang
Transcript
Page 1: Gaussian Elimination

Gaussian EliminationBy

Yequn Zhang, Yu Zhang

Page 2: Gaussian Elimination

ContentsIntroductionProblem AnalysisProposed AlgorithmEvaluation

Page 3: Gaussian Elimination

ContentsIntroductionProblem AnalysisProposed AlgorithmEvaluation

Page 4: Gaussian Elimination

Gaussian EliminationForward EliminationBack Substitution

Page 5: Gaussian Elimination

ContentsIntroductionProblem AnalysisProposed AlgorithmEvaluation

Page 6: Gaussian Elimination

Problem AnalysisData size used by kernels changes continuouslyDifficult to find an appropriate block size to avoid divergenceBlock-based approach

Assign a certain part of computation running on CPU-leave the irregularity to cpu

Manually make the data size changes with a step of block sizeBlock number per grid is easy to set

Page 7: Gaussian Elimination

ContentsIntroductionProblem AnalysisProposed AlgorithmEvaluation

Page 8: Gaussian Elimination

Forward EliminationA block-based approachTry to avoid divergenceTry to use GPUTry to be fine-grained

Page 9: Gaussian Elimination

K 1

Find Max Row

Page 10: Gaussian Elimination

Swapcpu

Now start toeliminate the block of data on cpu

Page 11: Gaussian Elimination

Calculatecoefficients

Page 12: Gaussian Elimination

Eliminationon CPU

Page 13: Gaussian Elimination

K 1

Calculate Coefficients

Page 14: Gaussian Elimination

K2K 2

Eliminationon CPU

Page 15: Gaussian Elimination

Swap on GPU

K3

K 3

Page 16: Gaussian Elimination

K4Elimination on GPU

K 4

Page 17: Gaussian Elimination

K5Eliminationon GPU

K 5

Page 18: Gaussian Elimination

Intra-block loop

Page 19: Gaussian Elimination

Inter-block loop

Page 20: Gaussian Elimination

Last inter-block loopprocessedon CPU

Page 21: Gaussian Elimination

Back SubstitutionLaunch kernel when number of coefficients per row

exceeds four block size (64*4=256)A fine-grained way, use a similar way as forward

elimination, part on CPU and part on GPU

Page 22: Gaussian Elimination

ContentsIntroductionProblem AnalysisProposed AlgorithmEvaluation

Page 23: Gaussian Elimination

Block size effect

Page 24: Gaussian Elimination

The contribution of swap and find max rowIs it necessary to implement every part on GPU?

Page 25: Gaussian Elimination

Performance breakdownContribution of each part to the total performance,

including kernels as well as CPU part

Page 26: Gaussian Elimination

Speedup

Page 27: Gaussian Elimination

Questions ?


Recommended