Map Reduce in Hazelcast - Hazelcast User Group London Version

Date post: 19-Aug-2015
Upload: christoph-engelbert
Transcript
Page 1: Map Reduce in Hazelcast - Hazelcast User Group London Version

BIG DATA - FAST DATA
USING MAPREDUCE IN HAZELCAST

Source:

www.hazelcast.com

Page 2: Map Reduce in Hazelcast - Hazelcast User Group London Version

Christoph Engelbert (@noctarius2k)
8+ years of Java Weirdoness
Performance, GC, traffic topics
Apache Committer
Gaming, Travel Management, ...
CastMapR MapReduce for Hazelcast 3

Page 3: Map Reduce in Hazelcast - Hazelcast User Group London Version

TOPICS
Hazelcast
Distributed Computing
Map & Reduce
Demonstration
Questions

Page 4: Map Reduce in Hazelcast - Hazelcast User Group London Version

HAZELCAST
PICKIN' DIAMONDS

Page 5: Map Reduce in Hazelcast - Hazelcast User Group London Version

WHAT IS HAZELCAST?
In-Memory Data-Grid
Data Partitioning (Sharding)
Java Collections Implementation
Distributed Computing Platform

Page 6: Map Reduce in Hazelcast - Hazelcast User Group London Version

WHY HAZELCAST?
Automatic Partitioning
Fault Tolerance
Sync / Async Backups
Fully Distributed
In-Memory for Highest Speed

Page 7: Map Reduce in Hazelcast - Hazelcast User Group London Version

WHY HAZELCAST?

Page 8: Map Reduce in Hazelcast - Hazelcast User Group London Version

WHY DISTRIBUTED COMPUTING?

Page 9: Map Reduce in Hazelcast - Hazelcast User Group London Version

WHY IN-MEMORY COMPUTING?

Page 10: Map Reduce in Hazelcast - Hazelcast User Group London Version

TREND OF PRICES

Page 11: Map Reduce in Hazelcast - Hazelcast User Group London Version

SPEED DIFFERENCE

Page 12: Map Reduce in Hazelcast - Hazelcast User Group London Version

DISTRIBUTED COMPUTING

OR

MULTICORE CPU ON STEROIDS

Page 13: Map Reduce in Hazelcast - Hazelcast User Group London Version

THE IDEA OF DISTRIBUTED COMPUTING

Page 14: Map Reduce in Hazelcast - Hazelcast User Group London Version

THE BEGINNING

Page 15: Map Reduce in Hazelcast - Hazelcast User Group London Version

MULTICORE IS NOT NEW

Page 16: Map Reduce in Hazelcast - Hazelcast User Group London Version

CLUSTER IT

Page 17: Map Reduce in Hazelcast - Hazelcast User Group London Version

SUPER COMPUTER

Page 18: Map Reduce in Hazelcast - Hazelcast User Group London Version

CLOUD COMPUTING

Page 19: Map Reduce in Hazelcast - Hazelcast User Group London Version

MAP & REDUCE
THE BLACK MAGIC FROM PLANET GOOGLE

Page 20: Map Reduce in Hazelcast - Hazelcast User Group London Version

USE CASES
Log Analysis
Data Querying
Aggregation
Distributed Sort
ETL (Extract Transform Load)
and more...

Page 21: Map Reduce in Hazelcast - Hazelcast User Group London Version

BASIC STEPS
Read
Map / Transform
Reduce

Page 22: Map Reduce in Hazelcast - Hazelcast User Group London Version

FULL STEPS
Read
Map / Transform
Combine
Group / Shuffle
Reduce
Collate
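
The six steps can be walked through with a word count that runs in a single JVM. This is only an illustrative sketch using plain Java collections, not Hazelcast's MapReduce API: the class name FullStepsSketch, its method names, and the two in-memory "nodes" are assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical single-JVM illustration of the six steps; not Hazelcast's API.
class FullStepsSketch {

    // Read + Map + Combine: a "node" tokenizes its own documents and
    // pre-sums the counts locally, so fewer pairs have to be shuffled.
    static Map<String, Integer> mapAndCombine(List<String> documents) {
        Map<String, Integer> local = new TreeMap<>();
        for (String document : documents) {                       // Read
            for (String word : document.toLowerCase().split("\\W+")) {
                if (!word.isEmpty()) {
                    // Map emits (word, 1); Combine adds it to the local sum.
                    local.merge(word, 1, Integer::sum);
                }
            }
        }
        return local;
    }

    // Group / Shuffle: collect all intermediate values that share a key.
    static Map<String, List<Integer>> shuffle(List<Map<String, Integer>> perNode) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map<String, Integer> local : perNode) {
            local.forEach((word, count) ->
                    grouped.computeIfAbsent(word, k -> new ArrayList<>()).add(count));
        }
        return grouped;
    }

    // Reduce: fold the grouped values of each key into one result.
    static Map<String, Integer> reduce(Map<String, List<Integer>> grouped) {
        Map<String, Integer> reduced = new TreeMap<>();
        grouped.forEach((word, counts) ->
                reduced.put(word, counts.stream().mapToInt(Integer::intValue).sum()));
        return reduced;
    }

    // Collate: derive one final value from all reduced results (total word count).
    static int collate(Map<String, Integer> reduced) {
        return reduced.values().stream().mapToInt(Integer::intValue).sum();
    }

    public static void main(String[] args) {
        List<String> nodeA = List.of("to be or not to be");
        List<String> nodeB = List.of("to do is to be");
        Map<String, Integer> counts =
                reduce(shuffle(List.of(mapAndCombine(nodeA), mapAndCombine(nodeB))));
        System.out.println(counts);          // {be=3, do=1, is=1, not=1, or=1, to=4}
        System.out.println(collate(counts)); // 11
    }
}
```

In a real cluster the map and combine work would run on the member that owns the data, and only the combined pairs would cross the network during the shuffle.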

Page 23: Map Reduce in Hazelcast - Hazelcast User Group London Version

MAPREDUCE WORKFLOW

Page 24: Map Reduce in Hazelcast - Hazelcast User Group London Version

SOME PSEUDO CODE (1/3)

MAPPING

Data is mapped / transformed into a set of key-value pairs.

map( key:String, document:String ):Void ->
    for each w:Word in document:
        emit( w, 1 )
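
As a rough Java counterpart to the mapping pseudo code, the sketch below returns the emitted pairs as a list instead of calling an emit callback. The class name MappingSketch, the whitespace tokenization, and the "doc-1" key are assumptions for illustration only.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical stand-alone mapper; the returned list stands in for emit().
class MappingSketch {

    // Mirrors map( key, document ): one (word, 1) pair per word occurrence.
    // The document key is not needed for counting words, as in the pseudo code.
    static List<Map.Entry<String, Integer>> map(String key, String document) {
        List<Map.Entry<String, Integer>> emitted = new ArrayList<>();
        for (String word : document.split("\\s+")) {
            if (!word.isEmpty()) {
                emitted.add(Map.entry(word, 1)); // emit( w, 1 )
            }
        }
        return emitted;
    }

    public static void main(String[] args) {
        // prints [to=1, be=1, or=1, not=1, to=1, be=1]
        System.out.println(map("doc-1", "to be or not to be"));
    }
}
```

Note that the mapper does no counting itself: repeated words simply produce repeated pairs, and summing is left to the later steps.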

Page 25: Map Reduce in Hazelcast - Hazelcast User Group London Version

SOME PSEUDO CODE (2/3)

COMBINING

Multiple values are combined into an intermediate result to save network traffic.

combine( word:Word, counts:List[Int] ):Void ->
    emit( word, sum( counts ) )
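
The traffic saving can be made visible with a small Java sketch that pre-aggregates the pairs emitted on one node. The class name CombiningSketch and the sample pairs are assumptions; the point is only that six mapped pairs shrink to four partial sums before anything is sent on.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical node-local combiner; not Hazelcast's Combiner class.
class CombiningSketch {

    // Mirrors combine( word, counts ): sums the counts emitted locally per word.
    static Map<String, Integer> combine(List<Map.Entry<String, Integer>> emitted) {
        Map<String, Integer> partial = new LinkedHashMap<>();
        for (Map.Entry<String, Integer> pair : emitted) {
            partial.merge(pair.getKey(), pair.getValue(), Integer::sum);
        }
        return partial;
    }

    public static void main(String[] args) {
        List<Map.Entry<String, Integer>> emitted = List.of(
                Map.entry("to", 1), Map.entry("be", 1), Map.entry("or", 1),
                Map.entry("not", 1), Map.entry("to", 1), Map.entry("be", 1));
        // prints {to=2, be=2, or=1, not=1}
        System.out.println(combine(emitted));
    }
}
```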

Page 26: Map Reduce in Hazelcast - Hazelcast User Group London Version

SOME PSEUDO CODE (3/3)

REDUCING

Values are reduced / aggregated to the requested result.

reduce( word:Word, counts:List[Int] ):Int ->
    return sum( counts )
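
A possible Java rendering of the reducing step follows. It assumes the counts for one word have already been grouped into a list, for example the partial sums coming from two nodes; the class name ReducingSketch is made up for the example.

```java
import java.util.List;

// Hypothetical stand-alone reducer; not Hazelcast's Reducer class.
class ReducingSketch {

    // Mirrors reduce( word, counts ): sums all counts collected for one word.
    static int reduce(String word, List<Integer> counts) {
        int sum = 0;
        for (int count : counts) {
            sum += count;
        }
        return sum;
    }

    public static void main(String[] args) {
        // Partial sums 2 and 2 for "to", as they might arrive from two nodes.
        System.out.println(reduce("to", List.of(2, 2))); // prints 4
    }
}
```

Because addition is associative and commutative, the same sum works for both combining and reducing, which is why the two pseudo code steps look so similar.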

Page 27: Map Reduce in Hazelcast - Hazelcast User Group London Version

FOR MATHEMATICIANS

Process: (K × V)* → (L × W)* ⇒ [(l_1, w_1), …, (l_m, w_m)]

Mapping: (K × V) → (L × W)* ⇒ (k, v) → [(l_1, w_1), …, (l_n, w_n)]

Reducing: L × W* → X* ⇒ (l, [w_1, …, w_n]) → [x_1, …, x_n]
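
The type letters map naturally onto Java generics. The sketch below is one way to write the Mapping and Reducing signatures as interfaces and chain them with a grouping step. The names MapFn, ReduceFn and TypedProcessSketch are hypothetical and unrelated to Hazelcast's own Mapper and Reducer types, and the result is returned as a map from l to its reduced values, which is a choice made for the example.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Mapping: (K × V) → (L × W)*
interface MapFn<K, V, L, W> {
    List<Map.Entry<L, W>> map(K key, V value);
}

// Reducing: L × W* → X*
interface ReduceFn<L, W, X> {
    List<X> reduce(L key, List<W> values);
}

class TypedProcessSketch {

    // Maps every (k, v), groups the emitted w by l, then reduces each group.
    static <K, V, L, W, X> Map<L, List<X>> process(
            List<Map.Entry<K, V>> input, MapFn<K, V, L, W> mapper, ReduceFn<L, W, X> reducer) {
        Map<L, List<W>> grouped = new LinkedHashMap<>();
        for (Map.Entry<K, V> entry : input) {
            for (Map.Entry<L, W> out : mapper.map(entry.getKey(), entry.getValue())) {
                grouped.computeIfAbsent(out.getKey(), k -> new ArrayList<>()).add(out.getValue());
            }
        }
        Map<L, List<X>> result = new LinkedHashMap<>();
        grouped.forEach((l, ws) -> result.put(l, reducer.reduce(l, ws)));
        return result;
    }

    public static void main(String[] args) {
        MapFn<String, String, String, Integer> wordMapper = (key, doc) -> {
            List<Map.Entry<String, Integer>> out = new ArrayList<>();
            for (String w : doc.split("\\s+")) {
                out.add(Map.entry(w, 1));
            }
            return out;
        };
        ReduceFn<String, Integer, Integer> summer = (word, counts) ->
                List.of(counts.stream().mapToInt(Integer::intValue).sum());
        // prints {a=[2], b=[1]}
        System.out.println(process(List.of(Map.entry("d1", "a b a")), wordMapper, summer));
    }
}
```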

Page 28: Map Reduce in Hazelcast - Hazelcast User Group London Version

MAPREDUCE PROGRAMS IN GOOGLE SOURCE TREE

Page 29: Map Reduce in Hazelcast - Hazelcast User Group London Version

DEMONSTRATION

Page 30: Map Reduce in Hazelcast - Hazelcast User Group London Version

@noctarius2k
@hazelcast

http://www.sourceprojects.com
http://github.com/noctarius

THANK YOU!
ANY QUESTIONS?

Images: All images are licensed under Creative Commons
