is hereby granted to
to certify that he/she has completed to satisfaction
The CCDH Exam
Cloudera, Inc. 210 Portage Avenue Palo Alto, CA 94306 www.cloudera.com
___________________________ Date Granted
Test Date:
___________________________ Authorized Signature
Mansour Raad
March 2, 2012
Mar 09, 2012
is hereby granted to
to certify that he/she has completed to satisfaction
The CCDH Exam
Cloudera, Inc. 210 Portage Avenue Palo Alto, CA 94306 www.cloudera.com
___________________________ Date Granted
Test Date:
__ __________________________Authorized Signature
March 2, 2012
Mar 09, 2012
Cooperative Processing
g.beginGradientFill(GradientType.RADIAL,[ 0xFF0000, 0x0000FF ], ...);g.drawRect(x, y, 200, 200);g.endFill();bitmapData.draw(shape, null, null, BlendMode.SCREEN, null, true);
Great Story Telling Tool !
Data Democratizer!Beyond Dashboard!Can have best ML, best model, best team, all useless if u cannot tell a story of results!
Volume
• Very Large Amount
• More Parameters
• Multi Node
• Storage
• Processing -Simple math is more effective with large parameters-Scalable storage-Program to data rather data to program
Velocity
• Rate of digital flow
• Streaming
• Event Processing
• Feedback Loop
• Recommendations - Clicks, locations- Mobile / Smartphones- Last 5 min snapshot of traffic is no good when crossing the street- CERN
HDFS
• Multi-TB Storage
• Inexpensive Nodes
• Fault Tolerant
• Concurrent Reading
• Brings Programs To Data
MapReduce
• Software Framework
• Parallel Processing
• Jobs Executed on HDFS
• Java / Python / C++
• Spatial Libraries
MapReduce Job
input | map | sort | reduce | output
Java Jars packaged and sent to data nodes for execution
Spatial Storage
• CSV,TSV Lat,Lon
• Esri JSON format
• {geometry:{x:-123,y:45},attributes:{}}
• Custom
The “Zoo”
• Pig - high level language for hadoop
• HBase - real/time random access to hdfs
• Flume - streaming data flow
• Mahout - machine learning
• Zookeeper - distributed state management
Processing Evolution
• Transactional - Batch
• Operational - Dashboard
• Analytical - Exploratory
• Intelligent - Real/Time, predictive
Fixed Schema
Variable Schema
“[T]here are known knowns; there are things we know that we know.There are known unknowns; that is to say there are things that, we now know we don't know.But there are also unknown unknowns – there are things we do not know we don't know.”
—United States Secretary of Defense, Donald Rumsfeld
Date Event Location
March 21, 2013Esri DC Meet Up – Big Data & Location Analytics Washington, DC
April 18, 2013 Esri DC Meet Up Washington, DC
March 23–26, 2013 Esri Partner Conference Palm Springs, CA
March 25–28, 2013 Esri Developer Summit Palm Springs, CA
July 6–9, 2013 Esri National Security Summit San Diego, CA
July 8–12, 2013 Esri International User Conference San Diego, CA
Upcoming Events