Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
PREDICTIVE ANALYTICS AND BIG DATA
Rachel Hawley
Solutions Architect
Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
ANALYTICS SAS® BUSINESS ANALYTICS FRAMEWORK
Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
SAS AND YARN
http://blogs.sas.com/content/datamanagement/2014/08/20/sas-high-performance-capabilities-with-hadoop-yarn/
“With this milestone, we warmly welcome SAS LASR to a growing community of YARN Ready applications.” – Arun Murthy
Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
TWO STARTING
POINTS
NOT MUTUALLY EXCLUSIVE… BUT OFTEN NOT SEEN TOGETHER!
Hadoop as a Data Platform(standalone or as part of a broader ecosystem)
Hadoop as a core component of the next
generation of BI and Analytics
.. to support innovative business usage.. to support an IT Transformation
Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
SAS & HADOOP HOW?
SAS & Hadoop intersect in many ways:
SAS can treat Hadoop just as any other data source, pulling data
FROM Hadoop, when it is most convenient;
SAS can work WITH Hadoop, lifting data in a purpose-built
advanced analytics in-memory environment;
SAS can work directly IN Hadoop, leveraging the distributed
processing capabilities of Hadoop.
Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
SAS AND HADOOP HOW DOES IT WORK?
Data Store
SAS
Traditional SAS
THESE APPROACHES ARE COMPLEMENTARY & CAN BE COMBINED FOR MAXIMUM EFFECT
Data Store
SAS
Data
In-Memory
MemoryData
Data Store
SAS
Data
In-Database
Copyr i g ht © 2014, SAS Ins t i tu t e Inc . A l l r ights reser ve d .
FROM + WITH + IN HADOOP IS NOT AN OR, BUT AN AND
Prepare data IN
Hadoop for
analytics
Move it FROM Hadoop
into a SAS server
Deploy and manage
model score code
IN Hadoop
HPA temporarily lifts
data IN to memory for
analytics at scale
Model data at scale in-
memory WITH visual
statistics and in-memory
statistics
TIP
Use the right
technique for
what needs to
be done!
Explore data at scale, in-
memory WITH visual
analytics