Post on 14-Feb-2022
transcript
Cisco Confidential 1 © 2010 Cisco and/or its affiliates. All rights reserved.
Big Data and Its Implications
Sen Chandaka and Jing Wang
Cisco, The Netherlands
2 © 2012 Cisco and/or its affiliates. All rights reserved.
“data sets whose size is beyond the ability of commonly
used software tools to capture, manage, and process the
data within a tolerable elapsed time.” – Prof. Lev
Manovich, University of California-San Diego
"Big Data are high-volume, high-velocity, and/or high-
variety information assets that require new forms of
processing to enable enhanced decision making, insight
discovery and process optimization.” - Gartner
1 Petabyte = 20,000,000
4 © 2012 Cisco and/or its affiliates. All rights reserved. From Flickr [some rights reserved by Akakumo]
5 © 2012 Cisco and/or its affiliates. All rights reserved.
Internet and Social media Business Transaction
Internet of Things / M2M Multimedia Content
6 © 2012 Cisco and/or its affiliates. All rights reserved.
50 Billion Connected Devices
1 Million Applications
1 Zettabyte of Content
6 Million Security Threats
7 © 2012 Cisco and/or its affiliates. All rights reserved.
Big Data – Overview
What is Big Data
Importance of Big data
The Hadoop Framework
Vendors in Big Data
Big Data Efforts at Cisco
Data volume is growing 40% per year and will increase 44 times between 2010 and 2020 [McKinsey Global Institute]
8 © 2012 Cisco and/or its affiliates. All rights reserved.
V o l u m e
V e l o c i t y
V a r i e t y
V a l u e
9 © 2012 Cisco and/or its affiliates. All rights reserved. “Big data: The next frontier for innovation, competition, and productivity”, McKinsey Global Institute, June 2011
12 © 2012 Cisco and/or its affiliates. All rights reserved.
Source: US Bureau of Labor Statistics; McKinsey Global Institute analysis
13 © 2012 Cisco and/or its affiliates. All rights reserved.
Integrated Data Warehouse ~80TB in Prod. Dual Active environment with CoD and online archive environment
Analytics Tier: DM’ss and environments
housing relevant data extracts, potential tiered
DW archive space
Common Logging, source data unload platform, and
repository for other unstructured and semi-structured data sources
Application Data Size
HANA MR
Server Vendors
Commodity
Server
Vendors
1TB 10TB 100TB 10PB
Co
mp
eti
tio
n
Server Vendors, Appliance Vendors:
Oracle Exadata, Teradata, Neteeza
Greenplum DB (DCA), Oracle Big Data
Appliance, HP Vertica
Web 2.0
VBlock
Open Source
High
performance BI
Appliance
VectorWise
In-Memory DW
Massive
Scale-out
<5TB
10-40TB
50-500TB
100TB-5PB Database
Cisco Switching
Server Vendors
14 © 2012 Cisco and/or its affiliates. All rights reserved.
14
Data Warehouse Modernization
• Over 4X more performance
• 50% more cores, 3X more threads
• New virtualization & security features
• Lower power
• *Third party brands and names may be claimed by others
• Source: Publicly posted scores on http://www.tpc.org/tpch/default.asp as of 5-May-2011. See backup for system configurations , availability & QphH
Software & workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations & functions. Any
change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more
information go to http://www.intel.com/performance. See “Configurations” page in backup for configuration details.
2-Processors (Xeon® 5355)
HP ProLiant* ML370G5
2-Processors (Xeon® 5680)
HP ProLiant DL380G7
4.18
1.00
2007 Server 2011 Server
Data
Ware
house P
erf
orm
ance
TP
C-H
, Q
phH
, 100 G
B S
calin
g
(Hig
her
score
s a
re b
etter)
17,687 73,975
4-Processors (Xeon® E7-8837)
Dell PowerEdge* R910 w/ VectorWise* 1.6
2011 Server
Data
Ware
house P
erf
orm
ance
TP
C-H
, Q
phH
, 300 G
B S
calin
g
(Hig
her
score
s a
re b
etter)
400, 932
Data Warehouse Scale Up (From 2 to 4 Processors) • 4-processor servers support up to 2TB of
memory for top performance on larger data warehouses
15 © 2012 Cisco and/or its affiliates. All rights reserved.
• Open-source software framework that supports data-intensive distributed applications
• Apache project written in Java programming language
• Derived from Google’s MapReduce and Google File System (GFS)
• Includes a number of projects – MapReduce, Hbase, HDFS, Pig, Hive, etc
16 © 2012 Cisco and/or its affiliates. All rights reserved.
Database
NoSQL Database
\
Exclusive hardware reference
Joint GTM
Several joint engagements
UCS is exclusive hardware reference
Several joint engagements UCS is the only partner platform
Commercial, distributed key-value
database.
Commercial
Document-oriented
database
NetApp Open Solution for Hadoop (NOSH)
© 2010 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 18
Private Cloud Automation
Research/ Academia
Experimental
OpenFlow/SDN
components for
production
networks
Massively Scalable Data Center
Customize with
Programmatic
APIs to provide
deep insight into
network traffic
Service Providers
Policy-based
control and
analytics to
optimize and
monetize
service delivery
Enterprise
Virtual
workloads, VDI,
Orchestration of
security profiles
Cloud
Automated
provisioning
and
programmable
overlay,
OpenStack
Diverse Programmability Requirements Across Segments (Automation & Programmability)
Scalable Multi-Tenancy
Network Flow Management
Network “Slicing”
Agile Service Delivery
© 2010 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 19
Technical
Advisory Group
Chair,
Working Groups:
Config, Hybrid,
Extensibility,
Futures/FPMOD/
OF2.0
802.1 Overlay Networking
Projects, Cisco
Innovations:
FEX Architecture
Overlay Working Groups:
NVO3, L2VPN, TRILL, L3VPN, LISP,
PWE3
API Working Groups:
NETCONF, ALTO, CDNI, XMPP, SDNP,
I2AEX
Controller Working Groups:
PCE, FORCES
Open Source
Cloud
Computing
project
Open Network
Research Center at
Stanford University
Working Groups:
Quantum API
Donabe
Cisco Innovations:
OpenStack API for
Nexus
OpenStack Extensions
Note: Very little standardization in hypervisor technologies (e.g. live migration, config, APIs)
© 2010 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 20
Resilient. Scalable. Secure.
Rich-features. Evolutionary
Investment Protection
Simpler. Fewer nodes to manage.
Topology View Combined Benefits
Control Plane
Data Plane
Control Plane
Data Plane
Control Plane
Data Plane
Control Plane
Current Model “SDN approach” Hybrid Model?
22 © 2012 Cisco and/or its affiliates. All rights reserved.
- “Big data: The next frontier for innovation, competition, and
productivity”, McKinsey Global Institute
- Google Flu Trends project
- Urban EcoMap
- Hadoop World NY
- Big Data in the Enterprise
- How Companies Learn Your Secrets
© 2010 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 23
What’s on your future checklist?
© 2010 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 24
Why Cisco?
Fortune 100 Best Companies to Work For
Best Employers in Europe, India, Canada
13th Most Valuable Brand in the World
Top 50 Employers—Careers and the DisABLED
Top 50 Places Women Want to Work in the UK
China’s Most Respected Companies
Worlds Most Ethical Companies
Best Places to Launch a Career
© 2010 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 25
Phase 1
(Q1-Q2 FY12)
Industry-leading
accelerated career
development
program for
top university
graduates
worldwide who
aspire to become
the next generation
of sales leaders
at Cisco
World-Class
Sales Training Phase 1
(Q1-Q2 FY12)
Business, sales,
and technical
curriculum delivered
virtually
in state-of-the-art
classrooms that
leverage Cisco
TelePresence,
Cisco WebEx, and
iPad technologies.
Virtual
Learning
(Months 1-3)
Phase 1
(Q1-Q2 FY12)
Associates move
into a sales or
engineering role
where they will get
on-the-job
experience
interacting with
customers and
partners and will be
mentored by Cisco
seasoned sales
and engineering
professionals.
On-the-Job
Experience
(Months 4-12) Phase 1
(Q1-Q2 FY12)
After successfully
completing the
program
Associates are
promoted into a
Virtual Account
Manager or Virtual
Systems Engineer
role
Join the
Cisco Sales
Organization
(Month 13)
© 2010 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 26
San Jose, CA ( ASE)
Raleigh, NC
(ASR + ASE)
London (ASR + ASE)
Amsterdam (ASR+ASE)
Toronto (ASR + ASE)
Shanghai (ASR)
Frankfurt (ASR + ASE)
Singapore (ASR & ASE)
Beijing (ASE)
Bangalore (ASR + ASE)
Mexico City
(ASE)
Sao Paulo
(ASR+ASE)
Moscow (ASR)
Riyadh (ASR)
Training Hub Location
Mexico ASEs to train in RTP Year 1 and will return to home country in Year 2 (based on business need)
Johannesbu
rg
(ASR)
Dubai (ASR)
Moscow, Saudi Arabia, UAE and South Africa ASRs train in Amsterdam Year 1 and will return to home country in Year 2 (based
on business need)
© 2010 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 27
www.cisco.com/go/universitycsap
© 2010 Cisco and/or its affiliates. All rights reserved. Cisco Confidential 28
http://www.facebook.com/CSAPINFO