Remote Office Data Protection with De-duplication for VMware
Breakout Session TA3089
Neela Jacques Tom Martin Mark JonesVMware EMC Corporate Express
September 2008
Disclaimer
This session may contain product features that are currently under development.
This session/overview of the new technology represents no commitment from VMware to deliver these features in any generally available product.
Features are subject to change, and must not be included in contracts, purchase orders, or sales agreements of any kind.contracts, purchase orders, or sales agreements of any kind.
Technical feasibility and market demand will affect final delivery.
Pricing and packaging for any new technologies or features discussed or presented have not been determined.
“These features are representative of feature areas under development. Feature commitments are subject to change, and must not be included in contracts, purchase orders, or sales agreements of any kind. Technical feasibility and market demand will affect final delivery.”
Today’s Agenda
� Remote Office: Neela Jacques, VMware
� Remote Office & Branch Office (ROBO) Challenges
� A Better Way: Empowering ROBO with Virtualization
� Optimizing Protection: Tom Martin, EMC Avamar
� Backup & Recovery with De-duplication
� Customer Implementation: Mark Jones, Corporate Express� Customer Implementation: Mark Jones, Corporate Express
� Virtualization and De-duplication For Tangible Benefits
� Q&A
Remote Office & Branch Offices (ROBO) Challenges
� ROBO’s face the same information infrastructure challenges as the data center� Data protection� Security and compliance� Storage consolidation
� Branch offices often lack data center best practices� Difficult to leverage data center investments at the
Complexity
As much as 75% of your corporate data may live at remote sites
� Difficult to leverage data center investments at the branch
� Information growth at remote sites has made costs difficult to control
� Challenges are even more complex due to network dependency� Network topology and connectivity impact remote
operations� Increasing WAN connections is a costly solution for
remote user performance
Availability
Efficiency
Typical ROBO Environment
� Too many servers (~ 5 to 10)� Limited space, power and cooling� Each server adds to maintenance costs
� Management complexity� Most sites lack server administrators� Outsourcing is an imperfect solution� Supporting legacy operating systems can be a key challenge
� Speed and ability to deploy new applications
WAN
RemoteOffice
RemoteOffice
RemoteOffice
� Speed and ability to deploy new applications� Redundant and often inconsistent hardware and software� Adding applications requires additional hardware and space� New hardware needs to be provisioned at HQ or requires HQ
support
� Data protection� High data redundancy that can degrade application performance,
result in inefficient storage utilization� Host failure recovery is tedious and time consuming� Data security and compliance is at risk with tape backups
HQ Data Center
Centralized
Manageability
Business
Simplify IT Management
� Standardize on a single virtual server platform
� Remotely and centrally manage from the Data Center
� Accelerate provisioning
Ensure Availability and Recoverability
Empowering ROBO with Virtualization
Improve management and protection of your remote offices
Business
Continuity
Cost Savings
� Guarantee uptime
� Automate backup and restore
Reduce Hardware and Operational Costs
� Minimize costly setup and support trips
� Reduce operational costs
� Deploy new applications without adding any Hardware
Centralized VirtualizationEnable remote offices to access virtual machines on centralized server hardware in the data center
�Maximizes server consolidation ratios, cost savings
Two Approaches for Virtualizing ROBO
Distributed VirtualizationVirtualize servers and desktops at the remote office and remotely manage them from the data center
�Maintains application performance regardless of WAN speed and availability
Centralized Server & Desktop Deployments
Centralized Deployments
> Remote/branch sites access virtual machines on centralized server hardware in the data centercenter
> Maximize server consolidation and cost savings
> Reduced management & support problems
Consideration �Is your centralized backup
infrastructure prepared to protect all of this new data?
Distributed Server & Desktop Deployments
Distributed Deployments
> Virtual servers and desktops reside at the remote/branch site.
> Maintain application performance regardless of WAN regardless of WAN speed or availability
> Centralized and remote management
> VDI
Consideration �Is your distributed backup
infrastructure prepared to efficiently protect this remote data?
AVAMAR: Backup & Recovery with Source-based Data De-duplication
� Global, source-based data de-duplication defuses the explosion of backup data
� Radically reduces the media, network, and time required for backup
� Alters the fundamental economics of disk versus tape
DE-DUPLICATION AT THE
SOURCE
10
disk versus tape
� Accelerates shift to disk as primary medium for backup
� An alternative to archaic IT processes (shipping tapes to disaster recovery)
� Automated, encrypted remote copy over existing WANs
THE TRADITIONAL BACKUP PROBLEM �
Backup Server
Disk / Tape
� All data must pass through the shared resources
� Traditional backup is very inefficient – does nothing to eliminate redundant data.
� De-duplication at the “source” solves both centralized and distributed remote office backup issues
How Source-based De-duplication WorksGlobal data de-duplication instances
� Client agent at the source scans the data for changes
� Modified files broken into sub-file variable length segments
� Only unique sub-file segments are backed up
Avamar Data Store or Avamar Server
Queries local file cache to identify and disregard static files
Compresses and applies a hash to each object
Queries local hash cache, and then the Avamar server to determine if objects are unique
Avamar Real-World Results
Data Type
Amount of
Primary Data
Backed Up
Amount of
Data Moved
Daily
Daily
De-
duplication
Ratio
Windows file systems 3,573 GB 6.1 GB 586:1
Mix of Windows, Linux, and UNIX file systems
5,097 GB 11.7 GB 436:1
Engineering files on NAS (NDMP backups)
3,265 GB 24.2 GB 135:1
Mix of 20 percent databases, 80 percentfile systems (Windows and UNIX)
9,583 GB 80.0 GB 120:1
Mix of Linux file systems and databases 7,831 GB 104.2 GB 75:1
Sourced-base De-duplication vs. Traditional Backup
Avamar for VMware Guest OS backup
Network UsageCPU Usage
Traditional
Avamar
14
Disk Usage
VMware Centralized Integration for ROBOAvamar Agent de-duplicates data across all end-user file shares in the data center before moving it to the Avamar Server.
Avamar
Agent
Avamar Data Store
or Server
Exchange File and Print DNSBackup
Server
Tape
Traditional Data Protection
VMware Distributed Integration for ROBO
Application
Operating System
Application
Operating System
Application
Operating System
Backup & recovery and DR deployed as a virtual appliance at each remote office
Avamar VM
Operating System
Avamar Virtual Edition
XX Library
Tape
Media
Off-site
Tape
Storage
Hardware
ESX Server
DiskNICMemoryCPU
XXX
Illustration: VMware Distributed Integration for ROBO
Avamar Virtual
Edition
Avamar Virtual
Edition
RemoteOffice
RemoteOfficeAX4 AX4
AVE allows for local
backups $
Built-in replication allows
for DR $ no more tapes
at the remote sites!
Flexible Deployment Options
Remote Offices without VMwareRemote Offices with VMware
WAN(ENCRYPTED) (ENCRYPTED)
(ENCRYPTED)
Hardware
ESX Server
DiskNICMemoryCPU
Avamar VM
Operating
System
Application
Operating
System
Single-node Avamar ServerAVE
Guest-level Backup
LAN/SAN SAN
VCB Proxy Server
x86 Architecture
VMware Virtualization Layer
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
DiskNICMemoryCPU
x86 Architecture
VMware Virtualization Layer
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
DiskNICMemoryCPU
x86 Architecture
VMware Virtualization Layer
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
DiskNICMemoryCPU
x86 Architecture
VMware Virtualization Layer
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
DiskNICMemoryCPU
x86 Architecture
VMware Virtualization Layer
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
DiskNICMemoryCPU
x86 Architecture
VMware Virtualization Layer
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
App
OS
DiskNICMemoryCPU
VCB Backup
What If You Can’t Change Your Environment?
DE-DUPLICATION AT THE
TARGET
Backup application sends native data to a target storage device
Data is de-duplicated once it reaches the target
De-duplication can happen
Backup and recovery with target-based de-duplication
De-duplication can happen during or after backup
Found in VTLs or LAN B2D appliances
Transparency to backup application offers users a “plug and play” experience
VMware – Corporate Express Australia
Virtualization and De-duplication For Tangible Benefits
Mark Jones
Infrastructure Manager
Corporate Express
Corporate Express Australia
• Supplier of Business Essentials
• Established in 1995
• Revenue $1.3 billion in 2007
• Over 2500 staff in 50 locations across Australia and New Zealand
• Listed on the ASX with majority shareholderStaples Inc
• Our missionL
21
• Our missionL”to provide a single source supply solution to make it easier and more cost-effective for ourcustomers to do business”
• e-Commerce sales currently exceed 74% of all orders
• Growth strategy
• 72 Acquisitions since 1995
Business and Technology Challenges
• Data Centre at maximum capacity
• Needed to improve DR capability & recovery time objectives
• Complex technologies without tie-in to an Infrastructure Strategy
• Existing SAN couldn’t scale to requirements• Existing SAN couldn’t scale to requirements
• All storage classified the same regardless of criticality
• Server & Storage demands increasing rapidly – time to deploy slow
• Data Centre backup complex to administrate
• Remote site backups troublesome
Strategic PrioritiesWhat Did We Do
Virtualise Server
Environment
Implement Scalable
SAN Storage Model
Leverage a Tiered
Storage Solution
Replace tape based
backup with Disk
Solution
Remove current data centre capacity issues together with improved server utilisation metrics
VMware
Provide a reliable, resilient, and highly available storage infrastructure that supports the business Strategy
Segment data based on its business value. The aim is to control storage costs and simplify data management.
Simplify solution, increase reliability and lower cost of backup data.
1 2 3 4
VMware Strategy
Symmetrix &
Clariion
Symmetrix, Clariion
& Centera
Avamar
Strengthen DR Capability
• Improve RPO & RTO objectives inline with business requirements• Incorporate Data replication into strategy
•Incorporate server virtualisation into strategy•Keep solution simple and easy to test
5
Impacts70-80% reduction on data
centre space, power, cooling
184 Physicals removed
Server rebuild and app load
from 60-80 hrs to 30-60
minutes
Reduce TCO of servers by
40%
The Outcome
AFTERBEFORE
64 cables/ports600 cables/portsNetwork
20 TB SAN50 TB DASStorage
16200Servers
Backups 2TB 200GB
Improved server fleet
utilization by 70%
Simplified DR approach &
reduced restoration timings
Substantial reduction in
storage requirements
Removed tape from
environment
2 server racks
10 KVA power
20 server racks
100 KVA power Facilities
Data Reduction
Service Levels
• Reduced overall storage from 92 to 44 TB• Over 2 TB mail storage saved through archiving• Removed need for Exchange PST files•1.5 TB file storage saved through archiving
• Solution scalable to meet business goals
• Data is stored at business value level (tiering)
• All servers boot from SAN• Improved visibility around storage management
Outcome
Backups
Disaster Recovery
• Non DB data backup reduced from 6 hours to 30 minutes• Data Centre backup window reduced from 9 to 5 hours• Remote site backup reduce from 30 to 3 hours across • Daily data backed up reduced from 2.0 TB to 200 GB • Removed tape and the need for tape silo storage & tape movements
• Replicate Tier One data synchronously to DR site• Simplified DR approach for storage & servers• Improved Tier 1 systems restoration time from 24 hours to minutes• Tier 2 systems restoration reduced from days to <12 hours
Putting It All Together �
Distributed ROBO with EMC Avamar for VMware
See this LIVE at the EMC booth, # 502
� Virtualize – VMware Infrastructure 3 ESX Server
� Leverage – EMC CLARiiON SX4 iSCSI SAN storage
RemoteOffice
� Protect – EMC Avamar Virtual Edition for VMware
Q&A
Breakout Session TA3089
Neela Jacques Tom Martin Mark JonesVMware EMC Corporate Express
September 2008