Date post: 22-Dec-2015
EMC CONFIDENTIAL—INTERNAL USE ONLY

EMC Disk Library
• Two new EDL engines: DL5100 & DL5200
  – Based on CLARiiON CX4 array
  – Up to 10.2 TB per hour
• New software revision: 4.0
  – Similar to 3.3; the only new feature is CX4 support


Data Domain
• 3 new appliances:
  – DD670: single quad-core / 3 optional PCI cards
  – DD860: DD Archiver enables data movement between tiered storage based on time
  – DD890: TBD
• DDOS 5.0 scheduled for Q1 2011
  – Support for up to 96 GB of RAM instead of 64 GB on DD880/DD890
  – DD Archiver for externalization
  – Support for I-series AS400 through BRMS
  – LACP & IP-aliasing support
  – 50% of the CLI commands changed compared to 4.x
  – IPv6 is not supported


Data Domain: Replication
• CIFS / NFS replication of files starts after 10 min of inactivity
• VTL replication needs the virtual tape to be unmounted before replication starts
• Replication can be encrypted if needed
• Files due to be replicated are not affected by GC (cleaning)
• If replication is slow, the box can fill up


Data Domain: Best Practices
• DD Boost device parallelism:
  – “Target sessions” default is 1
  – Optimal is 4, maximum is 10, for performance reasons
  – Can configure multiple devices per SN (storage node)
• DD Boost 2.2.2.0 library is used by NetWorker 7.6 SP1
• AIX and HP-UX are currently not supported by DD Boost; support is planned for NW 7.6 SP2
• For a DD VTL device, “target sessions” and “maximum sessions” need to be set to 1 to avoid multiplexing, which causes poor de-duplication ratios
• Hashing is optimized for Intel architectures (performance will be better than on SPARC)
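The session guidance on this slide can be expressed as a small settings check. This is only a sketch: the `DeviceSettings` class, the `"boost"` / `"vtl"` labels, and `check_device` are hypothetical names, not NetWorker or Data Domain APIs, and the rule that target sessions should not exceed max sessions is an assumption rather than something the slide states.

```python
from dataclasses import dataclass
from typing import List

# Values taken from the slide; all names here are illustrative only.
BOOST_MAX_SESSIONS_LIMIT = 10  # "maximum 10 for performance reasons"
BOOST_OPTIMAL_TARGET = 4       # "Optimal 4"


@dataclass
class DeviceSettings:
    kind: str              # "boost" or "vtl" (hypothetical labels)
    target_sessions: int
    max_sessions: int


def check_device(dev: DeviceSettings) -> List[str]:
    """Return warnings for settings that contradict the slide guidance."""
    warnings = []
    # Assumption: target sessions should never exceed max sessions.
    if dev.target_sessions < 1 or dev.max_sessions < dev.target_sessions:
        warnings.append("need 1 <= target sessions <= max sessions")
    if dev.kind == "vtl":
        # Multiplexing hurts de-duplication, so both values should be 1.
        if dev.target_sessions != 1 or dev.max_sessions != 1:
            warnings.append("VTL: set target and max sessions to 1")
    elif dev.kind == "boost":
        if dev.max_sessions > BOOST_MAX_SESSIONS_LIMIT:
            warnings.append("Boost: max sessions above 10")
        if dev.target_sessions != BOOST_OPTIMAL_TARGET:
            warnings.append("Boost: target sessions differs from optimal 4")
    return warnings
```

For example, `check_device(DeviceSettings("vtl", 4, 10))` reports the multiplexing warning, while a Boost device with target 4 and max 10 passes cleanly.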


Data Domain: Best Practices (cont.)
Number of sessions / memory available
• Optimized cloning is counted as a replication stream

RAM        Backup write  Backup read  Repl source  Repl destination  Maximum
           streams       streams      streams      streams           concurrent
<=32 GB    90            50           90           90                <140
>=64 GB    180           50           90           180               <180
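A planned workload can be checked against these limits with a few lines of Python. The sketch below copies the figures from the table; the names `StreamLimits`, `limits_for_ram`, and `plan_fits` are hypothetical, and it assumes that "maximum concurrent" bounds the sum of all four stream types, which the slide does not spell out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StreamLimits:
    backup_write: int
    backup_read: int
    repl_source: int
    repl_dest: int
    max_concurrent: int  # given on the slide as a strict bound ("<140", "<180")


# Figures copied from the table. RAM sizes strictly between 32 GB and
# 64 GB are not covered by the slide, so they are rejected here.
LIMITS_32GB_OR_LESS = StreamLimits(90, 50, 90, 90, 140)
LIMITS_64GB_OR_MORE = StreamLimits(180, 50, 90, 180, 180)


def limits_for_ram(ram_gb: int) -> StreamLimits:
    if ram_gb <= 32:
        return LIMITS_32GB_OR_LESS
    if ram_gb >= 64:
        return LIMITS_64GB_OR_MORE
    raise ValueError("table does not cover RAM between 32 GB and 64 GB")


def plan_fits(ram_gb: int, write: int, read: int,
              repl_src: int, repl_dst: int) -> bool:
    """True if every per-type limit and the concurrent total are respected."""
    lim = limits_for_ram(ram_gb)
    per_type_ok = (write <= lim.backup_write and read <= lim.backup_read
                   and repl_src <= lim.repl_source
                   and repl_dst <= lim.repl_dest)
    # Assumption: the concurrent limit applies to the sum of all streams.
    total_ok = (write + read + repl_src + repl_dst) < lim.max_concurrent
    return per_type_ok and total_ok
```

Because optimized cloning counts as a replication stream, clone jobs would be added to `repl_src` or `repl_dst` when using a check like this.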


Storage Node Implementation
[Diagram labels: File systems; Virtual; Proprietary file systems via NDMP; Snapshots, CDP, and CRR; Applications; save; DSA; dasv; savewrapper; MMD (storage node media daemon); libddboost; Data Domain]


Configuration
• Basic constructs
  – Savestreams
    • Movement of data
    • Generates a saveset on the target device
    • Can represent a system disk, file system, or directory (system disk shown)
  – Data Domain Device type
    • Logical construct
    • Each Data Domain Device type uses a unique instance of Boost
  – Storage unit
    • Logical construct that Boost uses for target
    • Max qty of storage units is model dependent
[Diagram labels: client volumes (/, C:, D:); Savestreams; Data Domain Device Type; Boost; Storage Unit; Savesets]


Configuration
• Design considerations
  – Multiple Data Domain Devices per storage node
    • Cannot be shared between multiple storage nodes
    • Each generates a new Boost footprint
    • Practical limits
      – # of devices per storage node
      – Available memory
  – Multiple logical storage units per Data Domain system
    • Each creates a new folder
[Diagram labels: client volumes (/, C:, D:); Data Domain Device Type; Boost (three instances); Storage Unit]


Configuration
• Design considerations
  – Best practices for savestream multiplexing
    • Boost is optimized for handling a single stream
      – Open, read/write, close, move on to next file
    • Target sessions: ‘optimal’ setting for multiplexed savestreams
      – Default = 4; NetWorker will exceed this if the workload demands more resources
      – No benefit in reducing to <4
    • Max sessions: ‘hard limit’ setting
      – Default is 10; NetWorker will not exceed this
      – Allocates memory for 10 sessions
      – Reduce max sessions to less than 10 to reduce memory allocation
[Diagram labels: Storage Node w/ Boost; client volumes (/, C:, D:); Boost; 8 savestreams (sessions) shown]


Configuration
• Design considerations
  – Best practices for pools
    • Add devices if savestreams exceed max sessions and if system resources are available
      – Maximize available DD system bandwidth and de-duplication efficiency
    • Optionally add devices & reduce max sessions
      – 2 devices with MAX SESSIONS = 4 is better than 1 device with MAX SESSIONS = 8
    • Build pools across multiple systems as a last option
      – If sessions/bandwidth to the first Data Domain system is maximized
      – Lose some global de-duplication efficiency
    • Do not mix Data Domain Device types and any other device type in the same pool
      – Impacts Clone Controlled Replication operation
[Diagram labels: Storage Node w/ Boost; Boost; Boost]
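The "add devices if savestreams exceed max sessions" rule amounts to a ceiling division. A minimal sketch, assuming every device in the pool uses the same max sessions value; `devices_needed` is a hypothetical helper name, not a NetWorker function.

```python
import math


def devices_needed(savestreams: int, max_sessions: int) -> int:
    """Smallest number of devices whose combined max sessions cover the streams."""
    if savestreams < 0 or max_sessions < 1:
        raise ValueError("savestreams must be >= 0 and max_sessions >= 1")
    return math.ceil(savestreams / max_sessions)
```

With 8 concurrent savestreams, `devices_needed(8, 4)` gives 2 devices and `devices_needed(8, 8)` gives 1; the slide prefers the first layout. Whether the storage node has the memory for the extra device still has to be checked separately.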


Configuration
• Design considerations
  – Best practices for design
    • Configure a Data Domain Device type to a single storage node
      – Cannot share a Data Domain Device type between storage nodes
    • Map multiple storage nodes to a single Data Domain system to maximize system bandwidth
      – Multiple storage nodes per Data Domain system help drive available DD bandwidth to saturation
[Diagram labels: Storage Node A w/ Boost; Storage Node B w/ Boost; Boost; Boost]


Configuration
• Design considerations
  – Best practices for design
    • Configure a Data Domain Storage Unit to a single NetWorker Data Zone
    • No sharing of storage units across NetWorker Data Zones
[Diagram labels: Data Zone A w/ Boost; Data Zone B w/ Boost; Boost; Boost]


Configuration
• Design considerations
  – Best practices for design
    • Do not exceed the maximum sessions specification for the Data Domain model
      – E.g. DD880 = 180 sessions max
      – 10 Data Domain Devices, each with max sessions @ 10 = 100 potential sessions
      – 5 storage nodes w/ max sessions = 50 sessions
[Diagram labels: Boost (five instances)]
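The arithmetic behind this guideline is a sum of per-device max sessions compared with the model limit. The sketch below uses the DD880 figure of 180 from the slide; the function names are hypothetical, and it assumes the model limit applies to the total across all devices on all storage nodes.

```python
from typing import Iterable

DD880_MAX_SESSIONS = 180  # figure quoted on the slide


def potential_sessions(max_sessions_per_device: Iterable[int]) -> int:
    """Worst-case session count if every device runs at its max sessions."""
    return sum(max_sessions_per_device)


def headroom(model_max: int, max_sessions_per_device: Iterable[int]) -> int:
    """Sessions left under the model limit; negative means over-committed."""
    return model_max - potential_sessions(max_sessions_per_device)
```

Ten devices at max sessions 10 give `potential_sessions([10] * 10) == 100`, which leaves a headroom of 80 on a DD880. Recoveries and replications draw on the same limit, so the headroom is not all available for backups.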


Clone Controlled Replication
• Immediate cloning
  – Clones begin as soon as the savegroup backup has finished
  – Pro: reduces the gap in time between a secure backup and the completed copy
  – Con: other savegroups may still be running, creating resource contention
• Scheduled cloning
  – Two approaches with NW 7.6 SP1: NMC and scripts/scheduler
  – Objective: postpone the clone process to reduce resource contention with backups
  – Pro: allows backups to complete as quickly as possible
  – Con: increases the gap in time between the secure backup and the completed copy


Clone Controlled Replication
• Comparison of replication types
  – Directory replication
    • Used by existing Data Domain users without backup application control
    • Replication begins even as the backup is in process
    • Pro: reduces the gap in time between a secure backup and the completed copy
    • Con: the replica is not kept in the backup application's catalog
  – File replication
    • Used by customers deploying the NW/DD Boost integration
    • Replication process is initiated by NetWorker after the backup is completed
    • Pro: the replica is cataloged by NetWorker
    • Con: increases the gap in time between the secure backup and the completed copy
[Timeline diagrams, one per replication type: Backup and Replication bars over T0 to T6]


Clone Controlled Replication
• Best practices
  – Reduce the gap in time between a secure backup and the completed copy
    • Use immediate cloning
    • Increase granularity of the backup and increase concurrency
      – Reduce savegroup size
      – Use saveset cloning
      – Reduce saveset size
[Timeline diagrams over T0 to T6: (1) Savegroup 1, Replication 1; (2) Savegroup 1, Savegroup 2, Replication 1, Replication 2]


Clone Controlled Replication
[Diagram labels: NW Server; NMC (manages NW); Storage Node; Data Domain Replication; Remote Site; Clone 1; Clone 2; Saveset: use clone ID or clone pool, use remote storage node]


Clone Controlled Replication
• Remote clone to tape
  – Remember to clone from the clone ID/clone pool and not the original saveset (backup)
  – Remember to use a storage node attached to the tape device(s) at the remote site
  – Each clone is independently scheduled
    • First clone is based upon backup
      – Use immediate or scheduled
    • Remote clone is created from the first clone
      – Must be scheduled
      – Clone of a clone is a separate policy
• Data Domain system stream bandwidth is shared
  – Backups, recoveries, replications
  – E.g. DD880: 180 connections max
    • # backup savestreams + # recoveries + # replications cannot exceed 180


Resource Planning
EMC NetWorker with EMC Data Domain Boost Best Practices Planning (in draft)
• Storage node memory: Boost
  – Default and minimum memory allocation is 64 MB, supporting 4 sessions
  – Changing target sessions to <4 still allocates 64 MB
  – Each additional session allocates 16 MB
• Back-of-napkin calculation for Boost
  – m = n * (64*s)
    • m = memory in MB
    • n = number of Data Domain Devices
    • s = sessions
• Storage node memory: Data Domain Device type
  – Allocates between 200 MB and 250 MB of memory per device
  – Includes memory used by the RO device
• Design recommendation (not minimums)
  – 8 Data Domain Devices per storage node, no more than 16 max
  – 4 streams per device, no more than 10 max
  – 8 GB RAM
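The memory figures on this slide can be turned into a rough estimator. The sketch below is illustrative only, and all function names are hypothetical. `boost_memory_mb` follows the per-session bullets (64 MB covers the first 4 sessions, 16 MB for each one beyond that), while `napkin_memory_mb` reproduces the slide's formula exactly as written. The two give different results for more than one session, and the slide does not say which is intended, so both are shown.

```python
from typing import Tuple

BOOST_BASE_MB = 64            # default and minimum allocation
BOOST_BASE_SESSIONS = 4       # sessions covered by the base allocation
BOOST_EXTRA_SESSION_MB = 16   # each additional session
DEVICE_MB_RANGE = (200, 250)  # per Data Domain Device, includes the RO device


def boost_memory_mb(devices: int, sessions: int) -> int:
    """Boost memory per the bullets: 64 MB base plus 16 MB per extra session."""
    extra = max(0, sessions - BOOST_BASE_SESSIONS)
    return devices * (BOOST_BASE_MB + BOOST_EXTRA_SESSION_MB * extra)


def napkin_memory_mb(devices: int, sessions: int) -> int:
    """The slide's formula as written: m = n * (64 * s)."""
    return devices * (64 * sessions)


def device_memory_mb(devices: int) -> Tuple[int, int]:
    """Low and high estimate for the Data Domain Device type itself."""
    low, high = DEVICE_MB_RANGE
    return devices * low, devices * high
```

For the recommended layout of 8 devices with 4 streams each, `boost_memory_mb(8, 4)` gives 512 MB, `napkin_memory_mb(8, 4)` gives 2048 MB, and `device_memory_mb(8)` gives a range of 1600 to 2000 MB. Either way the totals sit comfortably inside the recommended 8 GB.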


Resource Planning
(Subject to update; for reference only)
• EMC® NetWorker® Data Domain® Deduplication Devices Integration Guide
  – Page 16: Memory and network considerations
    • Each read/write device (active nsrmmd process) that takes four save streams requires about 96 MB of RAM on the storage node.
    • Each read-only device requires about 20 MB, regardless of the number of save streams.
    • A fully loaded Data Domain system that is running four save streams per device would require about (96 MB x 16 devices) + (20 MB x 16 devices) = 2.3 GB of physical memory on the storage node.
    • The recommended minimum memory requirement for a storage node is 4 GB of RAM.
      – Preliminary, subject to update in the next revision of the document
      – 4 GB minimum also reflects purchasing options (increments)
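The per-device figures quoted from the integration guide can be evaluated directly. This is a minimal sketch with a hypothetical function name; it assumes the 96 MB figure applies to every read/write device at four save streams.

```python
RW_DEVICE_MB = 96  # read/write device (active nsrmmd) at four save streams
RO_DEVICE_MB = 20  # read-only device, independent of stream count


def storage_node_memory_mb(rw_devices: int, ro_devices: int) -> int:
    """Memory estimate from the per-device figures quoted from the guide."""
    return RW_DEVICE_MB * rw_devices + RO_DEVICE_MB * ro_devices


# The slide's worked example: 16 read/write and 16 read-only devices.
total_mb = storage_node_memory_mb(16, 16)
print(total_mb)  # 1856
```

Evaluated as written, the expression comes to 1,856 MB (about 1.8 GB), which is lower than the 2.3 GB quoted on the slide. The source does not explain the difference, so the quoted total is best treated as approximate; both figures are well under the 4 GB recommended minimum.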


Resource Planning
• Storage node processor
  – Distributed Segment Processing increases processor utilization on the first backup
  – Subsequent backups will benefit from
    • Reduced CPU utilization
    • Reduced LAN traffic
• Server
  – Encryption (encryptasm) not supported with the Data Domain Device type
  – Compression (compressasm) not supported with the Data Domain Device type
  – CheckPoint Restart not supported with the Data Domain Device type
  – Data Domain retention locks are not supported with this release of NetWorker


Distributed Segment Processing
• Customer benefits
  – Higher aggregate backup throughput
    • Backup windows shrink considerably
    • Enables faster DR readiness
  – Lower CPU usage on the Data Domain system
    • CPU can be used for other tasks, such as replication, cleaning
  – Reduced CPU usage on the media server
    • 20-40% lower overhead on the media server
    • No need to upgrade the media server hardware
  – Leverage existing 1GbE backup infrastructure
    • Achieve 10GbE throughput with 1GbE networks
    • Avoid the need to upgrade media server and network hardware
  – Failed backups go much faster on retries
    • Data that is already sent to the Data Domain system need not be sent again
    • Enables faster backups for retried backups


Distributed Segment Processing – Good Fit
• Good fit situations
  – GDA (mandatory)
  – Network (1 GbE) constrained connectivity to Data Domain system
  – High stream counts (>8 streams)
    • DD model dependent; lower-end models see benefits at fewer stream counts


Other Considerations
• Boost license applied to the DD system
  – Distributed Segment Processing ‘on’
• Replicator license applied to the DD system
• NetWorker enabler for Data Domain Device type applied
  – Enables backups to be directed to Boost
  – Enables Clone Controlled Replication


Planning: Support Dependencies

DD Boost License

• One per DDR

DD OS

• DD OS 4.8 and 4.9

NetWorker Storage Nodes

• 7.6 SP1 has integrated DD Boost
• Windows 2003/2008/2008 R2, Linux (Red Hat 4/5, SuSE 10/11), Solaris 9/10 (SPARC)

NetWorker Server

• 7.6 SP1
• DD Device Type Enabler per raw capacity (except for capacity licensing)

NMC

• 7.6 SP1 contains DD device management and new device wizard


Installation & Setup
• Configure Data Domain system
  1. Install DD BOOST license
  2. Install REPLICATION license (optional)
  3. Enable DD BOOST protocol
  4. Create DD BOOST user/password
  5. Create DD BOOST Storage Units (at least one for each storage node)
  6. Enable DD BOOST distributed segment processing
  7. Define DD BOOST Interface Group (optional)
  8. Enable/disable DD BOOST low bandwidth optimization (optional)
  9. Configure SNMP for NMC monitoring (optional)

• Configure NetWorker system
  1. Install NetWorker 7.6.1
  2. Install DD Device Type Enabler (based on the DD raw disk capacity)
  3. DD system and NetWorker storage node must be IP connected
  4. Create a new Data Domain system
  5. Add credentials for the DD system (user/password)
  6. Create a new NetWorker device (Data Domain device type)
  7. Configure media pool
  8. Configure SNMP monitoring options (optional)
  9. Configure backup group, cloning policy, etc.

