© 2009 EMC Corporation. All rights reserved.
EMC Proven Professional
The #1 Certification Program in the information storage and management industry
Introduction to Information Storage and ManagementIntroduction to Information Storage and Management
Chapter 1
Section 1 : Storage System
© 2009 EMC Corporation. All rights reserved.
Chapter Objectives
Upon completion of this chapter, you will be able to:
o Describe who is creating data and the amount of data being created
o Describe the value of data to business
o List the solutions available for data storage
o List and explain the core elements of data center
o Describe the ILM strategy
o Describe storage evolution
© 2009 EMC Corporation. All rights reserved.
Lesson : Information Storage
Upon completion of this lesson, you will be able to:
o Describe the importance of information to individuals and to businesses
o Define data and information
o Discuss the categories of data
o Describe the storage architectures and their evolution
© 2009 EMC Corporation. All rights reserved.
Why Information Storage
o “Digital universe – The Information Explosion”o 21st Century is information era o Information is being created at ever increasing rateo Information has become critical for success
o We live in an on-command, on-demand worldo Example: Social networking sites, e-mails, video and photo sharing
website, online shopping, search engines etc
© 2009 EMC Corporation. All rights reserved.
Nearly a quarter of the world's population – roughly 1.4 billion people – will use the Internet
on a regular basis in 2009.
© 2009 EMC Corporation. All rights reserved.
England has approximately 4 million surveillance cameras
1 for approximately every 14 Britons
© 2009 EMC Corporation. All rights reserved.
Storage requirements: Facebook
• 10,000,000,000 photos
• 2-3 Terabytes of photos are being uploaded to the site every day
• One petabyte of photo storage
• Serve over 15 billion photo images per day
• Photo traffic now peaks at over 300,000 images served per second
© 2009 EMC Corporation. All rights reserved.
Optimize
Information management is a big challenge
Store
Store
Protect
Protect Optimize
Leverage
Leverage
© 2009 EMC Corporation. All rights reserved.
If a Grain of Sand were One Byte of Information . . .
1 Gigabyte =1 billion bytespatch of sand— 9” square, 1’ deep1 Terabyte =1 trillion bytesa sandbox— 24’ square, 1’ deep1 Petabyte =1,000 terabytesa mile long beach— 100’ wide , 1’ deep
AVocabulary for Measuring Information
1 Megabyte =1 million bytesa tablespoon of sand
© 2009 EMC Corporation. All rights reserved.
A Vocabulary for Measuring Information
If a Grain of Sand were One Byte of Information . . .
1 Zetabyte =1,000 exabytesthe same beach—along the entire US coast
1 Gigabyte =1 billion bytespatch of sand— 9” square, 1’ deep1 Terabyte =1 trillion bytesa sandbox— 24’ square, 1’ deep1 Petabyte =1,000 terabytesa mile long beach— 100’ wide , 1’ deep
1 Exabyte =1,000 petabytesthe same beach—from Maine to North Carolina
1 Yottabyte =1,000 zetabytesenough info to bury the entireUS under 296 feet of sand
New
1 Megabyte =1 million bytesa tablespoon of sand
© 2009 EMC Corporation. All rights reserved.
What do you Think ?
o What is your contribution to the digital Universe ( how many Mb’s have you generated till date ?? )
a) <100 GB
b) 100 GB - 500 GB
c) 500 GB – 1 TB
d) > 1 TB
© 2009 EMC Corporation. All rights reserved.
What is Data
o Data is converted into more convenient form i.e. Digital Data
o Increase in data processing capabilities
o Lower cost of digital storageo Affordable and faster communication
technology
o Who creates data?o Individualso Businesses
“Collection of raw facts from which conclusions may be drawn”
010101010101010101101000010101011
01010101010
10101010101
01010101010
Video
Photo
Book
Letter Digital Data
© 2009 EMC Corporation. All rights reserved.
Categories of Datao Data can be categorized as either
structured or unstructured data
o Over 80% of enterprise information is unstructured
Rows and Columns
Contracts
Images
Manuals
X-Rays
Instant Messages
Forms
E-Mail Attachments
Check
Documents
PDFs
Web Pages
Audio Video
Invoices
Rich Media
Structured (20%)
Unstructured (80%)
© 2009 EMC Corporation. All rights reserved.
Define Information o What do individuals/businesses do
with the data they collect?o They turn it into “information”o “Information is the intelligence and
knowledge derived from data”
o Businesses analyze raw data in order to identify meaningful trends
o For example:o Buying habits and patterns of customerso Health history of patients
Users of Information
Centralized information storage and processing
Uploading information
Accessing information
Wired Wireless WiredWireless
Network Network
Demand for more Information
Creators of information
Virtuous cycle of information
© 2009 EMC Corporation. All rights reserved.
Value of Information to a Businesso Identifying new business opportunities
o Buying/spending patternso Internet stores, retail stores, supermarkets
o Customer satisfaction/serviceo Tracking shipments, and deliveries
o Identifying patterns that lead to changes in existing businesso Reduced cost
o Just-in-time inventory, eliminating over-stocking of products, optimizing shipment and delivery
o New serviceso Security alerts for “stolen” credit card purchases
o Targeted marketing campaignso Communicate to bank customers with high account balances about a special savings
plan
o Creating a competitive advantage
© 2009 EMC Corporation. All rights reserved.
Storage
o Data created by individuals/businesses must be stored for further processing
o Type of storage used is based on the type of data and the rate at which it is created and used
o Examples:o Individuals: Digital camera, Cell phone, DVD’s, Hard disko Businesses: Hard disk, external disk arrays, tape library
o Storage model: An evolutiono Centralized: mainframe computerso Decentralized: Client –server modelo Centralized: Storage Networking
© 2009 EMC Corporation. All rights reserved.
Storage Technology and Architecture Evolution
IP SAN
Multi Protocol
Router
SAN / NAS
FC SANLAN
RAID Array
JBOD
Internal DAS
Time
© 2009 EMC Corporation. All rights reserved.
Lesson Summary
Key points covered in this lesson:
o Importance of information
o Data, information and storage
o Categories of data
o Storage architectures and their evolution
Additional Task for Students
Research on Storage Technologies
& Architecture evolution ( RAID &
Storage Networking )
© 2009 EMC Corporation. All rights reserved.
Lesson: Data Center Infrastructure and Introduction to ILM
Upon completion of this lesson, you will be able to:
o List the five core elements of a data center infrastructure
o Describe the requirements of storage systems for optimally supporting business activities
o Explain the importance of Information Lifecycle Management
o List the activities in developing the ILM strategy
© 2009 EMC Corporation. All rights reserved.
Example of an Order Processing System
LAN FC SAN
Storage ArrayDBMS
Server/ OS
Application User
Interface
Client
© 2009 EMC Corporation. All rights reserved.
Key Requirements for Data Center Elements
AvailabilityAvailability
Data IntegrityData Integrity SecuritySecurity
CapacityCapacity
ScalabilityScalability
PerformancePerformance
ManageabilityManageability
© 2009 EMC Corporation. All rights reserved.
Challenges in Managing Information
o Exploding digital universeo Multifold increase of information growth
o Increasing dependency on informationo The strategic use of information plays
o Changing value of informationo Information that is valuable today may become less important tomorrow.
Additional Task
Research on Tiered Storage Model
© 2009 EMC Corporation. All rights reserved.
Information Lifecycle Management
Create Access Migrate Archive Dispose
New order
Value
Process order
Deliver order
Warranty claim
Fulfilled order
Ageddata
WarrantyVoided
Protect
Time
A proactive strategy that enables an IT organization
to effectively manage the data throughout its lifecycle
© 2009 EMC Corporation. All rights reserved.
Information Lifecycle Management ProcessPolicy-based Alignment of Storage Infrastructure with Data Value
AUTOMATED
FLEXIBLE
Classifydata /
applications based on business rules
Implement policies with information
management tools
Integrated management
of storage environment
Organizestorage resources
toalign with data
classes
© 2009 EMC Corporation. All rights reserved.
Benefits of Implementing ILMo Improved utilization
o Tiered storage platforms
o Simplified management o Processes, tools and automation
o Simplified backup and recoveryo A wider range of options to balance the need for business continuity
o Maintaining compliance o Knowledge of what data needs to be protected for what length of time
o Lower Total Cost of Ownership o By aligning the infrastructure and management costs with information value
© 2009 EMC Corporation. All rights reserved.
Lesson Summary
Key points covered in this lesson:
o The five core elements of a Data Center infrastructure
o Key requirements of storage systems to support business activities, as well as some of the constraints
o ILM strategyo Importanceo Characteristicso Activities in developing ILM strategyo IML implementationo Benefits of ILM
© 2009 EMC Corporation. All rights reserved.
Chapter Summary
Key points covered in this chapter:
o Importance of data, information, and storage infrastructure
o Types of data, its value, and key management requirements of a storage system
o Evolution of storage architectures
o Core elements of a data center
o Importance of the ILM strategy
© 2009 EMC Corporation. All rights reserved.
What do you Think ?
o If you were to send a 1.1 MB e-mail to 4 of your friends, by the time your 4 friend receive that e-mail, What is your contribution to the digital Universe ( how many Mb’s )
a) < 5 MB
b) 10 – 30 MB
c) 30 – 50 MB
d) > 50 MB
© 2009 EMC Corporation. All rights reserved.
How Much Data do YOU Create?
Source: IDC White Paper, "The Diverse and Exploding Digital Universe," Sponsored by EMC, March 2008
SENT TO FOURCOLLEAGUES
Backup
Tape Back-up
8.8 MB8.8 MB
2.2 MB
2.2 MB
2.2 MB
2.2 MBE-mail withdoc 1.1 MB
E-mail withdoc 1.1 MB
E-mail withdoc 1.1 MB
E-mail withdoc 1.1 MB
FINISH 51.5 MB!
Original MB Document + 1.0 E-mail Text 0.1 Local E-mail Copy 1.1 E-mail Server 1.1 Desktop Backup 1.0 Redundant Server 2.1 Tape Archive 4.2
9.5Copies (4) E-mail Local Copies 4.4 Server Copies 4.4 Server Backup 4.4 Tape Archive 8.8
22.0
Transient Overhead 20.0
TOTAL 51.5
START 1.1 MB
E-mail with Attachment
Document1 MB
RedundantBackup
E-mail Server/Desktop Backup
Tape Back-up
E-mail withdoc 1.1 MB
1.1 MB1.0 MB2.1 MB
2.1 MB
4.2 MB4.2 MB
© 2009 EMC Corporation. All rights reserved.
EMC Proven Professional
The #1 Certification Program in the information storage and management industry
What is Information Storage & ManagementWhat is Information Storage & Management
An Example for an Intelligent Information Infrastructure
© 2009 EMC Corporation. All rights reserved.
Typical IT Environment
Outlook users &file clients
File servers
SQL server
Exchange servers
LAN
LANbackup
CRM server
Web server
Tapebackup
App server
© 2009 EMC Corporation. All rights reserved.
Typical IT Environment – Potential Issues
Outlook users &file clients
File servers
SQL server
Exchange servers
LAN
LANbackup
CRM server
Web server
Tapebackup
App server
Data Everywhere
Data Everywhere
Data Everywhere
Data Everywhere
© 2009 EMC Corporation. All rights reserved.
Typical IT Environment – Potential Issues
Outlook users &file clients
File servers
SQL server
Exchange servers
LAN
LANbackup
CRM server
Web server
Tapebackup
App server
Multiple Backup
Multiple Backup
Hard to Manage
Low Utilization
Difficult to build DR
© 2009 EMC Corporation. All rights reserved.
Typical IT Environment – Potential Issues
Outlook users &file clients
File servers
SQL server
Exchange servers
LAN
LANbackup
CRM server
Web server
Tapebackup
App server
Unprotected Tape
User Identification
Compliance
How to Archive
Business Intelligence
To many servers
How to handle paper
RTO/RPO
How to manage SLA
Automating workflow
© 2009 EMC Corporation. All rights reserved.
IP/iSCSI
Store More Intelligently
Outlook users &file clients
SQL server
Exchange servers
LAN
LANbackup
ERP server
Web server
Tapebackup
App server
FC
SANIP
iSCSI
Unified Storage
© 2009 EMC Corporation. All rights reserved.
Protect Against Data Loss
Outlook users &file clients
SQL server
Exchange servers
LAN
Consolidatedbackup
ERP server
Web server
App server
FC
SANIP
iSCSI
Data ReplicationData Snapshot
© 2009 EMC Corporation. All rights reserved.
Protect from Mis-Use (Security)
Outlook users &file clients
SQL server
Exchange servers
LAN
ERP server
Web server
App server
FC
ProtectUser Access
ProtectBackup Tape
ProtectInformation
ProtectDatabase
ProtectReplication
© 2009 EMC Corporation. All rights reserved.
Intelligent Information Management
Outlook users &file clients
SQL server
Exchange servers
LAN
ERP server
Web server
App server
FC
ArchiveStorage
Archive inactivedata before backup
backup
Information Classification
TieredStorage
© 2009 EMC Corporation. All rights reserved.
Leveraging Content to Create Business Value
Outlook users &file clients
SQL server
Exchange servers
LAN
ERP server
Web server
FAX
Unified Content Repository
Report Data
Web Content
Transaction
Customer data
Unstructured files
ECM server
Capture, Classify, Change, Route, Approve, Publish, Search, Retire information
Paper Doc.
© 2009 EMC Corporation. All rights reserved.
Virtualized & Automate Infrastructure
Users
Web serverApp serverERP Server
VMware Environment
SQL serverExchange Server
VMware Environment
Virtualized Storage and File System
Corporate Network (IP & FC)
VirtualTape
De-Duplication
Next Generation Backup
Archive
`
End to EndInfrastructure &
Service Management