Date post: | 24-Jan-2017 |
Category: |
Technology |
Author: | solarisyougood |
View: | 320 times |
Download: | 1 times |
StorageGRID Webscale SE
StorageGRID Webscale Technical OverviewSeptember 2014
Student Guide & Workshop & Internal Training & Confidential Update Dailyhttps://goo.gl/VVmVZ05673-StorageGRID for Standard Enterprise Solutionhttp://ouo.io/SROnEVBuilding Private Clouds on E-Series The Swift Object Opportunityhttp://ouo.io/vWryaDeployment Guide - Acuo Universal Clinical Platform with NetApp StorageGRID Integration Architecturehttp://ouo.io/TowHxUDeployment Guide - StorageGRID E-Serieshttp://ouo.io/jhf1MGetting Started with StorageGRID CDMI Integrationshttp://ouo.io/dAORQjNetApp and the Object-Based Storage and Archiving Landscapehttp://ouo.io/wvoEgQNetApp Distributed Content Repositories What Are We Doing in Real Lifehttp://ouo.io/3y65MNetApp Portfolio Overviewhttp://ouo.io/O9aKQPOC - StorageGRID Webscale Proof of Concept Guidehttp://ouo.io/pzljIHQRG_StorageGRID Webscalehttp://ouo.io/3dqRrSE Training - StorageGRID Webscale Technical Overviewhttp://ouo.io/BqtKUUSolution Brief - NetApp StorageGRID Manage Large Pools of Mission-Critical Patient Data Across Healthcare Facilitieshttp://ouo.io/KTDFzStorageGRID Webscale 10.0 Installation Overviewhttp://ouo.io/hIYhJStorageGRID Webscale Nonstop Object Storage for Enterprise and Cloudhttp://ouo.io/dMRoOrStorageGRID Webscale Object-Enabled Data Managementhttp://ouo.io/nhuj4fTechnical Report - Integration Guide for NetApp StorageGRID with McKesson Horizon Medical Imaging PACShttp://ouo.io/9ncafj
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only2
AgendaWhat is Object StorageStorageGRID Webscale IntroductionTechnical OverviewKey ConceptsData flow
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only3
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only4Object storage growth driven by macro trendsUnstructured data continues to growNew sources of unstructured data growth media, mobile and applicationsData continues to be retained for long periods (archival, compliance etc)Unstructured data profile is changingSimultaneous access to the same data is rarely requiredMost data accessed a few times initially and then rarely accessedData access is changingGeographically dispersed accessApplications accessing data dont care about POSIX semantics, file locking Highly cost sensitive petabyte scale repositoriesDriving tradeoffs between $/GB, latency, throughput, and data protection
Storage being managed in a cloud ecosystemUnified management & orchestrationGrowth in cloud hosted applications that leverage object storage
Introducing StorageGRID WebscaleA new variant of StorageGRIDTarget object store (cloud, archive, media) use cases at massive scale
New FeaturesProtocols native support for S3 APIScalability 100 billion objects, 70 PBSimplicity modular scalable resilient architecture, simplified deploymentProven track record for reliability and innovation
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only5
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only6BlockFileObjectWhat is Object Storage?Different ways to address dataSpecific location on disks / memory TracksSectorsSpecific folder in fixed logical orderFile pathFile nameDateFlexible container sizeData and MetadataUnique ID
Object Storage Example: File vs. Object
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only7Object Based File Based ValetParking Garage
Daily Garage 1Floor 4Row NSpace 53/users/jsmith/car/garage1/floor4/rown/space53.fileC:\Users\jsmith\Garage1\Floor4\RowN\Space53.fileObject UID 317
Why Object storage?
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only8Massive ScaleBillions of objectsPetabytes of dataGlobal namespaceExplosive growthRespond to compliance and retention requirementsCost effectively meet SLAs with intelligent data placement
Access from anywhere Control access, security, and data integrity
Object storage used across different workloadsEmerging Object storage segments in the hybrid cloud
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only9Service Providers (XaaS)Media Repository (Media Redistribution)
Large object sizes (+250MB)Distributed repositoriesHigh data rates for redistributionTime to first byte latency < 50msSecure multi-tenancy (billing, isolation, authentication, & self-service)Control plane APIs and workflow automationNew apps requiring RESTful interfaces (S3, Swift)Web Data Repositories
Small object (~KB) performanceExtremely high transaction load Searchable, scalable metadataHigh object countsData Archives
Long access latency toleranceIntegration to tape / GlacierLong retention periodsErasure coding
Scale seamlesslyEase of install & managementGlobal namespaceCost
StorageGRID WebscaleObject-Enabled Data Management 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only10
NetApp StorageGRID WebscaleSoftware Defined Object StorageBillions of objects and multi-petabyteArchitected for massive scaleBuilt for the Hybrid CloudGlobal, Always-On data availability and durabilitySupport for cloud applications (S3, CDMI)10th Generation object store proven with product deployments
Dynamic Policy EngineIntelligent, policy driven data management for optimal availability, performance and cost over the life cycle of data
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only11
Introducing NetApps Content Repository solution for large, unstructured distributed data sets StorageGRID
NetApp StorageGRID was designed from the ground up to solve Big Content challenges. StorageGRID offers a large, global content repository for unstructured data, capable of storing billions of data sets and managing dozens of petabytes of capacity.
StorageGRID enables customers to create, manage and consume content globally, while providing complete location independence. With location independence you get all the benefits of local storage (meaning that you get the performance that you expect and access to all content you require) while providing a truly global namespace across hundreds of sites. Every site can be set up differently, from remote sites that simple have a virtual machine or smaller system that only stores small amounts of data locally, to large datacenters that retain a full set of all data sets. Remember though this is all managed automatically with policies, metadata information and intelligent caching, no administrator intervention required.
StorageGRID uses metadata-based management for data classification and access, meaning that StorageGRID manages where data is physically stored, how many copies exist (and where) for disaster recovery purposes, how long those copies are retained and when they are destroyed. Further, metadata-based access to your data means that instead of looking for a file name, you simple look for Mortgage documents, customer John Doe, account number 123456 greatly simplifying how your applications interact with your storage.
11
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only12StorageGRID
Platform for Distributed Content RepositoriesStorageGRID Object Storage Software + E-Series Storage Array
MULTIPLE: APPLICATIONS + SITES + PROTOCOLSMULTIPLE: TARGETS + TIERSMULTIPLE: TENANTS + POLICIES + ADMINISTRATORSSite 1Site 2 Site NSite 3APPLICATIONSAPPLICATIONSAPPLICATIONSAPPLICATIONNetApp E-SeriesTape
StorageGRID Webscale features 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only13Hardware Obsolescence Protection
Non-DisruptiveOperationsObject Integrity and SecurityMulti-TenancyGlobal ObjectNamespace
Services AutomationManageabilityReliabilityScalabilityILM, Metadata driven policiesSeamlessScaling
Audit & Reporting
Technical OverviewStorageGRID Webscale 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only14
Solution Topology 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only15
Clients can access global object namespace via Gateway Nodes or Storage Nodes
E2760
DE6600AdministratorsREST API clients10GbE LANPer Node Resource RequirementVMDK(GB)vCPURAM (GB)100824100824300824
Storage
Gateway
Admin
DE6600
E2760
DE6600
DE6600
StorageGRID Webscale nodes running in VMware hostsS3, CDMIHTTPS16Gb FC switch
WAN RouterE2760 block-based storageSSDs for read cache and VM datastoresMixed disk types (SSD, SAS, NL-SAS) for tiered storage pools
The CPU and RAM requirements are for production deployments. Lab \ Test configurations can be smaller:Admin 2CPU, 4GB RAMGateway 2CPU 4GB RAMStorage 2CPU, 6GB RAM
Storage recommendations:Thebest practice and recommendedstorage for StorageGRID Webscale is as follows:NetApp E-Series Storage System with capacity optimized NL-SAS in DDPSSDs for Storage Node system disks (100GB per node)SSDs for all Admin Node disks (about 300GB per node)
15
StorageGRID Architecture 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only16App1App2
Admin NodesManagement Services: Configuration, Monitoring, Audit and LoggingStorage NodesManages object storage including replicationAPI Gateway NodesLoad balancing interface through which applications connect to the systemArchive NodesInterface to archive media storage such as tapeDATA CENTER 1
DC2
DC3Design the grid to scale for performance, capacity & resiliency
STORAGE
STORAGE
ADMINARCHIVE
API LOADBALANCER
Use NetApp StorageGRID Webscale Designer to architect the grid to your requirementsQuickly deploy the grid via NetApp StorageGRID Webscale InstallerAdapt your grid to changing requirementsAdd sitesAdd nodes and capacitySupport rolling upgradesCentralized Deployment & Rolling UpgradesDesign, deploy, and maintain configuration control 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only17
DC1
DC2
DC4
DC3
Data Management Key Concepts
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only18
Client Connections
StoragePools
Metadata
Object Identifier
ILMPolicy
Client Connections, Object Identifiers & Metadata
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only19
ClientCDMIS3Read/Write
StorageGRID Webscale System HTTPs
Client Connections
Object Identifier
Object type: JPGDate modified: 07/21/ 2014GPS Coordinates: Lat, LongLocation: DC @ Seattle
Metadata
Scale beyond traditional application metadata to enable new capabilitiesFlexibility: Metadata is application definedUp to 4096 fields can be created as requirements change (CDMI)No lock-in to predefined schemaILM engine evaluates the metadata and applies policies S3 metadata available to the policy engine
Extensive metadata management
Metadata Why it matters ?Metadata is distributed throughout the gridIncreased scalability and resiliencyFaster retrieval and efficient ILM policy evaluation 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only20Object Identifier:00006FFD00192A1200555FFEE12039468EBF622D9402C4F962
Locations:Location 1: Data Center 1/DC1-S1/LDRLocation 2: Data Center 2/DC2-S3/LDR
Metadata CDMI/CVTE: 0CDMI/META: {"application":"finance","doctype":"contract", "project":"45667}
Extensive metadata managementMetadata ManagementMetadata Includes Object size, User Metadata, Bucket Name, Account IdCDMI supports a large number of user metadata fields (up to 4096)Policy supportMetadata can be used in polices (CDMI & S3)No pre-configuration required for metadata, immediately available for policy use
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only21
Data Management Key Concepts
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only22
Client Connections
StoragePools
Metadata
Object Identifier
ILMPolicy
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only23
Sites(Link Cost Groups)Storage PoolsStorageGRID Data Management
StoragePoolsStorage GradeTAPEStorage GradeSASStorage GradeFLASHMunich, GermanySan Francisco, USAStorage Node T1Storage Node T2Tape Storage PoolEU Storage PoolStorage Node S1Storage Node F1Storage Node F2Fast Storage PoolVancouver, CAStorage Node T3Storage Node F3Simple configuration of SLO based storage across multiple sites and storage grades
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only24Customer Defined Storage gradesStorageGRID Data Management
Associating Storage Nodes with Storage GradesCreating Storage Grades
Customers can configure their own Storage Grades, and associate them with specific nodes
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only25
Site configuration Not just a siteStorageGRID Data ManagementCustomers can model network costs, creating powerful configurationsPossible ExamplesConfigure two sites to be treated as one, Specify which sites are connected by high b/w pipes & low b/w pipes, Prefer traffic to go in a particular direction
Data Management Key Concepts
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only26
Client Connections
StoragePools
Metadata
Object Identifier
ILMPolicy
Dynamic Policy Engine An OverviewData management key concepts 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only27Manage policies not objectsEvaluate objects based on metadata such as:Custom user \ application metadataMethod of ingest (S3 or CDMI)Size of objectLast access time
Apply ILM rules to set:Geography Placement of an objectStorage grade Type of storage used to store an objectReplication Number of copies storedRetention Set time during which an object cannot be purged
App1Site1Site2Site3
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only28Policy ManagementData Management Key Concepts
ILMPolicy
WhenIf..then..At IngestObjects at rest (already ingested)After a read (enabling caching)Metadata matches specific criteriaMove/Copy to one or more Storage Pools
ILM Rule Specify the ConditionsData Management Key Concepts
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only29
WhenIf..Customers designate when a rule will be applied and what conditions must be met to trigger the action for placement and retention.
ILM Rule Defining data placement and retention 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only30
then..Simple configuration of complex placement rules, that are graphically displayed for easy understandingData Management Key Concepts
Combine ILM rules 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only31Data Management Key Concepts
Highly complex business logic can be implemented by combining multiple rules into a policy
ILM Policy Example
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only32Application writes to grid via S3Metadata is evaluated
DC1
DC2
DC3
Store objects with S3 metadata Bucket Name = ClientX on ingest at DC1 on SSD and DC2 on SATA for 90 daysAfter 90 days store on DC1 on SATA and DC3 on Tape1 x copy DC1\SSD1 x copyDC2\SATA90 Days later.
1 x copy DC1\SATA1 x copyDC3\Tape
BucketName =ClientX
ILM Policy Example
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only33
DC1
DC2
DC3
What if requirements change?A new Data Center is brought on line and now we must store a copy at DC4We can edit the policy and apply not just to new data, but re-evaluate existing data and create new replicas as neededWithout impact to the performance of the grid
DC4
Create ILM Rules to set customized service levelsGOLD = 2 x copies on SSD, 1 x copy on SATASILVER = 1 x copy on SAS, 2 x SATABRONZE = 1 x copy on SATA, 1 x copy on tapeSet custom service levels via metadataSLA Example
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only34{"sla":"gold"}
DC1
DC2
DC3
Built-in Object Integrity Verification and Self HealingDurability at the object levelDigital fingerprint is calculated per object upon ingestInterlocking layers of object-wide and sub-object level integrity protectionObject hash value Content hash valueCRC checksumHMAC message authentication digest
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only35
ID: 41E85A1DDataMetadataDataMetadataFingerprint
Built-in Object Integrity Verification and Self HealingDurability at the object levelContinuous verification: on ingest, retrieval, replication, migration and at rest Object failing integrity test is automatically replaced with another copy
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only36
Real time audit feed can be used to create custom reportsAudit logs created in open format allows the use of partner products such as SplunkComprehensive audit feed for:Chargeback and billingSearch integrationCustom reportingSecurity diagnosticsCompliance eventsValidate performance for SLAsContinuous and active monitoring Audit and Reporting
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only37
System Performance AnalysisPerform deep performance analysis to fine tune your infrastructure
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only38
Analyze system performance and activity at every levelGrid Wide
System Performance AnalysisPerform deep performance analysis to fine tune your infrastructure
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only39
Analyze system performance and activity at every levelGrid WideSite Specific
System Performance AnalysisPerform deep performance analysis to fine tune your infrastructure
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only40
Analyze system performance and activity at every levelGrid WideSite SpecificService Level
Data FlowUnderstanding how data flows through a StorageGRID Webscale system for different operations 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only41
Data flow overviewBasic topology and business rules 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only
SalesCapacityStorage PoolFinancePerformanceStorage PoolMarketingPerformanceStorage Pool
FinanceCapacityStorage PoolSalesPerformanceStorage PoolMarketingCapacityStorage Pool
Local Storage Pool
WAN
Satellite Office42Three departmentsSalesFinanceMarketingTwo pools per departmentCapacity poolPerformance poolThree sitesTwo datacenter sitesOne remote officePolicy - FinanceIngest into local poolAfter 1 day create copy in performance poolAfter 30 days move copy to capacity poolData Center 1Data Center 2
SalesCapacityStorage PoolFinancePerformanceStorage PoolMarketingPerformanceStorage Pool
FinanceCapacityStorage PoolSalesPerformanceStorage PoolMarketingCapacityStorage Pool
Local Storage Pool
WAN
Satellite OfficeData Center 1Data Center 2Object Ingest and ReplicationTransmitting objects from client to StorageGRID Webscale
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only
Receives object write request from client along with custom metadataReturns object ID to client and forms the file payload into an objectPacketizationDigital fingerprintCompression (optional)Encryption (optional)
430x05DFF4338ADCE6F5
Object Ingest and ReplicationTransmitting objects from client to StorageGRID Webscale 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only
SalesCapacityStorage PoolFinancePerformanceStorage PoolMarketingPerformanceStorage Pool
FinanceCapacityStorage PoolSalesPerformanceStorage PoolMarketingCapacityStorage Pool
LocalStorage Pool
WAN
Satellite Office
44By default, creates semi-sync local copy for immediate redundancyMetadata is stored and replicatedReplicate object as per ILM policyFinance Policy ExampleIngest into local poolAfter 1 day create copy in performance poolAfter 30 days move copy to capacity pool
Data Center 1Data Center 2
SalesCapacityStorage PoolFinancePerformanceStorage PoolMarketingPerformanceStorage Pool
FinanceCapacityStorage PoolSalesPerformanceStorage PoolMarketingCapacityStorage Pool
Local Storage Pool
WAN
Satellite OfficeData Center 1Data Center 2Object ReplicationMetadata-driven ILM with optimal resource utilization
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use OnlyOptimal resources in target storage pools are selected for the replication destinationWhile honoring ILM the grid considersNetwork costsServer utilizationStorage utilization
45
SalesCapacityStorage PoolFinancePerformanceStorage PoolMarketingPerformanceStorage Pool
FinanceCapacityStorage PoolSalesPerformanceStorage PoolMarketingCapacityStorage Pool
LocalStorage Pool
WAN
Satellite OfficeData Center 1Data Center 2Object RetrievalRequesting objects by the client from StorageGRID Webscale 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only
Receives object read request from client
Determines optimal object location relative to request location
Streams a copy of the object to the client and verifies integrity of the object on-the-fly
46
Thank You 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only47
Management ServicesAdmin NodeNMS Network Management System: Provides the administrative interface for configuration and monitoring of the grid. CMN Configuration Management Node: Manages system-wide configurations such as connection profiles, grid tasks, and system configuration options.AMS - Audit Management System: Keeps logs of system activity and events.
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only48
Object, Metadata Storage and ReplicationStorage NodesLDR - Local Distribution Router: Stores, moves, verifies, and retrieves object data stored on disksDDS - Distributed Data Store: Stores, replicates, and protects metadata in the key value store. 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only49
Object, Metadata Storage and ReplicationStorage Nodes - ContinuedCMS - Content Management System: Manages object placement and replication based on ILM rules.ADC - Administrative Domain Controller: Maintains topology information and provides authentication services. 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only50
Load Balancing and Client ConnectivityAPI Gateway NodesCLB - Connection Load Balancer: Acts as switchboard for connecting clients to the most efficient LDR service for ingest and retrieval.Ports8081 CDMI8082 S3
2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only51
Data storage on archive mediaArchive NodesARC Archive: Communicates with archiving middleware to store and retrieve data to and from archive media such as tape. 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only52