Home > Technology > Se training storage grid webscale technical overview

Se training storage grid webscale technical overview

Date post: 24-Jan-2017
Category:
Author: solarisyougood
View: 320 times
Download: 1 times
Share this document with a friend
Embed Size (px)
of 52 /52
StorageGRID Webscale Technical Overview September 2014
Transcript

StorageGRID Webscale SE

StorageGRID Webscale Technical OverviewSeptember 2014

Student Guide & Workshop & Internal Training & Confidential Update Dailyhttps://goo.gl/VVmVZ05673-StorageGRID for Standard Enterprise Solutionhttp://ouo.io/SROnEVBuilding Private Clouds on E-Series The Swift Object Opportunityhttp://ouo.io/vWryaDeployment Guide - Acuo Universal Clinical Platform with NetApp StorageGRID Integration Architecturehttp://ouo.io/TowHxUDeployment Guide - StorageGRID E-Serieshttp://ouo.io/jhf1MGetting Started with StorageGRID CDMI Integrationshttp://ouo.io/dAORQjNetApp and the Object-Based Storage and Archiving Landscapehttp://ouo.io/wvoEgQNetApp Distributed Content Repositories What Are We Doing in Real Lifehttp://ouo.io/3y65MNetApp Portfolio Overviewhttp://ouo.io/O9aKQPOC - StorageGRID Webscale Proof of Concept Guidehttp://ouo.io/pzljIHQRG_StorageGRID Webscalehttp://ouo.io/3dqRrSE Training - StorageGRID Webscale Technical Overviewhttp://ouo.io/BqtKUUSolution Brief - NetApp StorageGRID Manage Large Pools of Mission-Critical Patient Data Across Healthcare Facilitieshttp://ouo.io/KTDFzStorageGRID Webscale 10.0 Installation Overviewhttp://ouo.io/hIYhJStorageGRID Webscale Nonstop Object Storage for Enterprise and Cloudhttp://ouo.io/dMRoOrStorageGRID Webscale Object-Enabled Data Managementhttp://ouo.io/nhuj4fTechnical Report - Integration Guide for NetApp StorageGRID with McKesson Horizon Medical Imaging PACShttp://ouo.io/9ncafj

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only2

AgendaWhat is Object StorageStorageGRID Webscale IntroductionTechnical OverviewKey ConceptsData flow

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only3

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only4Object storage growth driven by macro trendsUnstructured data continues to growNew sources of unstructured data growth media, mobile and applicationsData continues to be retained for long periods (archival, compliance etc)Unstructured data profile is changingSimultaneous access to the same data is rarely requiredMost data accessed a few times initially and then rarely accessedData access is changingGeographically dispersed accessApplications accessing data dont care about POSIX semantics, file locking Highly cost sensitive petabyte scale repositoriesDriving tradeoffs between $/GB, latency, throughput, and data protection

Storage being managed in a cloud ecosystemUnified management & orchestrationGrowth in cloud hosted applications that leverage object storage

Introducing StorageGRID WebscaleA new variant of StorageGRIDTarget object store (cloud, archive, media) use cases at massive scale

New FeaturesProtocols native support for S3 APIScalability 100 billion objects, 70 PBSimplicity modular scalable resilient architecture, simplified deploymentProven track record for reliability and innovation

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only5

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only6BlockFileObjectWhat is Object Storage?Different ways to address dataSpecific location on disks / memory TracksSectorsSpecific folder in fixed logical orderFile pathFile nameDateFlexible container sizeData and MetadataUnique ID

Object Storage Example: File vs. Object

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only7Object Based File Based ValetParking Garage

Daily Garage 1Floor 4Row NSpace 53/users/jsmith/car/garage1/floor4/rown/space53.fileC:\Users\jsmith\Garage1\Floor4\RowN\Space53.fileObject UID 317

Why Object storage?

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only8Massive ScaleBillions of objectsPetabytes of dataGlobal namespaceExplosive growthRespond to compliance and retention requirementsCost effectively meet SLAs with intelligent data placement

Access from anywhere Control access, security, and data integrity

Object storage used across different workloadsEmerging Object storage segments in the hybrid cloud

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only9Service Providers (XaaS)Media Repository (Media Redistribution)

Large object sizes (+250MB)Distributed repositoriesHigh data rates for redistributionTime to first byte latency < 50msSecure multi-tenancy (billing, isolation, authentication, & self-service)Control plane APIs and workflow automationNew apps requiring RESTful interfaces (S3, Swift)Web Data Repositories

Small object (~KB) performanceExtremely high transaction load Searchable, scalable metadataHigh object countsData Archives

Long access latency toleranceIntegration to tape / GlacierLong retention periodsErasure coding

Scale seamlesslyEase of install & managementGlobal namespaceCost

StorageGRID WebscaleObject-Enabled Data Management 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only10

NetApp StorageGRID WebscaleSoftware Defined Object StorageBillions of objects and multi-petabyteArchitected for massive scaleBuilt for the Hybrid CloudGlobal, Always-On data availability and durabilitySupport for cloud applications (S3, CDMI)10th Generation object store proven with product deployments

Dynamic Policy EngineIntelligent, policy driven data management for optimal availability, performance and cost over the life cycle of data

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only11

Introducing NetApps Content Repository solution for large, unstructured distributed data sets StorageGRID

NetApp StorageGRID was designed from the ground up to solve Big Content challenges. StorageGRID offers a large, global content repository for unstructured data, capable of storing billions of data sets and managing dozens of petabytes of capacity.

StorageGRID enables customers to create, manage and consume content globally, while providing complete location independence. With location independence you get all the benefits of local storage (meaning that you get the performance that you expect and access to all content you require) while providing a truly global namespace across hundreds of sites. Every site can be set up differently, from remote sites that simple have a virtual machine or smaller system that only stores small amounts of data locally, to large datacenters that retain a full set of all data sets. Remember though this is all managed automatically with policies, metadata information and intelligent caching, no administrator intervention required.

StorageGRID uses metadata-based management for data classification and access, meaning that StorageGRID manages where data is physically stored, how many copies exist (and where) for disaster recovery purposes, how long those copies are retained and when they are destroyed. Further, metadata-based access to your data means that instead of looking for a file name, you simple look for Mortgage documents, customer John Doe, account number 123456 greatly simplifying how your applications interact with your storage.

11

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only12StorageGRID

Platform for Distributed Content RepositoriesStorageGRID Object Storage Software + E-Series Storage Array

MULTIPLE: APPLICATIONS + SITES + PROTOCOLSMULTIPLE: TARGETS + TIERSMULTIPLE: TENANTS + POLICIES + ADMINISTRATORSSite 1Site 2 Site NSite 3APPLICATIONSAPPLICATIONSAPPLICATIONSAPPLICATIONNetApp E-SeriesTape

StorageGRID Webscale features 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only13Hardware Obsolescence Protection

Non-DisruptiveOperationsObject Integrity and SecurityMulti-TenancyGlobal ObjectNamespace

Services AutomationManageabilityReliabilityScalabilityILM, Metadata driven policiesSeamlessScaling

Audit & Reporting

Technical OverviewStorageGRID Webscale 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only14

Solution Topology 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only15

Clients can access global object namespace via Gateway Nodes or Storage Nodes

E2760

DE6600AdministratorsREST API clients10GbE LANPer Node Resource RequirementVMDK(GB)vCPURAM (GB)100824100824300824

Storage

Gateway

Admin

DE6600

E2760

DE6600

DE6600

StorageGRID Webscale nodes running in VMware hostsS3, CDMIHTTPS16Gb FC switch

WAN RouterE2760 block-based storageSSDs for read cache and VM datastoresMixed disk types (SSD, SAS, NL-SAS) for tiered storage pools

The CPU and RAM requirements are for production deployments. Lab \ Test configurations can be smaller:Admin 2CPU, 4GB RAMGateway 2CPU 4GB RAMStorage 2CPU, 6GB RAM

Storage recommendations:Thebest practice and recommendedstorage for StorageGRID Webscale is as follows:NetApp E-Series Storage System with capacity optimized NL-SAS in DDPSSDs for Storage Node system disks (100GB per node)SSDs for all Admin Node disks (about 300GB per node)

15

StorageGRID Architecture 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only16App1App2

Admin NodesManagement Services: Configuration, Monitoring, Audit and LoggingStorage NodesManages object storage including replicationAPI Gateway NodesLoad balancing interface through which applications connect to the systemArchive NodesInterface to archive media storage such as tapeDATA CENTER 1

DC2

DC3Design the grid to scale for performance, capacity & resiliency

STORAGE

STORAGE

ADMINARCHIVE

API LOADBALANCER

Use NetApp StorageGRID Webscale Designer to architect the grid to your requirementsQuickly deploy the grid via NetApp StorageGRID Webscale InstallerAdapt your grid to changing requirementsAdd sitesAdd nodes and capacitySupport rolling upgradesCentralized Deployment & Rolling UpgradesDesign, deploy, and maintain configuration control 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only17

DC1

DC2

DC4

DC3

Data Management Key Concepts

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only18

Client Connections

StoragePools

Metadata

Object Identifier

ILMPolicy

Client Connections, Object Identifiers & Metadata

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only19

ClientCDMIS3Read/Write

StorageGRID Webscale System HTTPs

Client Connections

Object Identifier

Object type: JPGDate modified: 07/21/ 2014GPS Coordinates: Lat, LongLocation: DC @ Seattle

Metadata

Scale beyond traditional application metadata to enable new capabilitiesFlexibility: Metadata is application definedUp to 4096 fields can be created as requirements change (CDMI)No lock-in to predefined schemaILM engine evaluates the metadata and applies policies S3 metadata available to the policy engine

Extensive metadata management

Metadata Why it matters ?Metadata is distributed throughout the gridIncreased scalability and resiliencyFaster retrieval and efficient ILM policy evaluation 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only20Object Identifier:00006FFD00192A1200555FFEE12039468EBF622D9402C4F962

Locations:Location 1: Data Center 1/DC1-S1/LDRLocation 2: Data Center 2/DC2-S3/LDR

Metadata CDMI/CVTE: 0CDMI/META: {"application":"finance","doctype":"contract", "project":"45667}

Extensive metadata managementMetadata ManagementMetadata Includes Object size, User Metadata, Bucket Name, Account IdCDMI supports a large number of user metadata fields (up to 4096)Policy supportMetadata can be used in polices (CDMI & S3)No pre-configuration required for metadata, immediately available for policy use

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only21

Data Management Key Concepts

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only22

Client Connections

StoragePools

Metadata

Object Identifier

ILMPolicy

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only23

Sites(Link Cost Groups)Storage PoolsStorageGRID Data Management

StoragePoolsStorage GradeTAPEStorage GradeSASStorage GradeFLASHMunich, GermanySan Francisco, USAStorage Node T1Storage Node T2Tape Storage PoolEU Storage PoolStorage Node S1Storage Node F1Storage Node F2Fast Storage PoolVancouver, CAStorage Node T3Storage Node F3Simple configuration of SLO based storage across multiple sites and storage grades

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only24Customer Defined Storage gradesStorageGRID Data Management

Associating Storage Nodes with Storage GradesCreating Storage Grades

Customers can configure their own Storage Grades, and associate them with specific nodes

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only25

Site configuration Not just a siteStorageGRID Data ManagementCustomers can model network costs, creating powerful configurationsPossible ExamplesConfigure two sites to be treated as one, Specify which sites are connected by high b/w pipes & low b/w pipes, Prefer traffic to go in a particular direction

Data Management Key Concepts

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only26

Client Connections

StoragePools

Metadata

Object Identifier

ILMPolicy

Dynamic Policy Engine An OverviewData management key concepts 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only27Manage policies not objectsEvaluate objects based on metadata such as:Custom user \ application metadataMethod of ingest (S3 or CDMI)Size of objectLast access time

Apply ILM rules to set:Geography Placement of an objectStorage grade Type of storage used to store an objectReplication Number of copies storedRetention Set time during which an object cannot be purged

App1Site1Site2Site3

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only28Policy ManagementData Management Key Concepts

ILMPolicy

WhenIf..then..At IngestObjects at rest (already ingested)After a read (enabling caching)Metadata matches specific criteriaMove/Copy to one or more Storage Pools

ILM Rule Specify the ConditionsData Management Key Concepts

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only29

WhenIf..Customers designate when a rule will be applied and what conditions must be met to trigger the action for placement and retention.

ILM Rule Defining data placement and retention 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only30

then..Simple configuration of complex placement rules, that are graphically displayed for easy understandingData Management Key Concepts

Combine ILM rules 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only31Data Management Key Concepts

Highly complex business logic can be implemented by combining multiple rules into a policy

ILM Policy Example

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only32Application writes to grid via S3Metadata is evaluated

DC1

DC2

DC3

Store objects with S3 metadata Bucket Name = ClientX on ingest at DC1 on SSD and DC2 on SATA for 90 daysAfter 90 days store on DC1 on SATA and DC3 on Tape1 x copy DC1\SSD1 x copyDC2\SATA90 Days later.

1 x copy DC1\SATA1 x copyDC3\Tape

BucketName =ClientX

ILM Policy Example

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only33

DC1

DC2

DC3

What if requirements change?A new Data Center is brought on line and now we must store a copy at DC4We can edit the policy and apply not just to new data, but re-evaluate existing data and create new replicas as neededWithout impact to the performance of the grid

DC4

Create ILM Rules to set customized service levelsGOLD = 2 x copies on SSD, 1 x copy on SATASILVER = 1 x copy on SAS, 2 x SATABRONZE = 1 x copy on SATA, 1 x copy on tapeSet custom service levels via metadataSLA Example

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only34{"sla":"gold"}

DC1

DC2

DC3

Built-in Object Integrity Verification and Self HealingDurability at the object levelDigital fingerprint is calculated per object upon ingestInterlocking layers of object-wide and sub-object level integrity protectionObject hash value Content hash valueCRC checksumHMAC message authentication digest

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only35

ID: 41E85A1DDataMetadataDataMetadataFingerprint

Built-in Object Integrity Verification and Self HealingDurability at the object levelContinuous verification: on ingest, retrieval, replication, migration and at rest Object failing integrity test is automatically replaced with another copy

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only36

Real time audit feed can be used to create custom reportsAudit logs created in open format allows the use of partner products such as SplunkComprehensive audit feed for:Chargeback and billingSearch integrationCustom reportingSecurity diagnosticsCompliance eventsValidate performance for SLAsContinuous and active monitoring Audit and Reporting

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only37

System Performance AnalysisPerform deep performance analysis to fine tune your infrastructure

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only38

Analyze system performance and activity at every levelGrid Wide

System Performance AnalysisPerform deep performance analysis to fine tune your infrastructure

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only39

Analyze system performance and activity at every levelGrid WideSite Specific

System Performance AnalysisPerform deep performance analysis to fine tune your infrastructure

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only40

Analyze system performance and activity at every levelGrid WideSite SpecificService Level

Data FlowUnderstanding how data flows through a StorageGRID Webscale system for different operations 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only41

Data flow overviewBasic topology and business rules 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only

SalesCapacityStorage PoolFinancePerformanceStorage PoolMarketingPerformanceStorage Pool

FinanceCapacityStorage PoolSalesPerformanceStorage PoolMarketingCapacityStorage Pool

Local Storage Pool

WAN

Satellite Office42Three departmentsSalesFinanceMarketingTwo pools per departmentCapacity poolPerformance poolThree sitesTwo datacenter sitesOne remote officePolicy - FinanceIngest into local poolAfter 1 day create copy in performance poolAfter 30 days move copy to capacity poolData Center 1Data Center 2

SalesCapacityStorage PoolFinancePerformanceStorage PoolMarketingPerformanceStorage Pool

FinanceCapacityStorage PoolSalesPerformanceStorage PoolMarketingCapacityStorage Pool

Local Storage Pool

WAN

Satellite OfficeData Center 1Data Center 2Object Ingest and ReplicationTransmitting objects from client to StorageGRID Webscale

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only

Receives object write request from client along with custom metadataReturns object ID to client and forms the file payload into an objectPacketizationDigital fingerprintCompression (optional)Encryption (optional)

430x05DFF4338ADCE6F5

Object Ingest and ReplicationTransmitting objects from client to StorageGRID Webscale 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only

SalesCapacityStorage PoolFinancePerformanceStorage PoolMarketingPerformanceStorage Pool

FinanceCapacityStorage PoolSalesPerformanceStorage PoolMarketingCapacityStorage Pool

LocalStorage Pool

WAN

Satellite Office

44By default, creates semi-sync local copy for immediate redundancyMetadata is stored and replicatedReplicate object as per ILM policyFinance Policy ExampleIngest into local poolAfter 1 day create copy in performance poolAfter 30 days move copy to capacity pool

Data Center 1Data Center 2

SalesCapacityStorage PoolFinancePerformanceStorage PoolMarketingPerformanceStorage Pool

FinanceCapacityStorage PoolSalesPerformanceStorage PoolMarketingCapacityStorage Pool

Local Storage Pool

WAN

Satellite OfficeData Center 1Data Center 2Object ReplicationMetadata-driven ILM with optimal resource utilization

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use OnlyOptimal resources in target storage pools are selected for the replication destinationWhile honoring ILM the grid considersNetwork costsServer utilizationStorage utilization

45

SalesCapacityStorage PoolFinancePerformanceStorage PoolMarketingPerformanceStorage Pool

FinanceCapacityStorage PoolSalesPerformanceStorage PoolMarketingCapacityStorage Pool

LocalStorage Pool

WAN

Satellite OfficeData Center 1Data Center 2Object RetrievalRequesting objects by the client from StorageGRID Webscale 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only

Receives object read request from client

Determines optimal object location relative to request location

Streams a copy of the object to the client and verifies integrity of the object on-the-fly

46

Thank You 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only47

Management ServicesAdmin NodeNMS Network Management System: Provides the administrative interface for configuration and monitoring of the grid. CMN Configuration Management Node: Manages system-wide configurations such as connection profiles, grid tasks, and system configuration options.AMS - Audit Management System: Keeps logs of system activity and events.

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only48

Object, Metadata Storage and ReplicationStorage NodesLDR - Local Distribution Router: Stores, moves, verifies, and retrieves object data stored on disksDDS - Distributed Data Store: Stores, replicates, and protects metadata in the key value store. 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only49

Object, Metadata Storage and ReplicationStorage Nodes - ContinuedCMS - Content Management System: Manages object placement and replication based on ILM rules.ADC - Administrative Domain Controller: Maintains topology information and provides authentication services. 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only50

Load Balancing and Client ConnectivityAPI Gateway NodesCLB - Connection Load Balancer: Acts as switchboard for connecting clients to the most efficient LDR service for ingest and retrieval.Ports8081 CDMI8082 S3

2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only51

Data storage on archive mediaArchive NodesARC Archive: Communicates with archiving middleware to store and retrieve data to and from archive media such as tape. 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only52


Recommended