+ All Categories
Home > Business > Big Data on AWS - Toronto FSI Symposium - October 2016

Big Data on AWS - Toronto FSI Symposium - October 2016

Date post: 13-Jan-2017
Category:
Upload: amazon-web-services
View: 553 times
Download: 2 times
Share this document with a friend
40
Shawn Gandhi Head of Solutions Architecture AWS Canada @shawnagram Big Data on AWS
Transcript
Page 1: Big Data on AWS - Toronto FSI Symposium - October 2016

Shawn GandhiHead of Solutions Architecture

AWS Canada@shawnagram

Big Data on AWS

Page 2: Big Data on AWS - Toronto FSI Symposium - October 2016

Generated data

Available for analysis

Data volume

Gartner: User Survey Analysis: Key Trends Shaping the Future of Data Center Infrastructure Through 2011

IDC: Worldwide Business Analytics Software 2012–2016 Forecast and 2011 Vendor Shares

Page 3: Big Data on AWS - Toronto FSI Symposium - October 2016

Abraham Wald (1902-1950)

Page 4: Big Data on AWS - Toronto FSI Symposium - October 2016
Page 5: Big Data on AWS - Toronto FSI Symposium - October 2016

Data is part of the fabric of the applications

Front-end and UX Mobile Back-end and operations

Data and analytics

Page 6: Big Data on AWS - Toronto FSI Symposium - October 2016
Page 7: Big Data on AWS - Toronto FSI Symposium - October 2016

What is AWS?

AWS Global Infrastructure

Application Services

Networking

Deployment & Administration

DatabaseStorageCompute

Page 8: Big Data on AWS - Toronto FSI Symposium - October 2016

ENTERPRISE APPS

DEVELOPMENT & OPERATIONSMOBILE SERVICESAPP SERVICESANALYTICS

DataWarehousing

Hadoop/Spark

Streaming Data Collection

Machine Learning

Elastic Search

Virtual Desktops

Sharing & Collaboration

Corporate Email

Backup

Queuing & Notifications

Workflow

Search

Email

Transcoding

One-click App Deployment

Identity

Sync

Single Integrated Console

PushNotifications

DevOps Resource Management

Application Lifecycle Management

Containers

Triggers

Resource Templates

TECHNICAL & BUSINESS SUPPORT

Account Management

Support

Professional Services

Training & Certification

Security & Pricing Reports

Partner Ecosystem

Solutions Architects

MARKETPLACE

Business Apps

Business Intelligence

DatabasesDevOps Tools

NetworkingSecurity Storage

RegionsAvailability Zones

Points of Presence

INFRASTRUCTURE

CORE SERVICES

ComputeVMs, Auto-scaling, & Load Balancing

StorageObject, Blocks, Archival, Import/Export

DatabasesRelational, NoSQL, Caching, Migration

NetworkingVPC, DX, DNS

CDN

Access Control

Identity Management

Key Management & Storage

Monitoring & Logs

Assessment and reporting

Resource & Usage Auditing

SECURITY & COMPLIANCE

Configuration Compliance

Web application firewall

HYBRIDARCHITECTURE

Data Backups

Integrated App Deployments

DirectConnect

IdentityFederation

IntegratedResource Management

Integrated Networking

API Gateway

IoT

Rules Engine

Device Shadows

Device SDKs

Registry

Device Gateway

Streaming Data Analysis

Business Intelligence

MobileAnalytics

Page 9: Big Data on AWS - Toronto FSI Symposium - October 2016

Three types of data-driven development

Retrospectiveanalysis and

reporting

Amazon RedshiftAmazon RDS Amazon S3

Amazon EMR

Page 10: Big Data on AWS - Toronto FSI Symposium - October 2016

Three types of data-driven development

Retrospectiveanalysis and

reporting

Here-and-nowreal-time processing

and dashboards

Amazon Kinesis Amazon EC2 AWS Lambda

Amazon Redshift, Amazon RDS Amazon S3

Amazon EMR

Page 11: Big Data on AWS - Toronto FSI Symposium - October 2016

Three types of data-driven development

Retrospectiveanalysis and

reporting

Here-and-nowreal-time processing

and dashboards

Predictionsto enable smart

applications

Amazon Kinesis Amazon EC2 AWS Lambda

Amazon Redshift, Amazon RDS Amazon S3

Amazon EMR

Page 12: Big Data on AWS - Toronto FSI Symposium - October 2016
Page 13: Big Data on AWS - Toronto FSI Symposium - October 2016

Global Footprint

Page 14: Big Data on AWS - Toronto FSI Symposium - October 2016

AZ

AZ

AZ AZ AZ

What is a Region?• Each datacenter has a purpose built

network

Page 15: Big Data on AWS - Toronto FSI Symposium - October 2016

AZ

AZ

AZ AZ AZ

What is a Region?

• Metro-area DWDM links between AZs

• AZs <2ms apart & usually <1ms

• Each datacenter has a purpose built

network

Page 16: Big Data on AWS - Toronto FSI Symposium - October 2016

Big Data Pipeline

Data AnswersCollect Process Analyze

Store

Page 17: Big Data on AWS - Toronto FSI Symposium - October 2016

Primitive Patterns

Collect Process Analyze

Store

Data Collectionand Storage

Data

Processing

EventProcessing

Data Analysis

Page 18: Big Data on AWS - Toronto FSI Symposium - October 2016

One tool to

rule them all

Page 19: Big Data on AWS - Toronto FSI Symposium - October 2016

Collect Process Analyze

Store

Data Collectionand Storage

Data

Processing

Data Analysis

EventProcessing

Primitive Patterns

S3

Kinesis

DynamoDB

RDS (Aurora)

AWS Lambda

KCL AppsEMR Redshift

MachineLearning

Page 20: Big Data on AWS - Toronto FSI Symposium - October 2016

Collect Process Analyze

Store

Data Collectionand Storage

Primitive Patterns

S3

Kinesis

DynamoDB

RDS (Aurora)

Page 21: Big Data on AWS - Toronto FSI Symposium - October 2016

Data Collection and Storage

File

Stream

Transactional

Ap

ps

Logg

ing

Fram

ewo

rks

Page 22: Big Data on AWS - Toronto FSI Symposium - October 2016

AWS Services – Data Collection and Storage

Page 23: Big Data on AWS - Toronto FSI Symposium - October 2016
Page 24: Big Data on AWS - Toronto FSI Symposium - October 2016

S3$0.030/GB-Mo

RedshiftStarts at $0.25/hour

EC2Starts at $0.02/hour

Glacier$0.010/GB-Mo

Kinesis$0.015/shard 1MB/s in; 2MB/out$0.028/million puts

Page 25: Big Data on AWS - Toronto FSI Symposium - October 2016

Collect Process Analyze

Store

EventProcessing

Primitive Patterns

AWS Lambda

KCL Apps

Page 26: Big Data on AWS - Toronto FSI Symposium - October 2016

Event Processing – Enabling Capabilities

AWS Lambda

KCL Apps

Page 27: Big Data on AWS - Toronto FSI Symposium - October 2016

Primitive Patterns

Collect Process Analyze

Store

Data Collectionand Storage

Data

Processing

EventProcessing

Data Analysis

EMR Redshift

MachineLearning

Page 28: Big Data on AWS - Toronto FSI Symposium - October 2016
Page 29: Big Data on AWS - Toronto FSI Symposium - October 2016

Big Data in Action

FINRA handles approximately 30 billion market events every day to build a holistic picture of trading in the U.S.

Determisconduct by

enforcing the rules

Detectand prevent wrongdoing

in the U.S. markets

Disciplinethose who

break the rules

Page 30: Big Data on AWS - Toronto FSI Symposium - October 2016

Market volumes are volatile and steadily increasing

Exchanges and markets are evolving dynamically

New securities products are being introduced

New rules and regulations are being created

Market manipulators are innovating

FINRA – The Need for Big Data

Page 31: Big Data on AWS - Toronto FSI Symposium - October 2016

AWS Offered the Right Services For FINRA’s Platform

Cloud Platform

APIs at the right layer

Automated infrastructure deployment

Open source commitment

Operations Security

Page 32: Big Data on AWS - Toronto FSI Symposium - October 2016

FINRA – A Platform That Adapts to Market Dynamics

Data IntegrationHbaseHadoopMapReduce

Flexible Interactive QueriesHadoopEMRSQL/Hive

Fast Predefined QueriesHbase/NoSQLHadoopPredefined Datamarts

Surveillance AnalyticsEMRHive

Web ApplicationsAnalystsRegulators

Data Management ServicesData MovementData RegistrationNotificationVersion ManagementJob ManagementCluster Management

S3

Firms

Page 33: Big Data on AWS - Toronto FSI Symposium - October 2016

From One Instance

Page 34: Big Data on AWS - Toronto FSI Symposium - October 2016

To Thousands

Page 35: Big Data on AWS - Toronto FSI Symposium - October 2016

And Back Again

Page 36: Big Data on AWS - Toronto FSI Symposium - October 2016

“ At FINRA, we chose AWS because we wanted to be able to deliver innovation at a

much larger scale and much more rapidly to our core business.

”- Saman Michael Far, SVP of TechnologyWhat FINRA needed:• Infrastructure for its market surveillance platform• Analysis and storage of approximately 75 billion market records every day • Interactively query multi-petabyte data sets

Why they chose AWS:• Fulfillment of FINRA’s security requirements• Ability to create a flexible platform using dynamic clusters (Hadoop, Hive, and

HBase), Amazon EMR, and Amazon S3

Benefits realized:• Increased agility, speed, and cost savings• Estimated savings of $20m annually by using AWS

FINRA

FINRA is the largest independent regulator for all securities firms doing business in the US. FINRA oversees about 4,250 brokerage

firms, about 162,155 branch offices and approximately 629,525

registered securities representatives.

Page 37: Big Data on AWS - Toronto FSI Symposium - October 2016

“ The speed and performance of AWS are impressive. Data manipulation processes

that took days are now down to one minute.

National Bank of Canada has more than CAD$219 billion in AUM. The bank’s Global Equity Derivatives

Group (GED) is a leader in providing stock-trading solutions that manage exchange-traded securities such as stocks, funds, futures, and options.

- Pascal Bergeron, Director of Algorithmic TradingWhat the National Bank of Canada needed:• Quickly collect a fast-growing volume of stock-market financial data• Scale its data-analysis platform, which was outgrowing the on-prem resources• Process and analyze structured and unstructured data, historic and real time

Why they chose AWS:• The most big data services and solutions, such as Cloudera and TickSmith• Reliability to easily process and analyze hundreds of terabytes of data

Benefits realized:• Ability to easily access historic data, as far back as 10 years ago• Acceleration of post-trade analysis time, from weeks to hours• Improvement and optimization of trading operations, resulting in more revenue

The National Bank of Canada

Page 38: Big Data on AWS - Toronto FSI Symposium - October 2016
Page 39: Big Data on AWS - Toronto FSI Symposium - October 2016

The Benefits of Big Data on AWS

AgilityRespond quickly to market challenges

Speed Cost SavingsReduce query times from

hours to secondsEfficient scale

Pay for what you use

Page 40: Big Data on AWS - Toronto FSI Symposium - October 2016

Thank you

@shawnagram


Recommended