© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Felix Candelario – Global Financial Services Solutions Architect
March 23, 2017
AWS-Enabled Disaster Recovery
and Business Continuity for SIFIs
ELB/Appliance
EC2/Auto Scaling
Route 53
Load Balancers
Web/App Servers
Your Data
Centers
DNS
Amazon RDS
Security Groups / ACL
Availability Zones / VPC
Multi-regionGeographical
Redundancy
Data Centers
Firewall
Database Servers
DR Terminology Map
Backup &
Restore
Pilot Light
Warm
Standby
Hot-Site
Backup of on-premises
data to AWS to use in a
DR event
Replicate data and
minimal running
services into AWS,
ready to take over and
flare up
Replicate data and
services into AWS
ready to take over
Replicated and load
balanced environments
that are both actively
taking production traffic
RPO
aRTO
COST
24 hours 24 hours
$
RPO
aRTO
COST
12 hours 4 hours
$$
RPO
aRTO
COST
1-4 hours 15 min
$$$
RPO
aRTO
COST
<15 min 0-5 min
$$$
Ascending Levels of DR Options
~$200 / Month
In US-EAST
+VPN
On-premises
Active Production
www.example.com
Corporate data center AWS region
AWS DR failover
App
Servers
DB
Server
VPN
Connection
Storage
GatewayiSCSI
Backup
System
S3 / Bucket
Glacier / Archive
Web
Servers Internet traffic
S3 (1TB)
$31/Month
Glacier (2TB)
$22/Month
Storage Gateway
$125/Month
S3 / Bucket
S3 (1TB)
$31/Month1TB Data
Volume
Backup and Restore Architecture
Suitable for
• Solutions that can sustain higher technical debt
• Lower business critical nature
• Low cost DR option
Leverage existing investments in
• De-duplication
• Compression
• WAN Acceleration
Backup and Restore Details
Subordinate
database
server
Pilot light–prepwww.example.com
Data mirroring replication
Not running
Pilot light system
Reverse
proxy/
caching
server
Datavolume
Application
server
Corporate data center
Reverse proxy/ caching server
Application server
MasterDatabase
server
Database
server
Pilot light–recoverywww.example.com
Start in minutes
Add additional
capacity,
if needed
Reverse
proxy/
caching
server
Datavolume
Application
server
Corporate data center
Reverse proxy/ caching server
Application server
MasterDatabase
server
Considerations
Suitable for:
• Solutions that need lower RTO & RPO
• higher business critical nature
• Mid-range cost DR option
Pilot Light Details
Warm standby–prep
Mirroring /replication
Application data source
cut over
Elastic loadbalancer
ActiveNot active for
production traffic
Route 53
www.example.com
Scaled down
standbyCorporate data center
Datavolume
Applicationserver
Subordinatedatabase
server
Reverse proxy/ caching server
AWS region
Reverse proxy/ caching server
Application server
MasterDatabase
server
Warm standby–recover
Elastic loadbalancerActive
Route 53
www.example.com
Scaled-up
production
Corporate data center
Datavolume
Applicationserver
Databaseserver
AWS region
Reverse proxy/ caching server
Application Server
MasterDatabase
server
Reverse proxy/ caching server
Hot site – prep
Mirroring /replication
Application data source
cut over
Elastic loadbalancer
ActiveRoute 53
www.example.com
Corporate data center
Datavolume
Applicationserver
Subordinate database
server
AWS region
Reverse proxy/ caching server
Application server
MasterDatabase
server
Active
Reverse proxy/ caching server
Hot site – recovery
Elastic loadbalancer
Route 53
www.example.com
Corporate data center
Datavolume
Applicationserver
Databaseserver
Reverse proxy/ caching server
AWS region
Reverse proxy/ caching server
Application server
MasterDatabase
server
Active
Scaled up
for production
use
Considerations
Suitable for:
• Solutions that require RTO & RPO in minutes
• Core business critical functions
• Higher cost DR option
Warm Standby and Multi-site Details
Continuous Testing of Infrastructure
• Continuously and constantly test.
• Regularly execute tests in stable, production &
production-like test environments.
• Infrastructure as Code
• CI/CD Test in Infrastructure Build Pipeline
• Testing of infrastructure during Integration Test
Warm Standby – Testing
Mirroring /replication
Application data source
cut over
Elastic loadbalancer
ActiveNot active for
production traffic
Route 53
www.example.com
Scaled down
standbyCorporate data center
Datavolume
Applicationserver
Subordinatedatabase
server
Reverse proxy/ caching server
AWS region
Reverse proxy/ caching server
Application server
MasterDatabase
server
Warm Standby – Testing
Mirroring /replication
Application data source
cut over
Elastic loadbalancer
ActiveNot active for
production traffic
Route 53
www.example.com
Scaled down
standbyCorporate data center
Datavolume
Applicationserver
Subordinatedatabase
server
Reverse proxy/ caching server
AWS region
Reverse proxy/ caching server
Application server
MasterDatabase
server
Warm Standby – Testing
Mirroring /replication
Application data source
cut over
Elastic loadbalancer
ActiveNot active for
production traffic
Route 53
www.example.com
Scaled down
standbyCorporate data center
Datavolume
Applicationserver
Subordinatedatabase
server
Reverse proxy/ caching server
AWS region
Reverse proxy/ caching server
Application server
MasterDatabase
server
Warm Standby – Testing
Mirroring /replication
Application data source
cut over
Elastic loadbalancer
ActiveNot active for
production traffic
Route 53
www.example.com
Scaled down
standbyCorporate data center
Datavolume
Applicationserver
Subordinatedatabase
server
Reverse proxy/ caching server
AWS region
Reverse proxy/ caching server
Application server
MasterDatabase
server
aws rds reboot-db-instance --db-instance-identifier
dbInstanceID --force-failover
Cloud-Based Architectures
• High level of control over the environment
• Automate Everything! – Utilize AWS APIs
• Infrastructure as code – CloudFormation
• Parallel environment
• Rolling Update / All at Once
• Blue / Green Deployments
- Significant difference between physical and cloud is the
control and visibility cloud provides
Benefits of deployment on AWS
AWS:
• Agile deployments
• Flexible options
• RPO/RTO & Business
Continuity objectives
• Scalable capacity
• Pay for what you use
• Automation capabilities