Date post: | 01-Dec-2014 |
Category: |
Technology |
Upload: | antonio-severien |
View: | 514 times |
Download: | 2 times |
Rela%onal Cloud A Database-‐as-‐a-‐Service for the Cloud
Paper by Carlo Curino et al. @mit.edu
Presenta%on by Antonio Severien [email protected]
Overview
Ø Rela%onal Databases Ø Database-‐as-‐a-‐Service (DBaaS) Ø Problems AEacked
Ø Efficient Mul%-‐tenancy Ø Elas%c Scalability Ø Database Privacy
Ø Rela%onal Cloud Ø Experiments Ø Conclusion
2
Rela%onal Cloud
Ø Rela%onal Databases Ø Database-‐as-‐a-‐Service (DBaaS) Ø Problems AEacked
Ø Efficient Mul%-‐tenancy Ø Elas%c Scalability Ø Database Privacy
Ø Rela%onal Cloud Ø Experiments Ø Conclusion
3
Rela%onal Databases Ø 1970 by Edgar Codd, IBM research San Jose Ø Tables
Ø Rows à Tuples Ø Columns à AEributes
Ø Rela%onal Database Management Systems (RDBMS)
4
Rela%onal Cloud
Ø Rela%onal Databases Ø Database-‐as-‐a-‐Service (DBaaS) Ø Problems AEacked
Ø Efficient Mul%-‐tenancy Ø Elas%c Scalability Ø Database Privacy
Ø Rela%onal Cloud Ø Experiments Ø Conclusion
5
Database-‐as-‐a-‐Service (DBaaS)
Ø Cloud Ø Reduce management, opera%onal and energy costs
Ø Elas%city and scale economy Ø Pay-‐per-‐use
6
amazon RDS
Rela%onal Cloud
Ø Rela%onal Databases Ø Database-‐as-‐a-‐Service (DBaaS) Ø Problems AEacked
Ø Efficient Mul%-‐tenancy Ø Elas%c Scalability Ø Database Privacy
Ø Rela%onal Cloud Ø Experiments Ø Conclusion
7
Problems AEacked
8
Efficient Mul%-‐tenancy
Elas%c Scalability Privacy
9
Efficient Mul%-‐tenancy
Efficient Mul%-‐tenancy
Ø Reduce costs Ø Efficient usage of resources Ø Maximize hardware u%liza%on Ø Single database server on each machine Ø Maintain applica%on query performance
10
Efficient Mul%-‐tenancy
Ø Reduce costs Ø Efficient usage of resources Ø Maximize hardware u%liza%on Ø Single database server on each machine? Ø Maintain applica%on query performance
11 Virtual Machine
Efficient Mul%-‐tenancy
Ø Problems Ø Monitoring resource requirements for workloads Ø Predic%ng the load generated Ø Assigning workloads to physical machines Ø Migra%ng workloads between nodes Ø Live migra*on
12
Efficient Mul%-‐tenancy
Ø Kairos (Monitoring and consolida%on engine) Ø Resource Monitor
Disk ac%vity and RAM requirements Ø Combined Load Predictor
CPU, RAM, Disk model that predicts the combined resource requirements
Ø Consolida%on Engine Non-‐linear op%miza%on techniques to…
… minimize the number of machines needed … balance load between back-‐end machines
13
Elas%c Scalability
14
Elas%c Scalability
Ø Workload exceeds single machine capacity
15
Ø Scale a single database to mul%ple nodes Ø Scale-‐out by query processing par%%oning Ø Granular placement and load balance on backend
Elas%c Scalability
Ø Strategy well suited for OLTP and Web workloads… but can extend to OLAP
Ø Minimize cross-‐node distributed transac%ons
Ø Workload-‐aware par**oner Ø Par%%on data to minimize mul%-‐node transac%ons Ø Front-‐end analyses execu%on traces represented as a graph
16
Graph Par%%oning
17
id name age salary
id name age salary
id name age salary
we=2
we : weight of edge
we=1
we=10
Graph Par%%oning
18
id name age salary
id name age salary
id name age salary
we=2
we : weight of edge
we=1
we=10
Graph Par%%oning
19
id name age salary
id name age salary
id name age salary
Privacy
20
Privacy
Ø Adjustable security Ø Onion ring encryp%on design
2 onion layer and 1 homomorphic encryp%on of integer
Ø SQL query on encrypted data Ø Security level dynamically adap%ve
Converge to an overall security level
21
Onion Layers of Encryp%on
Value
6. RND: no func%onality
4. DET: equality selec%on
2. DET: equality join
Value
5. RND: no func%onality
3. OPE: inequality select, min, max, sort, group-‐by
1. OPE: inequality join
int value
HOM: addi%on
string value
String search
or
RND = Randomized Encryp%on (no opera%ons allowed) DET = Determinis%c Encryp%on OPE = Order-‐preserving Encryp%on HOM = Homomorphic Encryp%on (opera%ons over encrypted data)
Strong
Weak 22
Rela%onal Cloud
Ø Rela%onal Databases Ø Database-‐as-‐a-‐Service (DBaaS) Ø Problems AEacked
Ø Efficient Mul%-‐tenancy Ø Elas%c Scalability Ø Database Privacy
Ø Rela%onal Cloud Ø Experiments Ø Conclusion
23
Privacy-‐preserving Queries
Privacy-‐preserving Results
Rela%onal Cloud Architecture Client Nodes
User Applica%on
JDBC-‐client (CryptoDB enabled)
Frontend Nodes
Router Distributed Transac%onal Coordina%on
Admin Nodes
Par%%oning Engine
Placement and Migra%on Engine
Backend Nodes
CryptoDB Encryp%on Engine
Backend Nodes
CryptoDB Encryp%on Engine Par**ons
Placement
Database load stats
Trusted Pla,orm (Private/Secured)
Untrusted Pla,orm (Public)
Users
24
Rela%onal Cloud
Ø Rela%onal Databases Ø Database-‐as-‐a-‐Service (DBaaS) Ø Problems AEacked
Ø Efficient Mul%-‐tenancy Ø Elas%c Scalability Ø Database Privacy
Ø Rela%onal Cloud Ø Experiments Ø Conclusion
25
Experiments
26
Experiments
Bad results? Tradeoff for be=er privacy
27
Experiments
28
Scaling TPC-‐C
Rela%onal Cloud
Ø Rela%onal Databases Ø Database-‐as-‐a-‐Service (DBaaS) Ø Problems AEacked
Ø Efficient Mul%-‐tenancy Ø Elas%c Scalability Ø Database Privacy
Ø Rela%onal Cloud Ø Experiments Ø Conclusion
29
Conclusion
Ø Presented Rela%onal Cloud Ø Efficient Mul%-‐tenancy
Ø Novel resource es%ma%on Ø Non-‐linear op%miza%on-‐based consolida%on technique
Ø Scalability Ø Graph-‐based par%%oning
Ø Privacy Ø Adjustable privacy Ø SQL queries on encrypted data
Ø DBaaS is a viable cloud service
30
References Ø "Rela%onal Cloud: a Database Service for the cloud" Carlo
Curino, Evan Jones, Raluca Popa, Nirmesh Malviya, Eugene Wu, Sam Madden, Har Balakrishnan, Nickolai Zeldovich
Ø hEp://rela%onalcloud.com
32
Privacy
CryptoDB Example
SELECT i_price, ... FROM item WHERE i_id = N
JDBC client decrypts DET level 4
DET-‐encrypted cyphertext Return to JDBC client decrypted
RND cyphertexts
33