+ All Categories
Home > Technology > Low-cost Open Data As-a-Service

Low-cost Open Data As-a-Service

Date post: 05-Aug-2015
Category:
Upload: marin-dimitrov
View: 391 times
Download: 2 times
Share this document with a friend
Popular Tags:
17
Low-cost Open Data As-a-Service Marin Dimitrov, Alex Simov, Yavor Petkov May 31 st , 2015 Low-cost Open Data as-a-Service / SemDev’2015 #1 May 2015
Transcript
Page 1: Low-cost Open Data As-a-Service

Low-cost Open Data As-a-Service

Marin Dimitrov, Alex Simov, Yavor Petkov

May 31st, 2015

Low-cost Open Data as-a-Service / SemDev’2015 #1 May 2015

Page 2: Low-cost Open Data As-a-Service

• Use cases & requirements

• Cloud architecture for a RDF DBaaS

• Lessons learned

Contents

#2 May 2015 Low-cost Open Data as-a-Service / SemDev’2015

Page 3: Low-cost Open Data As-a-Service

Use Cases & Requirements

#3 May 2015 Low-cost Open Data as-a-Service / SemDev’2015

Page 4: Low-cost Open Data As-a-Service

Why an RDF DBaaS?

#4 May 2015 Low-cost Open Data as-a-Service / SemDev’2015

Grafter Grafterizer

RDF DBaaS Open Data Portal

• Transform tabular data into RDF • Publish (Linked) data services,

instead of static datasets • Lower-cost & easier data

publishing process

Page 5: Low-cost Open Data As-a-Service

Why an RDF DBaaS?

#5 May 2015 Low-cost Open Data as-a-Service / SemDev’2015

• Transform textual data into RDF • Linked data services • Low-cost & easy to use

Page 6: Low-cost Open Data As-a-Service

• Elastic

– dynamically adapt to growing data & query volumes

• High availability & resilience

– no SPFs, “graceful degradation” upon failures

• Cost efficient

• Host a large number of data services (databases)

– But probably of low/moderate data & query volume

• Isolation of the multi-tenant databases

DBaaS requirements

#6 May 2015 Low-cost Open Data as-a-Service / SemDev’2015

Not easy to achieve all three!

Page 7: Low-cost Open Data As-a-Service

Cloud Architecture

#7 May 2015 Low-cost Open Data as-a-Service / SemDev’2015

Page 8: Low-cost Open Data As-a-Service

• AWS based

– Network storage, compute & autoscaling, load balancing, integration services, …

• Ontotext GraphDB as the RDF DB engine

– OpenRDF REST API

• Docker for containerisation

• An RDF DBaaS is…

– A GraphDB instance…

– Running within a Docker container…

– Storing its data on a private NAS volume

DBaaS architecture on AWS

#8 May 2015 Low-cost Open Data as-a-Service / SemDev’2015

Page 9: Low-cost Open Data As-a-Service

DBaaS architecture on AWS

#9 May 2015 Low-cost Open Data as-a-Service / SemDev’2015

Elasticity vs High Availability vs

Cost Efficiency

Page 10: Low-cost Open Data As-a-Service

Dealing with failures

#10 May 2015 Low-cost Open Data as-a-Service / SemDev’2015

our responsibility

CSP responsibility

Page 11: Low-cost Open Data As-a-Service

• Elastic

– Routing nodes, data nodes + NAS storage grow as usage grows

• High availability & resilience

– Strategies for dealing with failures in data, routing, Coordinator nodes

– Planned: multi-DC deployment with replication

• Cost efficient

– Cloud native architecture -> cost savings

– Multi-tenant model -> cost savings

– Elastic: return underutilised or unused resources back to CSP

Evaluation

#11 May 2015 Low-cost Open Data as-a-Service / SemDev’2015

Page 12: Low-cost Open Data As-a-Service

Lessons Learned

#12 May 2015 Low-cost Open Data as-a-Service / SemDev’2015

Page 13: Low-cost Open Data As-a-Service

• Cloud-native architecture

– Improved scalability, reliability, cost savings

• A microservice architecture will continuously evolve

• Assume that failures will happen on all levels

– Design for “graceful degradation”

• A good DevOps process is essential

Lessons Learned

#13 May 2015 Low-cost Open Data as-a-Service / SemDev’2015

Page 14: Low-cost Open Data As-a-Service

Discussion

#14 May 2015 Low-cost Open Data as-a-Service / SemDev’2015

Page 15: Low-cost Open Data As-a-Service

• Use it for free!

– http://s4.ontotext.com (available NOW)

– http://dapaas.eu (end of June)

• Send us questions, comments, criticism, suggestions for improvements, …

Help us improve it!

#15 May 2015 Low-cost Open Data as-a-Service / SemDev’2015

Page 16: Low-cost Open Data As-a-Service

• Are you measuring the TCO of your on-premise RDF databases?

– Important for many Open Data scenarios

• What is your #1 concern for using an RDF DBaaS

• Do you have use cases where your productivity will increase by using an RDF DBaaS

– Experiment & prototype faster; focus on building apps, don’t worry about infrastructure; provision new DBs instantly…

– Real world example: training courses by Ontotext switching from local deployments to the RDF DBaaS

Discussion topics

#16 May 2015 Low-cost Open Data as-a-Service / SemDev’2015

Page 17: Low-cost Open Data As-a-Service

Thank you!

#17 May 2015 Low-cost Open Data as-a-Service / SemDev’2015


Recommended