Mind the gap: understanding today’s data platforms landscape
Matthew Aslett, research director
451 Research is an information technology research & advisory company
Founded in 2000
210+ employees, including over 100 analysts
1,000+ clients: Technology & Service providers, corporate advisory, finance, professional services, and IT decision makers
10,000+ senior IT professionals in our research community
Over 52 million data points each quarter
4,500+ reports published each year covering 2,000+ innovative technology & service providers
Headquartered in New York City with offices in London, Boston, San Francisco, and Washington D.C.
451 Research and its sister company Uptime Institute comprise the two divisions of The 451 Group
Research & Data
Advisory Services
Events
[Figure: Data Platforms Map, October 2014. A map that places database and data platform products on lines running through a relational zone, a non-relational zone and a grid/cache zone, with branches towards e-discovery, enterprise search and SIEM. Key: General purpose; Specialist analytic; BigTables; Graph; Document; Key value stores; -as-a-Service; Key value direct access; Hadoop; MySQL ecosystem; Advanced clustering/sharding; New SQL databases; Data caching; Data grid; Search; Appliances; In-memory; Stream processing.]
Data Platforms Map
The industry analyst’s insatiable desire to categorize
Darrenkw on Flickr: https://www.flickr.com/photos/darrenkw/3327185668
Segmenting the market
§ NoSQL, NewSQL and Beyond
• Assessing the drivers behind the development and adoption of NoSQL and NewSQL databases, as well as data grid/caching technologies
• Published April 2011
[Figure: market segmentation diagram placing vendors in segments labelled Relational, Non-relational, Operational, Analytic, -as-a-Service, NewSQL, NoSQL (Document, Graph, Key value, Big tables) and Data Grid/Cache.]
The “Drawn to Scale” problem:
Company developing a Hadoop-based NewSQL database.
Inspiration
§ Harry Beck’s London Underground Tube Map
http://britton.disted.camosun.bc.ca/beck_map.jpg
Kai Chan Vong on Flickr: https://www.flickr.com/photos/kaichanvong/3723850455
§ Simon Patterson’s “The Great Bear”
http://www.artrepublic.com/biographies/200-simon-patterson.html
§ The Real Story Group’s Content Technology Vendor Map
http://www.realstorygroup.com/vendormap/
Left to my own devices
[Figure: the full Data Platforms Map, October 2014, shown again with the same zones and key as at the start of the deck.]
Data Platforms Landscape
[Figure: Data Platforms Map, circa 2008/9. Relational zone, non-relational zone and grid/cache zone, with branches towards e-discovery, enterprise search and SIEM. Key: General purpose; Specialist analytic; Data caching; Search; In-memory; Stream processing.]
[Figure: Data Platforms Map, Expansion. The 2008/9 zones and lines with additional vendors on the general purpose, specialist analytic, data caching, data grid, search, appliance, in-memory and stream processing lines.]
[Figure: Data Platforms Map, NoSQL. The expanded map with NoSQL vendors (BigTables, graph, document and key value stores, for example Cassandra, Neo4J, MongoDB and Redis) added to the non-relational zone.]
[Figure: Data Platforms Map, NewSQL. The map with the MySQL ecosystem, advanced clustering/sharding and New SQL database vendors (for example NuoDB, VoltDB, MemSQL and Clustrix) added.]
[Figure: Data Platforms Map, Hadoop. The map with Hadoop distributions and SQL-on-Hadoop projects (for example Cloudera, Hortonworks, MapR, Impala and Apache Hive) added.]
[Figure: Data Platforms Map, DBaaS. The map with -as-a-Service offerings (for example AWS RDS, AWS DynamoDB, Google Cloud SQL, Azure SQL Database and Heroku Postgres) added.]
The big bang: drivers for change
NoSQL DBaaS Hadoop NewSQL
Architecture: Cloud, Elastic, Distributed, Scalable, Virtual, Flexible
Developers: JSON, Agile, Schema-on-read, REST, Schemaless, Flexible
Applications: Web, Mobile, Interactive, Social, Always-on, Local, Global
Drivers for change influence each other
§ New applications require distributed architecture
§ Distributed architecture encourages new development approaches
§ New development approaches demand new architecture
§ Distributed architecture enables new applications
§ New app requirements demand new development approaches
§ New dev approaches enable new lightweight apps
Drivers for change: applications
§ Social – increased interactivity generates data
§ Mobile – different form factors and access methods
§ Global – applications need to be immediately available everywhere
§ Local – need to deliver localized content
Social, mobile, global, local all have implications for data connectivity
Drivers for change: developers
§ Need to unite application development and database management people/processes to achieve common goals
§ DevOps movement growing apace
§ Rapid development and continuous delivery are inconsistent with traditional database management processes
Developers increasingly drive data management and database selection
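The "schema-on-read" and "schemaless" keywords on the developers slide can be illustrated with a minimal Python sketch. The event records, field names and the `read_purchases` helper are hypothetical and exist only for illustration; the point is that records are stored as produced and a structure is imposed only when they are read.

```python
import json

# Hypothetical raw events, kept exactly as the application emitted them.
# Note that the records do not share one fixed set of fields.
RAW_EVENTS = [
    '{"user": "ana", "action": "login"}',
    '{"user": "ben", "action": "purchase", "amount": "19.99"}',
    '{"user": "cy", "action": "purchase", "amount": 5, "coupon": "SPRING"}',
]


def read_purchases(raw_events):
    """Apply a schema at read time: keep purchases, coerce amount to float."""
    purchases = []
    for line in raw_events:
        record = json.loads(line)
        if record.get("action") != "purchase":
            continue
        purchases.append({
            "user": record["user"],
            "amount": float(record.get("amount", 0)),
            "coupon": record.get("coupon"),  # optional field, may be absent
        })
    return purchases


if __name__ == "__main__":
    for purchase in read_purchases(RAW_EVENTS):
        print(purchase)
```

Adding a new field to future events needs no migration of stored data in this sketch; only the read-side function changes, which is one reason the approach suits rapid development.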
Drivers for change: DevOps
Findings:
• 51% cite business and strategy drivers: competitive, business productivity, revenue
• 49% cite technology functionality, new feature sets, reduced development costs
• As DevOps matures as a discipline, aligning business and IT goals is ever more important.
451 Research survey of 200 qualified North American DevOps practitioners or influencers
Drivers for change: architecture
Transitioning from a traditional database to a distributed database
§ Interactive applications mean the pace of user growth and the multiplicity of data types are too great for traditional relational databases to absorb efficiently.
§ Scalability § Performance § Relaxed consistency § Agility § Intricacy § Necessity
Transitioning from on-premises computing to the cloud
Drivers for change: architecture
Transitioning from a traditional database to a distributed database
Transitioning from on-premises computing to the cloud
§ Amazon’s top enterprise use cases are (in order of popularity, starting with the most popular):
• Development and test
• New workloads
• Supplement existing workloads with cloud
• Migration of existing workloads to the cloud
• Datacenter migration
• All-in cloud
• Top three adoption drivers for public cloud are additive to the existing database landscape
Drivers for change: shift to private cloud
Transitioning from on-premises computing to the cloud
For each of the major digital infrastructure deployment methods, how are your IT services (% of applications/workloads delivered) distributed?
Deployment method | 2013 | 2015
Other Deployment Method | 1% | 1%
Off-premises Public Cloud Services (Other Than SaaS) | 1% | 6%
Off-premises Public Cloud Services (SaaS) | 5% | 9%
Hybrid Cloud | 10% | 25%
Off-premises IT Outsourcing, Dedicated/Managed | 12% | 13%
Internal, On-premises, Private Cloud-based | 19% | 30%
Internal, On-premises, Non-cloud Digital Infrastructure (Physical or …) | 62% | 41%
The majority of workloads today are on internal, on-premises, non-cloud infrastructure and are likely to remain on-premises, albeit with a move to private cloud
Source: Cloud Computing – Wave 6 https://451research.com/report-long?icid=3021
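The shift described on this slide can be made concrete with a little arithmetic. The sketch below computes the percentage-point change per deployment method. The figures are transcribed from the chart, the shortened labels are my own, and the assignment of the two series to 2013 and 2015 is an assumption inferred from the slide commentary (non-cloud is the majority "today", private and hybrid cloud grow).

```python
# Shares of applications/workloads per deployment method, in percent.
# Tuple order is (2013, 2015); this year assignment is an assumption
# inferred from the slide commentary, not stated explicitly by the chart.
SHARE_BY_METHOD = {
    "Other deployment method": (1, 1),
    "Public cloud (other than SaaS)": (1, 6),
    "Public cloud (SaaS)": (5, 9),
    "Hybrid cloud": (10, 25),
    "IT outsourcing, dedicated/managed": (12, 13),
    "On-premises private cloud": (19, 30),
    "On-premises non-cloud": (62, 41),
}


def point_changes(shares):
    """Return {method: 2015 share minus 2013 share}, in percentage points."""
    return {method: y2015 - y2013 for method, (y2013, y2015) in shares.items()}


if __name__ == "__main__":
    changes = point_changes(SHARE_BY_METHOD)
    ranked = sorted(changes.items(), key=lambda item: item[1], reverse=True)
    for method, delta in ranked:
        print(f"{method:35s} {delta:+d} points")
```

Under these assumptions the largest gain is in hybrid cloud and the only decline is in on-premises non-cloud infrastructure, which matches the three takeaways the deck draws from this chart.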
Drivers for change: public cloud not significant
Transitioning from on-premises computing to the cloud
For each of the major digital infrastructure deployment methods, how are your IT services (% of applications/workloads delivered) distributed?
[Chart: the same 2013 and 2015 deployment-method percentages as on the previous slide]
Public cloud and SaaS are growing, but not making a significant impact
Source: Cloud Computing – Wave 6 https://451research.com/report-long?icid=3021
Drivers for change: hybrid cloud is the future
Transitioning from on-premises computing to the cloud
For each of the major digital infrastructure deployment methods, how are your IT services (% of applications/workloads delivered) distributed?
[Chart: the same 2013 and 2015 deployment-method percentages as on the two previous slides]
The future of the cloud is hybrid
Source: Cloud Computing – Wave 6 https://451research.com/report-long?icid=3021
Drivers for change: the route to hybrid is private
Transitioning from on-premises computing to the cloud
For each of the following categories of workload/business functions, what is your primary deployment method likely to be in the next two years?
Workload | Internal Private Cloud | Hybrid Cloud | External Public Cloud and SaaS
Back-office Enterprise Applications | 53% | 18% | 29%
Batch Computing Applications | 68% | 14% | 18%
Cloud-native Applications | 11% | 23% | 66%
Collaborative Applications | 35% | 17% | 46%
Customer-facing Enterprise Applications | 53% | 26% | 21%
E-business Hosting | 18% | 43% | 39%
Test and Development of Applications | 42% | 27% | 31%
Source: Cloud Computing – Wave 6 https://451research.com/report-long?icid=3021
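As a rough check on how these figures support the "route to hybrid is private" reading, the following Python sketch picks the leading planned deployment method for each workload and sums each row. The numbers are transcribed from the chart; pairing the three series with the legend in the order shown is an assumption, and the function names are illustrative.

```python
# Series order is assumed to follow the chart legend.
METHODS = ("Internal private cloud", "Hybrid cloud", "External public cloud and SaaS")

# Planned primary deployment method per workload, in percent of respondents.
PLANNED = {
    "Back-office enterprise applications": (53, 18, 29),
    "Batch computing applications": (68, 14, 18),
    "Cloud-native applications": (11, 23, 66),
    "Collaborative applications": (35, 17, 46),
    "Customer-facing enterprise applications": (53, 26, 21),
    "E-business hosting": (18, 43, 39),
    "Test and development of applications": (42, 27, 31),
}


def leading_method(shares):
    """Return the name of the method with the largest share."""
    index = max(range(len(shares)), key=lambda i: shares[i])
    return METHODS[index]


def summarize(planned):
    """Return {workload: (leading method, row total in percent)}."""
    return {
        workload: (leading_method(shares), sum(shares))
        for workload, shares in planned.items()
    }


if __name__ == "__main__":
    for workload, (method, total) in summarize(PLANNED).items():
        print(f"{workload:42s} {method:32s} total={total}%")
```

Each row totals 100% except collaborative applications at 98%, which suggests the three series are read in the right grouping. Internal private cloud leads for four of the seven workloads under this reading.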
Drivers for change
Applications, Developers, Architecture
NoSQL DBaaS Hadoop NewSQL
New databases: differences
NoSQL: Non-relational data models. Trade off consistency for availability
NewSQL: Adds availability and flexibility to the familiar relational data model
DBaaS: Any of the above, or traditional RDBMS, delivered as a service
Hadoop: Batch (and now interactive) analytic processing of unstructured data
New databases: use-cases
Approach | Details | Examples
NoSQL | Non-transactional operational applications, unstructured data, lightweight query | MongoDB, Couchbase, Cassandra, Redis, Aerospike, Cloudant
NewSQL | Transactional operational apps, structured data, complex query, operational intelligence | NuoDB, MemSQL, Translattice, VoltDB, Splice Machine
Hadoop | Non-transactional analytic applications, multi-structured data, complex query | Cloudera, MapR, Hortonworks, Pivotal, IBM, Teradata
DBaaS | Any of the above, or traditional RDBMS, delivered as a service | ObjectRocket, AWS DynamoDB, AWS RDS, Altiscale, Qubole
The big bang
§ The database market expanded rapidly, driven partly by polyglot persistence
• Specialist databases for specialist use-cases
Wide-column stores, Key value stores, Graph databases, Document stores, NewSQL, Hadoop, DBaaS
The big crunch?
§ Multi-model databases are a growing trend to support multiple use-cases and ease operational complexity – is the database market contracting?
Wide-column stores, Key value stores, Graph databases, Document stores
Multi-model databases: Support a combination of the various individual NoSQL data models.
NewSQL, Hadoop, DBaaS
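To make the multi-model idea concrete, here is a toy Python sketch in which one in-memory store answers key-value, document and graph style requests over the same records. The `MultiModelStore` class and its method names are hypothetical and do not correspond to any product on the map; a real multi-model database would add persistence, indexing and a query language.

```python
class MultiModelStore:
    """Toy in-memory store exposing key-value, document and graph access."""

    def __init__(self):
        self._docs = {}   # key -> document (a dict of fields)
        self._edges = {}  # source key -> set of (label, target key)

    # Key-value access: whole records by key.
    def put(self, key, document):
        self._docs[key] = dict(document)

    def get(self, key):
        return self._docs.get(key)

    # Document access: match records on field values.
    def find(self, **criteria):
        return sorted(
            key for key, doc in self._docs.items()
            if all(doc.get(field) == value for field, value in criteria.items())
        )

    # Graph access: labelled edges between records.
    def link(self, source, label, target):
        self._edges.setdefault(source, set()).add((label, target))

    def neighbors(self, source, label):
        return sorted(
            target for lbl, target in self._edges.get(source, set()) if lbl == label
        )


if __name__ == "__main__":
    store = MultiModelStore()
    store.put("user:1", {"name": "Ana", "city": "London"})
    store.put("user:2", {"name": "Ben", "city": "London"})
    store.link("user:1", "follows", "user:2")
    print(store.get("user:1"))
    print(store.find(city="London"))
    print(store.neighbors("user:1", "follows"))
```

With polyglot persistence each of these three access patterns would typically live in a separate specialist database; the sketch shows why consolidating them behind one store can ease operational complexity.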
The big crunch?
§ Increasingly that also means SQL, as NoSQL vendors add SQL support and SQL vendors add support for other models
NewSQL, Wide-column stores, Key value stores, Graph databases, Document stores
Multi-model databases: Support a combination of various individual data models.
Hadoop, DBaaS
The big crunch?
§ While aaS is really just a deployment option
NewSQL, Wide-column stores, Key value stores, Graph databases, Document stores
Multi-model databases: Support a combination of various individual data models.
XaaS
Hadoop
The big crunch?
§ And HDFS could potentially become a common substrate for multiple data processing models
NewSQL, Wide-column stores, Key value stores, Graph databases, Document stores
Multi-model databases: Support a combination of various individual data models.
Hadoop Distributed File System
XaaS
§ Distributed, elastic, flexible, new projects
NewSQL
Building blocks of the next-generation data platform?
Wide-column stores
Key Value stores
Graph databases
Document stores
Multi-model databases
Support a combination of various individual data models.
Hadoop Distributed File System
XaaS
?
The journey to a next-generation data platform
Status quo
Multi-tenant, multi-model, multi-data center, hybrid, agile, elastic, distributed, as-a-service, automated
Strategic NewSQL, NoSQL, Hadoop, DBaaS production deployments, strategic vendor relationships
Tactical NewSQL, NoSQL, Hadoop, DBaaS PoCs and departmental deployments, shadow IT
Centralized, scale-up, SQL relational databases
Managed polyglot data platforms
Strategic expansion
Unmanaged polyglot databases
Tactical expansion
Managed polyglot databases
Tactical consolidation
Managed strategic data platform
Strategic consolidation
SQL-on-Hadoop, federated query, multi-model NoSQL and NewSQL, managed DBaaS, hybrid cloud
Data Platforms Map, October 2014 (figure; repeats the map shown at the start of the deck)
Zones: Relational zone, Non-relational zone, Grid/cache zone
Adjacent markets: Towards E-discovery, Towards enterprise search, Towards SIEM
Key: General purpose, Specialist analytic, BigTables, Graph, Document, Key value stores, -as-a-Service, Key value direct access, Hadoop, MySQL ecosystem, Advanced clustering/sharding, New SQL databases, Data caching, Data grid, Search, Appliances, In-memory, Stream processing
Thank You! [email protected] @maslett www.451research.com