+ All Categories
Transcript
Page 1: Big data   hype or reality
Page 2: Big data   hype or reality

Agenda

• What is Big Data?

• Technology Radar

• Technologies in scope.

• Architecture

• Wanted!

• Next steps.

Page 3: Big data   hype or reality

The world of data is changing.

Page 4: Big data   hype or reality

Data has a chaotic nature.

Page 5: Big data   hype or reality

Big Data <> Big DataBig Data == Big in Data.

Page 6: Big data   hype or reality

Big Data = 4 V’s.

Page 7: Big data   hype or reality

Volume = Dealing with the size.

Page 8: Big data   hype or reality

Variety =

Handling the multiplicity of types, sources and formats.

Page 9: Big data   hype or reality

Velocity=

Reacting to the flood of information in the time required by the application.

Page 10: Big data   hype or reality

Veracity =

How can we cope with uncertainty, imprecision, missing values or untruths.

Page 11: Big data   hype or reality

Big Data 1.0=

Building the capabilities to process large dataIn support of their current operations

(efficiency improvement).

Page 12: Big data   hype or reality

Big Data 2.0=

What can I now do that I couldn’t do before, or do better then I could do before.

Page 13: Big data   hype or reality

Polyglot persistence

• Relational databases are not dead.

• Enterprises should expect multiple data-storage technologies for different applications.

• Even for a single application, polyglot persistence is good.

• Do not replace one database solution with another to expect wonders.

Page 14: Big data   hype or reality
Page 15: Big data   hype or reality
Page 16: Big data   hype or reality

Technologies in the picture

• Hadoop and technologies build on top of it.

• ElasticSearch.

• neo4J.

Page 17: Big data   hype or reality

Hadoop

• Apache Foundation

• Commercial solutions

• Hortonworks

• Cloudera

• MapR

Page 18: Big data   hype or reality
Page 19: Big data   hype or reality
Page 20: Big data   hype or reality

And many more...

Page 21: Big data   hype or reality

ElasticSearch

• Based on lucene.

• ElasticSearch is also the name of the company.

• Search, analyze and index in realtime.

• Distributed.

• High availability.

• Document-oriented.

• Schema free

• RESTful api

Page 22: Big data   hype or reality
Page 23: Big data   hype or reality
Page 24: Big data   hype or reality

neo4j

• Graph database.

• Ideal for metadata and relationships.

• Not for large content.

• Not for large graphs.

Page 25: Big data   hype or reality
Page 26: Big data   hype or reality
Page 27: Big data   hype or reality

Polyglot persistence

• Relational databases are not dead.

• Enterprises should expect multiple data-storage technologies for different applications.

• Even for a single application, polyglot persistence is good.

• Do not replace one database solution with another to expect wonders.

Page 28: Big data   hype or reality
Page 29: Big data   hype or reality
Page 30: Big data   hype or reality

Next steps.

Page 31: Big data   hype or reality

Learning and case study group.

Page 32: Big data   hype or reality

I need datastores:- Openstreetmap.- NASA- ....

Page 33: Big data   hype or reality

Q&A


Top Related