Two Elephants in the Room!!
Denish Patel, Database Architect
Challenge
Big Data!
Challenge Simplified
1. Volume
   a. Size
2. Velocity
   a. Streaming
   b. Response time
3. Variety
   a. Structured
   b. Semi-structured
   c. Unstructured
4. Veracity
   a. Uncertainty
   b. Truthfulness
Solution
Sqoop
PostgreSQL
Multi-Model Database Server
● Relational
● Object Relational
● Nested Relational
● Array Stores
● Key-Value Store (hstore)
● Document Store (XML, JSON)
● Range Types (PostgreSQL 9.2)
Hadoop
A Distributed File System
● ETL
  ○ Helps convert unstructured data into structured data for analytics
● Ease of working on unstructured data
  ○ Log processing
  ○ Data streaming in real time
● Parallel processing
  ○ MapReduce
● Scales to petabytes
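The ETL, log-processing, and MapReduce points above can be illustrated with a minimal single-process sketch. It is not Hadoop code: it only mimics the map, shuffle, and reduce phases in plain Python. The log format, the sample lines, and every function name here are assumptions made up for illustration.

```python
import re
from collections import defaultdict

# Hypothetical access-log format, assumed only for this illustration:
#   <ip> [<timestamp>] "<method> <path>" <status>
LOG_PATTERN = re.compile(
    r'^(?P<ip>\S+) \[(?P<ts>[^\]]+)\] "(?P<method>\S+) (?P<path>\S+)" (?P<status>\d{3})$'
)


def parse_line(line):
    """ETL step: turn one raw log line into a structured record, or None."""
    match = LOG_PATTERN.match(line.strip())
    if match is None:
        return None  # unparseable lines are skipped rather than guessed at
    record = match.groupdict()
    record["status"] = int(record["status"])
    return record


def map_phase(lines):
    """Map: emit a (status, 1) pair for every line that parses."""
    for line in lines:
        record = parse_line(line)
        if record is not None:
            yield record["status"], 1


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Reduce: collapse each key's values into a single count."""
    return {key: sum(values) for key, values in grouped.items()}


def count_by_status(lines):
    """Run map, shuffle, and reduce in sequence on an iterable of lines."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


# Made-up sample input; the third line is deliberately malformed.
SAMPLE_LINES = [
    '10.0.0.1 [2013-01-01T10:00:00] "GET /index.html" 200',
    '10.0.0.2 [2013-01-01T10:00:01] "GET /missing" 404',
    'garbage line that does not parse',
    '10.0.0.1 [2013-01-01T10:00:02] "POST /login" 200',
]

if __name__ == "__main__":
    print(count_by_status(SAMPLE_LINES))
```

In a real cluster the map and reduce functions would run in parallel on many nodes over files stored in the distributed file system. The structured counts they produce are the kind of result that could then be loaded into PostgreSQL for analytics.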
Take away?
The two elephants (PostgreSQL and Hadoop) work together very well and can solve "most" Big Data challenges.