Hiro Yoshikawa, Founder and CEO [email protected]
650-810-6184
Kazuki Ohta, Founder and CTO [email protected]
650-223-5679
Treasure Data
Cloud Data Platform
Friday, August 2, 13
Hiro Yoshikawa – CEO - Open Source business veteran at Red Hat
Kazuki Ohta – CTO - Founder of the World’s largest Hadoop group
Keith Goldstein – VP Business Dev - VP of BD at TIBCO, Talend
Jeff Yuan - Engineering Director - LinkedIn, MIT/Michael Stonebraker Lab
Investors (partial list):
Bill Tai - Chairman of the board
Jerry Yang – Yahoo! founder
James Lindenbaum – Heroku Founder
Yukihiro “Matz” Matsumoto – Ruby creator
Othman Laraki - ex-VP Growth at Twitter
Business, Team & Investors
Founded to deliver big data analytics in days, not months, without specialist IT resources
• Service-based subscription business model
Treasure Data is in production for 80+ customers
• Incl. Fortune 500 companies
• 500+ billion records stored
• Wide variety of use cases
World-class team
• Great open source team
• Top investors
The Problem with Other Solutions
[Chart: customer value over time, starting at sign-up or PO]
On-Premise Solutions: obsolescence over time; need upgrade
AWS (or hosted Hadoops): EC2, EMR, RedShift, S3; step-by-step manual integrations; maintain
Treasure Data: fully integrated Big Data full-stack service with a simple interface, low-friction initial engagement, and continuous technical upgrades
Complex Solutions + NO Specialists = TOO LONG to get Live
Data Collection
+
Columnar Storage +
Hadoop MapReduce
500bil+ records, 2mil+ jobs
Product
Data Collection, Data Warehouse, Data Analysis
Open-Source Log Collector: 2,000+ companies (incl. LinkedIn, etc.)
Bulk Loader: CSV / TSV, MySQL, Postgres, Oracle, etc.
Web Log
App Log
Sensor
RDBMS
CRM
ERP
BI Tools: Tableau, QlikView, Excel, etc.
REST, JDBC / ODBC
SQL (HiveQL)
Pig
Bulk Upload, Parallel Upload
Value Proposition: “Time-to-Answer”
20bil+, 2 weeks, UK/Austria
3bil+, 3 weeks, Singapore
2 weeks, US
2 weeks, US
3 weeks, Japan
Dashboard
Custom App, RDBMS, FTP, etc.
Result push
Multi-Tenant: Speed of Improvements + Ease of Management (e.g. SFDC, Heroku)
Streaming Upload: >80 billion / month
JSON (MsgPack)
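To make the streaming upload figure above more concrete, here is a minimal Python sketch. It assumes events are shaped the way Fluentd models them (a tag, a timestamp, and a record). The tag `web.access`, the record fields, the helper names `make_event` and `events_per_second`, and the 30-day month are hypothetical choices for illustration, not Treasure Data's API. Only the standard-library `json` module is used; MessagePack encoding would need a third-party library and is not shown.

```python
import json
import time


def make_event(tag, record, timestamp=None):
    """Build a (tag, time, record) event as a plain dict.

    Hypothetical helper: the field names mirror Fluentd's event model,
    but this is not a real client library call.
    """
    if timestamp is None:
        timestamp = int(time.time())
    return {"tag": tag, "time": timestamp, "record": record}


def events_per_second(events_per_month, days_per_month=30):
    """Convert a monthly event volume into an average per-second rate.

    Assumes a 30-day month by default; real traffic is bursty, so this
    is only an average.
    """
    return events_per_month / (days_per_month * 24 * 60 * 60)


# A sample web log event with a fixed timestamp so the output is stable.
event = make_event(
    "web.access",
    {"path": "/index.html", "status": 200},
    timestamp=1375401600,
)
line = json.dumps(event, sort_keys=True)

# Average rate implied by the ">80 billion / month" figure on the slide.
rate = events_per_second(80e9)

print(line)
print(f"average rate: about {rate:,.0f} events/second")
```

Under these assumptions, 80 billion events per month averages out to roughly 30,000 events per second, which gives a sense of the sustained ingest load behind the slide's number.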
A case: “14 Days” from Signup to Success
1. Europe’s largest mobile ad exchange.
2. Serving >20 billion imps/month for >15,000 mobile apps (Q1 2013)
3. Immediate need of analytics infrastructure: ASAP!
4. With TD, MobFox went into production in only 14 days, with one engineer.
"Time is the most precious asset in our fast-moving business, and Treasure Data saved us a lot of it."
Julian Zehetmayr, CEO & Founder
td-agent = fluentd rpm/deb
A case: “Replace” in-house Hadoop with TD
1. Global “Hulu” - Online Video Service with millions of users
2. Video content is distributed in over 150 languages.
3. Had a hard time maintaining its Hadoop cluster
4. With TD, Viki retired its in-house Hadoop cluster and freed up engineering time for its core business.
Before
After
"Treasure Data has always given us thorough and timely support peppered with insightful tips to make the best use of their service."
Huy Nguyen, Software Engineer
A case: Treasure Data with BI Tool (Tableau)
1. World’s largest Android application market
2. Serving >3 billion app downloads for >100 million users
3. Only one engineer managing the data infrastructure
4. With TD, the data engineer can focus on analyzing data with the existing BI tool
"I will recommend Treasure Data to my friends in a heartbeat because it benefits all three stakeholders: Operations, Engineering and Business."
Simon Dong, Principal Architect - Data Engineering
AWS (IaaS)
Columnar Storage
Hadoop Hive / Pig
Low-Latency Query Executor
Log Collector
REST API & Mgmt Console
Data Mart
BI + Analytics tool connectivity
Dynamic Table Partitioning
Full-Stack Cloud Data Platform
Multi-Tenant
Inter-DC Fair Scheduler
Resource Isolation
Access Control
Catalog Services
Config Automation
Other Cloud stack, on-premise
ANSI SQL
Bulk Loader Mobile SDK External Sources
Competitive Landscape
Data Storage
On-Premise
Data collection
Hadoop Distro
Visualization, BI, Analytical apps
Processing Platform
Connections/Integrations
EMR
Flume
• Most big data players, whether cloud or on-premise, have faced technical challenges in data collection and in designing trustworthy multi-tenant data storage.
• Treasure Data solves both with Fluentd and our own columnar DB on top of cloud storage solutions.
TD is the one-stop, full-stack solution
Cloud
Redshift
Partnering
Streaming upload
Partner Eco-System
Data Collection
Data Warehouse
Data Analysis (Data visualization/BI, ETL)
Data Sources (PaaS/app runtime, SaaS, IaaS)
System Integration + OEM
JDBC etc.
to be launched
www.treasure-data.com | @TreasureData