Pranav RastogiProduct Specialist for Big Data, Microsoft
With host Andrew BrustMarket Strategy Advisor, Io-TahoeCEO, Blue Badge Insights
Big Data and AI – does one serve the other?
For external use
Speaker bios
2
Pranav Rastogi• Leader on team for HDInsight, Microsoft Azure’s
Hadoop/Spark Big Data-as-a-Service offering• Developer ecosystem experience; tenure includes
roles on Azure Redis Cache and WebJobs services• Key contributor to .NET Web dev tools and platform
Andrew Brust• Covers Big Data and analytics for ZDNet• Strategy Advisor to Io-Tahoe• Data-focused tech career started in 1985
For external use
Analytics analysis
For external use
Evolution of analytics
4
1. First there was query, drill-down and basic charts
2. Then came the Yahoo-like Big Data use casesBig Internet companies had the compute Cloud made this power accessible to other companies
3. Streaming data? Same story
4. Now analytics is about running stuff at scale
For external use
Standardization of analytics
5
Modern analytics is breaking down into four standardized workloads:• Batch• Query• Streaming• Data Science
If Data Science is just one of four standard workloads why does it seem that everyone is saying that’s the whole show?
For external use
Deconstructing the AI
juggernaut
For external use
Data Science audiences
Consumers• App developers• Apple Core ML, Tensor#, Mobius• Non-determinism is paradigm-
dissonant for developers• How to standardize conveyance
of accuracy/confidence level
Producers• Databricks, Sagemaker,
Azure Machine Learning (ML) are geared to this audience
• What about operationalization?
Users• Can accept/reject and
further train the model
Developer-Producers?
7
For external use
Trends and market factors
8
The move from batch to streaming
Also marks shift to time-series data
ML/predictive analytics follows, in turn
Spark SQLSnowflake
Databricks DeltaKSQL
SQL 2017 Native Scoring
Appeals to "Consumers" and "Users"
The SQL gold standardRecommendations
Time Series predictions (e.g. for revenue in Excel)
Smarter Tools:Excel
PowerPointIo-Tahoe
“Mainstream ML” examples
For external use
Analytics as AI’s apprentice
9
Analytics for analytics’ sakeUnderstanding truth is often
more important than predicting likelihood
Basic discovery helps identify what to predict
Common goal is to discover what the data can reveal
It’s all about AIAnalysis serves feature
engineering
Data prep serves to cleanse training data
Everything subservient to prediction
Embedded AI or DIY?
10
Thank you
11
This file is private and may contain confidential and proprietary information. If you have received this file in error, please notify us and remove it from your system, and note that you must not copy, distribute or take any action in reliance on it. Any unauthorized use or disclosure of the contents of this file is not permitted and may be unlawful. Io-Tahoe LLC, 111 Broadway, Suite 601, New York, NY 10006. www.io-tahoe.com
Io-Tahoe® is a registered trade mark of Centrica plc. © Io-Tahoe LLC 2018