Date post: | 21-Jan-2017 |
Category: |
Internet |
Upload: | dr-haxel-congress-and-event-management-gmbh |
View: | 597 times |
Download: | 3 times |
Data ScienceWith R & Vanilla Air
Business Intelligence &Analytics - Data Science Platform
Patrick Beaucamp – [email protected]
General Introduction
II-SDV, Nice 19th April 2016
Presentation Agenda
2Data Science with R & Vanilla Air
Landscape for Statistics & Analytics- Open Source : R, Knime, Weka, RapidMiner- Commercial : SAS – SPSS - Watson- A Key Decision from FDA for R : december 2014
Demo Platform : Vanilla & Vanilla Air
Business Intelligence versus Data Science
R Platform Introduction… need for visualization and server-ready !!!
Introduction
3Data Science with R & Vanilla Air
If you don’t find it, it doesn’t exist !
Document Data inside document
Business Intelligence - Subject
4
Project Initialisation
• Requests for Report, Dashboard, to visualize data stored in production database
• Requests to access data from various database and build global activity report, kpi projects
• Projects to align number with process, to set global rules for calculation of Kpi, to deliver legacy reports, etc …
Focus on
• Data Quality & Data consistancy, using ETL & Data Quality tools
• Define rules to aggregate data, to standardize informations, to clean data, using Master Data Management tools
• Loading Data into Datawarehouse (ODS, DWH and DTM parts), using ETL tools
• Define Reports, Dashboard, KPI and Cube with end users, and adjust Datamart structure to comply with the
expectation
• Create Report, Dashboard, Cube and various Metadata to provide access to validated data
• Define Workflow to process - for example - data loading + kpi calculation + report creation
Business Intelligence• Reporting• OLAP (cubes)• Dashboards• KPI (performance indicators)• Maps (OSM support)
ETL & WorkFlow• Master Data
Management• Data Quality• Data Profiling
Data Science with R & Vanilla Air
Data Science - Subject
7
Project Initialisation
• Requests to understand why such data results are available – Business Question
• Request to cross existing information with additional information, to add value to existing data
• Projects to try to build model to understand data, such as clustering, association, decision tree
• Projects to try to build forecasting & predictive models
Focus on
• Platform & Components, such as predictive language (R is recommanded)
• External data analysis & integration : what are the external information which influence my data
• Analysing data and building model to explain correlation between data, impact on data input
modification
• Building statistics, analytics & predicative models
• Providing tools to advanced users to access data, visualize data, manipulate data
Data Science with R & Vanilla Air
Data Science - Platform
8Data Science with R & Vanilla Air
Data Acquisition(Internal – External)
Data Lake(Hadoop)
PredictiveEngine
Data Viz
Data Science - Visualisation
9Data Science with R & Vanilla Air
Data Mining Open Source Landscape
10Data Science with R & Vanilla Air
RapidMiner
Weka
Knime
R :- Rstudio & Shiny- RevolutionAnalytics (Microsoft R Server)- Vanilla Air- ORE (Oracle R Enterprise)
Commercial Corner
13Data Science with R & Vanilla Air
Visualization : Qlik - Tableau
Statistics : Matlab, Statistica, Stata, etc …
DataMining : SAS – SPSS – IBM Watson
R Introduction
16Data Science with R & Vanilla Air
What is R ?R is a programming language and software environment for statistical computing and graphics.
www.R-project.org
Need for Visualization (2/4)
22Data Science with R & Vanilla Air
Jupyter Notebook (Python, Microsoft Azure)
Need for Visualization (3/4)
23Data Science with R & Vanilla Air
Apache Zeppelin (incubation project)
R – Need for Enterprise Ready
25Data Science with R & Vanilla Air
Vanilla Air
Shiny Server
Microsoft R Server
Oracle R Enterprise
Very recently (end 2015) : R Foundation
Certified Packages Server Side Architecture
35Data Science with R & Vanilla Air
Thanks for your attention
Try Vanilla Air:Download and Share your Experience
Questions & Answers
Vanilla Smart Data Business Case
47Vanilla - General Introduction
What does influence my sales ?
How weather can influence sales on product ?
If I can have some weather prediction, can I forecast my sales ?
Retail Industry
Vanilla Smart Data Business Case
48Vanilla - General Introduction
• How to find the better price for my product using more data sources ?• How social media comments on a product can influence its price ?
Purchase Platform
Vanilla Smart Data Business Case
49Vanilla - General Introduction
• Why some products are damaged during the transport: which product ?which transporter ?
• What external events like weather or transport duration can explain the situation ?• What is the best transporter for specific products based on weather forecast ?
Pharmaceutical Industry
Vanilla Smart Data Business Case
50Vanilla - General Introduction
How does temperature evolution and weather impact pathologies
How does holiday & week-end impact pathologies
How the patient are splited in different groupes, based on pathologies, age, gender …
Hospital Analysis
Vanilla Smart Data Business Case
51Vanilla - General Introduction
How does social media impact sales
How to get alerts when social media start discussion on my products
How to set alerts on various « products / social media activity » (including
competition) and evaluate impact on my sales
Beauty Industry