Date posted: 13-Apr-2017
Category: Education
Uploaded by: menish-gupta
© 2015 Hudson Data Corp. All Rights Reserved. www.bitbootcamp.com

Our Training Offerings: Skills you need

Algorithms: The Brains
– Introduction to Data Science
– Data Munging & Fusion
– Text Mining
  • Naïve Bayes
– Recommendation Engines
– Principal Component Analysis
– Classification
  • Decision Trees
  • Random Forest
  • Gradient Boosting Machines
– Generalized Linear Models
– Clustering
  • KNN
  • K-Means
– Graph Theory
– Stable Marriage

Hadoop: Big Data

Core: Engineering

Corporate Training: Our Process
1. Understand Business Needs
2. Proposal / Contract
3. Develop Use Case
4. Pre Training Support
  • Reading Materials
  • Environment Setup
5. Deliver Training
6. Post Training Support
  • Emails
  • Private discussion boards
7. Measure Impact

Sample Corporate Training: 3 Day Corporate Training in Data Science

Day 1
Core: Engineering for Data Science

Day 2
Algorithms: The Brains
– Introduction to Data Science
– Recommendation Engine
– Classifications
  • Decision Trees
  • Random Forest

Day 3
Use Case I: Practice
– Business Problem
– Ext. Data Dictionary
– Univariate Analysis
– Random Forest
– Model Validation
– Results

Sample Corporate Training: 5 Day Corporate Training in Data Science

Day 1
Core: Engineering for Data Science

Day 2
Hadoop: Big Data

Day 3
Algorithms: The Brains
– Introduction to Data Science
– Recommendation Engine
– Classifications
  • Decision Trees
  • Random Forest
  • Gradient Boosting Machines (GBM)

Day 4
Use Case I: Practice
– Business Problem
– Ext. Data Dictionary
– Univariate Analysis
– Random Forest
– Model Validation
– Results

Day 5
Use Case II: Practice
– Business Problem
– Ext. Data Dictionary
– Univariate Analysis
– GBM
– Model Validation
– Results

Sample Day Breakdown: Day 1

Morning
Introductions
• Motivation for Big Data
• Unix for Data Science
• Pushing and pulling data from remote servers
• Columnar compressions
• Extended Data Dictionary
Unix Assignments
• Process data in parallel
• Working with remote machines

Afternoon
Python for Data Science
• Thinking in Python
• Python design patterns for data analytics
• Pandas
• Data Frames
• Aggregations
• Python with parallel powers
Python Assignments
• Data processing in Python
• Python scripts and automation

Data Set Used
• Google N-Gram
• 100 Million Records
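
As a rough illustration of the "Aggregations" and "Process data in parallel" topics listed for Day 1, the sketch below totals match counts per n-gram by splitting rows into chunks, aggregating each chunk on a worker pool, and merging the partial results. It is not taken from the course materials. It uses only the Python standard library rather than Pandas, and the row layout (ngram, year, match_count, volume_count separated by tabs), the sample rows, and all function names are assumptions made for this example.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Hypothetical sample rows in an assumed tab-separated layout:
# ngram <TAB> year <TAB> match_count <TAB> volume_count
SAMPLE_ROWS = [
    "data\t2007\t120\t30",
    "data\t2008\t150\t35",
    "science\t2007\t80\t20",
    "science\t2008\t95\t25",
    "python\t2008\t40\t10",
]


def count_chunk(rows):
    """Sum match_count per n-gram for one chunk of rows."""
    totals = Counter()
    for row in rows:
        ngram, _year, match_count, _volumes = row.split("\t")
        totals[ngram] += int(match_count)
    return totals


def split_into_chunks(rows, n_chunks):
    """Split rows into at most n_chunks slices of roughly equal size."""
    size = max(1, -(-len(rows) // n_chunks))  # ceiling division
    return [rows[i:i + size] for i in range(0, len(rows), size)]


def aggregate(rows, n_chunks=2):
    """Aggregate each chunk on a worker pool, then merge the partial totals."""
    chunks = split_into_chunks(rows, n_chunks)
    merged = Counter()
    with ThreadPoolExecutor(max_workers=n_chunks) as pool:
        for partial in pool.map(count_chunk, chunks):
            merged.update(partial)  # Counter.update adds counts together
    return merged


totals = aggregate(SAMPLE_ROWS)
for ngram, count in sorted(totals.items()):
    print(ngram, count)
```

A thread pool keeps the sketch portable. For CPU-bound work on a data set of the size mentioned above, `concurrent.futures.ProcessPoolExecutor` offers the same `map` interface and could be swapped in, with the driver code placed under an `if __name__ == "__main__":` guard.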

Contact Us
[email protected]
917-819-0106
201-314-5838
www.bitbootcamp.com
25 Broadway, Suite 1032
New York, NY
Made in NYC