+ All Categories
Home > Education > Hudson Data Corp Training

Hudson Data Corp Training

Date post: 13-Apr-2017
Category:
Upload: menish-gupta
View: 170 times
Download: 1 times
Share this document with a friend
7
Transcript
Page 1: Hudson Data Corp Training
Page 2: Hudson Data Corp Training

© 2015 Hudson Data Corp. All Rights Reserved. www.bitbootcamp.com

2

AlgorithmsThe Brains

– Introduction to Data Science

– Data Munging & Fusion– Text Mining

•Naïve Bayes– Recommendation Engines– Principal Component

Analysis– Classification

•Decision Trees• Random Forest•Gradient Boosting Machines

– Generalized Linear Models– Clustering

• KNN• K-Means

– Graph Theory– Stable Marriage

HadoopBig Data

CoreEngineering

Our Training OfferingsSkills you need

Page 3: Hudson Data Corp Training

© 2015 Hudson Data Corp. All Rights Reserved. www.bitbootcamp.com

3Corporate TrainingOur Process

1 3 5 7

2 4 6

Deliver TrainingDevelop Use Case

Measure Impact

Understand Business Needs

.

Proposal / Contract

.

Pre Training Support

• Reading Materials• Environment

Setup

Post Training Support• Emails• Private discussion

boards

Page 4: Hudson Data Corp Training

© 2015 Hudson Data Corp. All Rights Reserved. www.bitbootcamp.com

4Sample Corporate Training3 Day Corporate Training in Data Science

Day 1

CoreEngineering

– Introduction to Data Science

– Recommendation Engine

– Classifications•Decision Trees•Random Forest

Day 2

AlgorithmsThe Brains

– Business Problem

– Ext. Data Dictionary

– Univariate Analysis

– Random Forest

– Model Validation

– Results

Day 3

Use Case IPractice

For Data Science

Page 5: Hudson Data Corp Training

© 2015 Hudson Data Corp. All Rights Reserved. www.bitbootcamp.com

5Sample Corporate Training5 Day Corporate Training in Data Science

Day 1 Day 2– Introduction to

Data Science

– Recommendation Engine

– Classifications•Decision Trees•Random Forest•Gradient Boosting Machines (GBM )

Day 3– Business Problem

– Ext. Data Dictionary

– Univariate Analysis

– Random Forest

– Model Validation

– Results

Day 4 Day 5

CoreEngineering

HadoopBig Data

AlgorithmsThe Brains

Use Case IPractice

Use Case IIPractice

– Business Problem

– Ext. Data Dictionary

– Univariate Analysis

– GBM

– Model Validation

– Results

For Data Science

Page 6: Hudson Data Corp Training

© 2015 Hudson Data Corp. All Rights Reserved. www.bitbootcamp.com

6

Day 1

Introductions• Motivation for Big Data• Unix for Data Science• Pushing and Pulling data from remote

servers• Columnar Compressions• Extended Data Dictionary

Morning Afternoon

Python for Data Science• Thinking in Python• Python design patterns for data analytics• Pandas• Data Frames• Aggregations• Python with Parallel powers

Unix Assignments• Process data in parallel• Working with remote

Machines

Python Assignments

Sample Day Breakdown

Data Set Used• Google N-Gram• 100 Million Records

• Data Processing in Python• Python scripts and automation

Page 7: Hudson Data Corp Training

© 2015 Hudson Data Corp. All Rights Reserved. www.bitbootcamp.com

7

[email protected] 917-819-0106201-314-5838

www.bitbootcamp.com25 BroadwaySuite 1032

New York, NY

Contact UsMade in NYC


Recommended