Transcript
Page 1: Enterprise data science - What it takes to build?

Enterprise data science learning solution

A practical approach to big data learning

CloneSkills, Inc. (916)-296-0228

Learn to lead big data - Enterprise data science a practical approach

CloneSkills, Inc.

http://www.CloneSkills.com

Architect : Karthik Rajamanickam

Page 2: Enterprise data science - What it takes to build?

Objective

• Introduce the key components typically used to deliver enterprise data science
• Demonstrate the steps to move data between Oracle 12C and HADOOP using Sqoop
• Review the data flow between SAP HANA and HADOOP using Smart Data Access

Page 3: Enterprise data science - What it takes to build?

Our Enterprise Data Science Platform

HADOOP Distribution

SAP HANA | Oracle 12C

Social | Forum | Blog | Web

File | Text

Analytics

What’s involved in building enterprise data science?

Page 4: Enterprise data science - What it takes to build?

Our enterprise data science platform components - Our lab (CSLAB)

Enterprise Components

• SAP HANA
• SAP BOBJ
• Oracle 12C
• Oracle ODI

HADOOP Components

• HDFS
• HBase
• Hive
• Impala
• Pig
• Search
• Shell
• MapReduce
• Sqoop
• Oozie
• ZooKeeper
• Hue
• Dashboard
• Editor

Page 5: Enterprise data science - What it takes to build?

Our (CSLAB) on-demand lab infrastructure

__________________________________

• SAP HANA
• SAP BOBJ
• Oracle 12C
• Oracle ODI
• HADOOP

Node 1

Node 2

Node 3

Node 4

Node 5

Node 6

Our enterprise data science platform technical components

Page 6: Enterprise data science - What it takes to build?

Our three (3) node HADOOP cluster

Our enterprise data science platform - HADOOP infrastructure

Page 7: Enterprise data science - What it takes to build?

Our HADOOP core components

________________

• Hive
• Impala
• Pig
• Search
• HBase
• Shell
• MapReduce
• Sqoop
• Hue
• HDFS
• Oozie
• ZooKeeper

Our enterprise data science platform - HADOOP components

Page 8: Enterprise data science - What it takes to build?

Our HADOOP core components

________________

• Hive
• Impala
• Pig
• Search
• HBase
• Shell

Our enterprise data science platform - Hue components

Page 9: Enterprise data science - What it takes to build?

Our Oracle 12C infrastructure

Our enterprise data science platform - Oracle

Page 10: Enterprise data science - What it takes to build?

Our Oracle 12C infrastructure

Our enterprise data science platform - Oracle

Page 11: Enterprise data science - What it takes to build?

Our Oracle ODI (Oracle Data Integrator) infrastructure

Our enterprise data science platform - Oracle data integrator (ODI)

Page 12: Enterprise data science - What it takes to build?

SAP HANA

_______________

Smart Data Access connects SAP HANA and HADOOP

Our enterprise data science platform – SAP HANA

Page 13: Enterprise data science - What it takes to build?

SAP HANA

_______________

Smart Data Access connects SAP HANA and HADOOP

Our enterprise data science platform - SAP HANA and HADOOP integration
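
The slide only names Smart Data Access as the link between SAP HANA and HADOOP. As a rough sketch of what that link usually involves, the Python helper below builds the two SQL statements typically needed on the HANA side: a remote source and a virtual table that points at a Hive table. The `hiveodbc` adapter, the DSN, and all object names are assumptions for illustration, not values from the lab, and the exact four-part remote path depends on the adapter and HANA revision.

```python
from typing import List


def smart_data_access_sql(remote_source: str, dsn: str, database: str,
                          schema: str, remote_table: str,
                          hana_schema: str, virtual_table: str) -> List[str]:
    """Return the SQL text for a remote source and a virtual table (not executed here)."""
    create_source = (
        f'CREATE REMOTE SOURCE "{remote_source}" ADAPTER "hiveodbc" '
        f"CONFIGURATION 'DSN={dsn}' "
        "WITH CREDENTIAL TYPE 'PASSWORD' "
        "USING 'user=<hive_user>;password=<hive_password>'"  # placeholders, fill in locally
    )
    create_virtual_table = (
        f'CREATE VIRTUAL TABLE "{hana_schema}"."{virtual_table}" '
        f'AT "{remote_source}"."{database}"."{schema}"."{remote_table}"'
    )
    return [create_source, create_virtual_table]


# Hypothetical names, for illustration only.
for statement in smart_data_access_sql(
        "HIVE1", "hive1", "HIVE", "default", "employee_jp",
        "CSLAB", "V_EMPLOYEE_JP"):
    print(statement + ";")
```

Once the virtual table exists, HANA queries can read it like a local table while the data stays in HADOOP.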

Page 14: Enterprise data science - What it takes to build?

[Diagram: Oracle 12C, Sqoop (Import / Export), HADOOP Distribution]

Steps to move data between Oracle and HADOOP using Sqoop

Page 15: Enterprise data science - What it takes to build?

Oracle table and its data

Review Oracle table – EMPLOYEE_JP

Page 16: Enterprise data science - What it takes to build?

Sqoop Job

Sqoop job creation

Page 17: Enterprise data science - What it takes to build?

Sqoop Job

____________

Create connection to Oracle

Sqoop job creation - Create connection to Oracle
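
The connection step comes down to a JDBC URL plus credentials. A minimal sketch, assuming the Oracle thin driver and its service-name URL form; the host, port, and service name are hypothetical placeholders, not the lab's actual values.

```python
def oracle_jdbc_url(host: str, port: int, service_name: str) -> str:
    """Build an Oracle thin-driver JDBC URL in service-name form."""
    if not host or not service_name:
        raise ValueError("host and service_name are required")
    return f"jdbc:oracle:thin:@//{host}:{port}/{service_name}"


# Hypothetical lab values, for illustration only.
print(oracle_jdbc_url("oracle12c.example.com", 1521, "ORCLPDB1"))
```

The resulting string is what a Sqoop connection expects as its JDBC connection string.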

Page 18: Enterprise data science - What it takes to build?

Sqoop Job

____________

Oracle source table details

Sqoop job creation - Configure source table

Page 19: Enterprise data science - What it takes to build?

Sqoop Job

____________

Oracle source table and column details

Sqoop job creation - Configure source table and the primary key of the table

Page 20: Enterprise data science - What it takes to build?

Sqoop Job

____________

Destination in HADOOP (HDFS output files)

Sqoop job creation - Configure data target, HDFS files (output files)
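
The slides configure the job through a UI: a connection, a source table, a key column, and an HDFS target. Assuming the Sqoop 1 command-line client is available instead, the same four pieces map onto one `sqoop import` call. The sketch below only assembles the argument list. `EMPLOYEE_JP` comes from the slides; the user, paths, and the `EMPLOYEE_ID` key column are hypothetical.

```python
import shlex
from typing import List


def sqoop_import_args(jdbc_url: str, username: str, password_file: str,
                      table: str, split_by: str, target_dir: str,
                      num_mappers: int = 4) -> List[str]:
    """Assemble a Sqoop 1 `import` command as an argument list."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,              # Oracle connection
        "--username", username,
        "--password-file", password_file,   # file that holds the password
        "--table", table,                   # source table
        "--split-by", split_by,             # column used to divide work among mappers
        "--target-dir", target_dir,         # HDFS output directory
        "--num-mappers", str(num_mappers),
    ]


args = sqoop_import_args(
    jdbc_url="jdbc:oracle:thin:@//oracle12c.example.com:1521/ORCLPDB1",  # hypothetical
    username="CSLAB_USER",                   # hypothetical
    password_file="/user/cslab/.oracle_pw",  # hypothetical
    table="EMPLOYEE_JP",                     # table named in the slides
    split_by="EMPLOYEE_ID",                  # hypothetical primary key column
    target_dir="/user/cslab/employee_jp",    # hypothetical
)
print(" ".join(shlex.quote(a) for a in args))
```

Keeping the command as a list rather than one string makes it safe to hand to `subprocess.run` without shell quoting issues.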

Page 21: Enterprise data science - What it takes to build?

Sqoop Job

____________

Job extraction log

Run Sqoop job - review job log

Page 22: Enterprise data science - What it takes to build?

Sqoop Job

____________

HDFS destination files

Sqoop job output - HDFS output file, destination files

Page 23: Enterprise data science - What it takes to build?

Sqoop Job

____________

Oracle data in HADOOP - preview

Sqoop job output - Oracle data in HADOOP HDFS files
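
To preview the imported data outside the UI, the output files can be read as delimited text. A minimal sketch, assuming the job wrote comma-delimited text without enclosing quotes (Sqoop's default text layout). The slides do not list the `EMPLOYEE_JP` columns, so the column names and sample rows below are made up.

```python
import csv
import io
from typing import Dict, List


def parse_sqoop_text(text: str, columns: List[str]) -> List[Dict[str, str]]:
    """Parse comma-delimited text into one dict per record."""
    reader = csv.reader(io.StringIO(text), delimiter=",", quoting=csv.QUOTE_NONE)
    return [dict(zip(columns, row)) for row in reader if row]


# Made-up sample rows standing in for the contents of one HDFS output file.
sample = "101,Sato,Tokyo\n102,Suzuki,Osaka\n"
for record in parse_sqoop_text(sample, ["EMPLOYEE_ID", "NAME", "CITY"]):
    print(record)
```

All values come back as strings; type conversion is left to the caller because the real column types are not shown in the slides.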

Page 24: Enterprise data science - What it takes to build?

Sqoop Job

____________

Data has been imported from Oracle to HADOOP

Sqoop Job

____________

We can also export data from HADOOP and then load it into Oracle

Sqoop job output - Data has been moved from Oracle to HADOOP
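
For the reverse direction, HADOOP back into Oracle, the Sqoop 1 command-line client offers `sqoop export`. The slides do not show this step, so the sketch below is an assumption about how it could be scripted. The target table must already exist in Oracle, and every name and path here is hypothetical.

```python
import shlex
from typing import List


def sqoop_export_args(jdbc_url: str, username: str, password_file: str,
                      table: str, export_dir: str,
                      field_delimiter: str = ",") -> List[str]:
    """Assemble a Sqoop 1 `export` command as an argument list."""
    return [
        "sqoop", "export",
        "--connect", jdbc_url,
        "--username", username,
        "--password-file", password_file,
        "--table", table,             # existing Oracle target table
        "--export-dir", export_dir,   # HDFS directory holding the files to load
        "--input-fields-terminated-by", field_delimiter,
    ]


export_args = sqoop_export_args(
    jdbc_url="jdbc:oracle:thin:@//oracle12c.example.com:1521/ORCLPDB1",  # hypothetical
    username="CSLAB_USER",                   # hypothetical
    password_file="/user/cslab/.oracle_pw",  # hypothetical
    table="EMPLOYEE_JP_COPY",                # hypothetical target table
    export_dir="/user/cslab/employee_jp",    # hypothetical
)
print(" ".join(shlex.quote(a) for a in export_args))
```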

Page 25: Enterprise data science - What it takes to build?

Our enterprise data science use case

Page 26: Enterprise data science - What it takes to build?

Stay tuned, more to come. Thank you!
