+ All Categories
Home > Software > Streaming Data Integration - For Women in Big Data Meetup

Streaming Data Integration - For Women in Big Data Meetup

Date post: 16-Apr-2017
Category:
Upload: gwen-chen-shapira
View: 502 times
Download: 1 times
Share this document with a friend
31
1 Confidential Streaming Data Integration with Apache Kafka
Transcript
Page 1: Streaming Data Integration - For Women in Big Data Meetup

1Confidential

Streaming Data Integrationwith Apache Kafka

Page 2: Streaming Data Integration - For Women in Big Data Meetup

2Confidential

About Gwen

Gwen Shapira – System Architect @Confluent

PMC @ Apache Kafka

Moving data round since 2000

Previously:

• Software Engineer @ Cloudera

• Oracle Database Consultant

Find me:

[email protected]

• @gwenshap

Page 3: Streaming Data Integration - For Women in Big Data Meetup

3Confidential

The Plan

1. What is Data Integration About?2. How things changed?3. What is difficult and important?4. How we solve things in Kafka?

Page 4: Streaming Data Integration - For Women in Big Data Meetup

4Confidential

Data Integration

Making sure the right dataGets to the right places

Page 5: Streaming Data Integration - For Women in Big Data Meetup

5Confidential

10 years ago…

 

InformaticaDataStageManual Optimizations

Page 6: Streaming Data Integration - For Women in Big Data Meetup

6Confidential

5 years ago…

Page 7: Streaming Data Integration - For Women in Big Data Meetup

7Confidential

Page 8: Streaming Data Integration - For Women in Big Data Meetup

8Confidential

Page 9: Streaming Data Integration - For Women in Big Data Meetup

9Confidential

Today…

• Everything streaming• Everything real-time• Everything in-memory• Everything containers• Everything clouds

Page 10: Streaming Data Integration - For Women in Big Data Meetup

10Confidential

These Things Matter

• Reliability – Losing data is (usually) not OK. • Exactly Once vs At Least Once

• Timeliness • Push vs Pull• High throughput, Varying throughput

• Compression, Parallelism, Back Pressure

• Data Formats• Flexibility, Structure

• Security• Error Handling

Page 11: Streaming Data Integration - For Women in Big Data Meetup

11Confidential

Page 12: Streaming Data Integration - For Women in Big Data Meetup

12Confidential

After: Stream Data Platform with Kafka Distribute

d Fault Tolerant Stores Messages

Search Security

Fraud Detection Application

User Tracking Operational Logs Operational MetricsEspresso Cassandra Oracle

Hadoop Log Search Monitoring Data Warehouse

Kafka

Processes Streams

Page 13: Streaming Data Integration - For Women in Big Data Meetup

13Confidential

Page 14: Streaming Data Integration - For Women in Big Data Meetup

14Confidential

14

Page 15: Streaming Data Integration - For Women in Big Data Meetup

15Confidential

15

Page 16: Streaming Data Integration - For Women in Big Data Meetup

16Confidential

16

Page 17: Streaming Data Integration - For Women in Big Data Meetup

17Confidential

17

Page 18: Streaming Data Integration - For Women in Big Data Meetup

18Confidential

IntroducingKafka Connect

Large-scale streaming data import/export for Kafka

Page 19: Streaming Data Integration - For Women in Big Data Meetup

19Confidential

Page 20: Streaming Data Integration - For Women in Big Data Meetup

20Confidential

Overview of Connect

1. Install a cluster of Workers2. Download / Build and install Connector Plugins3. Use REST API to Start and Configure Connectors4. Connectors start Tasks. Tasks run inside Workers and copy data.

Page 21: Streaming Data Integration - For Women in Big Data Meetup

21Confidential

Page 22: Streaming Data Integration - For Women in Big Data Meetup

22Confidential

Page 23: Streaming Data Integration - For Women in Big Data Meetup

23Confidential

Page 24: Streaming Data Integration - For Women in Big Data Meetup

24Confidential

Page 25: Streaming Data Integration - For Women in Big Data Meetup

25Confidential

Page 26: Streaming Data Integration - For Women in Big Data Meetup

26Confidential

Page 27: Streaming Data Integration - For Women in Big Data Meetup

27Confidential

Page 28: Streaming Data Integration - For Women in Big Data Meetup

28Confidential

Page 29: Streaming Data Integration - For Women in Big Data Meetup

30Confidential

Page 30: Streaming Data Integration - For Women in Big Data Meetup

31Confidential

Page 31: Streaming Data Integration - For Women in Big Data Meetup

32Confidential

Questions?


Recommended