+ All Categories
Home > Technology > SSIS 2008 R2 data flow

SSIS 2008 R2 data flow

Date post: 27-Jan-2015
Category:
Upload: slava-kokaev
View: 116 times
Download: 4 times
Share this document with a friend
Description:
Firestarter SSIS 06 data flow (old 20010 presentation)
Popular Tags:
21
Data Flow The Data Flow Task Encapsulates the data flow engine Exists in the context of an overall control flow Performs traditional ETL in addition to other extended scenarios Is fast and scalable Data Flow Components Extract data from Sources Load data into Destinations Modify data with Transformations Service Paths Connect data flow components Create the pipeline
Transcript
Page 1: SSIS 2008 R2 data flow

Data FlowThe Data Flow Task

Encapsulates the data flow engineExists in the context of an overall control flowPerforms traditional ETL in addition to other extended scenariosIs fast and scalable

Data Flow ComponentsExtract data from SourcesLoad data into DestinationsModify data with Transformations

Service PathsConnect data flow componentsCreate the pipeline

Page 2: SSIS 2008 R2 data flow

Data Flow TaskOne of the most valuable control flow tasks is the Data Flow Task.

Encapsulates the data flow engine

Load

Extract

Transform

Page 3: SSIS 2008 R2 data flow

Understanding a ETL Processing

Page 4: SSIS 2008 R2 data flow

Data Flow ElementsSQL Server Integration Services provides three different types of data flow components:

Data flow source - Sources extract data from data stores such as tables and views in relational databases, files, and Analysis Services databases. Data flow transformations - Transformations modify, summarize, and clean data.Data flow destination - Destinations load data into data stores or create in-memory datasets.

Page 5: SSIS 2008 R2 data flow

Integration Services PathsA Path connects two components in a data flow by connecting the output of one data flow component to the input of another component. A path has a source and a destination.

Page 6: SSIS 2008 R2 data flow

Defining Data Flow SourcesIn SSIS, a source is the data flow component that extracts data from different external data sources and makes it available to the other components in the data flow. Sources have one regular output, and many sources in addition also have one error output.All the output columns are available as input columns to the next data flow component in the data flow.

Sources extract data from:

Relational tables and views

Files

Analysis Services databases

Page 7: SSIS 2008 R2 data flow

Understanding Data Flow Sources

OLEDB Oracle Connection

Data SourceSource Adapter

Page 8: SSIS 2008 R2 data flow

Data Flow DestinationsDestinations are the data flow components that load the data from a data flow into different types of data sources or create an in-memory dataset. Destinations have one input and one error output.

Destinations load data to:

Relational tables and views

Files

Analysis Services databases and objects

DataReaders and Recordsets

Enterprise Edition only

Page 9: SSIS 2008 R2 data flow

Understanding Data Flow Destinations

ADO.NET Connection TargetDestination

Adapter

Page 10: SSIS 2008 R2 data flow

Defining Data Flow Transformations

SSIS Transformations are the components in the data flow of a package that give you the ability to modify and manipulate data in the data flow. A transformation performs an operation either on one row of data at a time or on several rows of data at once. For example aggregate, merge, distribute, and modify data and also can perform lookup operations and generate sample datasets.

Page 11: SSIS 2008 R2 data flow

Understanding Data Flow Transformations

Page 12: SSIS 2008 R2 data flow

Mapping Columns and Dataflow Pipeline

DimProduct

ProductKey

Color

Name

Cost

DimProduct

ProductKey

Color

Name

Cost

DimProduct

ProductKey

Color

Name

Cost

Source

Transformation

Destination

Page 13: SSIS 2008 R2 data flow

Best Practices

Page 14: SSIS 2008 R2 data flow

Transformations

We can logically group them by functionality: Row Transformations Rowset Transformations Split and Join Transformations Auditing Transformations Business Intelligence Transformations Custom Transformations

Page 15: SSIS 2008 R2 data flow

Row Transformations The most common and easily configured transformations perform operations on rows without needing other rows from the source. These transformations, which logically work at the row level, often perform very well.

Update column values or create new columns

Transform each row in the pipeline input

Page 16: SSIS 2008 R2 data flow

Rowset Transformations

Create new rowsets that can include

Aggregated values

Sorted values

Sample rowsets

Pivoted or unpivoted rowsets

Page 17: SSIS 2008 R2 data flow

Split and Join Transformations

Distribute rows to different outputs

Create copies of the transformation inputs

Join multiple inputs into one output

Perform lookup operations

Page 18: SSIS 2008 R2 data flow

Auditing Transformations Integration Services includes the following transformations to add audit information and count rows.

Page 19: SSIS 2008 R2 data flow

Business Intelligence Transformations

cleaning data

updating of a slowly changing dimension

looks up values

mining text

running data mining prediction queries

The final grouping of transformations lets you perform advanced operations on rows in the data flow pipeline.

Page 20: SSIS 2008 R2 data flow

ADO.NET Connection

Dataflow Summary

Sources

Transformations

Destinations

OLEDB Oracle Connection

EXCELConnection

Page 21: SSIS 2008 R2 data flow

DEMO


Recommended