+ All Categories
Home > Documents > PC8x_Module01

PC8x_Module01

Date post: 26-Oct-2014
Category:
Upload: nitrojunk88
View: 104 times
Download: 1 times
Share this document with a friend
Popular Tags:
36
Course Introduction 0.2 Copyright © 2008 Informatica Corp PowerCenter 8.x Level I Developer Module 0: Course Introduction
Transcript
Page 1: PC8x_Module01

Course Introduction 0.2

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 0: Course Introduction

Page 2: PC8x_Module01

Course Introduction 0.5

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 0: Course Introduction 5 of 28

Course Objectives

• When you have completed this course, you should be able to:• Use PowerCenter 8 Designer to build mappings that move

data from sources to targets

• Use PowerCenter 8 Workflow Manager to build and run a workflow that runs a session based on a mapping

• Design simple mappings and workflows based on business needs

• Perform basic troubleshooting of PowerCenter mappings and transformations

• Use Informatica Support options to resolve questions and problems about Informatica PowerCenter 8

0

Page 3: PC8x_Module01

Course Introduction 0.6

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 0: Course Introduction 6 of 28

Course Agenda

1. PowerCenter Overview

2. Mapping Fundamentals

3. Workflow Basics

4. Expression and Filter Transformations

5. Joining and Merging Data

6. Lookup Transformations

7. Sorter and Aggregator Transformations

Page 4: PC8x_Module01

Course Introduction 0.7

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 0: Course Introduction 7 of 28

Course Agenda Continued

8. Using the Debugger

9. Updating Target Tables

10.Mapping Techniques

11.Mapplets and Worklets

12.Controlling Workflows

13.Mapping Design Workshop

14.Workflow Design Workshop

0

Page 5: PC8x_Module01

PowerCenter Overview 1.1

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview

1

Page 6: PC8x_Module01

PowerCenter Overview 1.2

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 2 of 32

Module Objectives

After completing this module you will be able to:• Explain the purposes of PowerCenter• Define terms used in PowerCenter• Name major PowerCenter components

Page 7: PC8x_Module01

PowerCenter Overview 1.3

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 3 of 32

The Problem

• Large organizations have a lot of data• The data can be stored in many formats, including databases

and unstructured files• This data must be collated, combined, compared, and

made to work as a seamless whole• But the different databases don’t talk to each other!

Accounting(SAP)

Marketing(ORCL)

Billing(Sybase)

Sales(SalesForce)Manufacturing

(DB2) ResourcePlanning(PSFT)

Inventory(SQL Server)

1

Page 8: PC8x_Module01

PowerCenter Overview 1.4

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 4 of 32

The Problem Continued

• Many organizations have implemented connectors between these databases• Every pair of databases requires a unique connector

Accounting(SAP)

Marketing(ORCL)

Sales(SalesForce)

Manufacturing(DB2)

ResourcePlanning(PSFT)

Inventory(SQL Server)

Billing(Sybase)

A connector is a piece of custom software that performs two functions: it converts data from the format of one application to the format of another application, and it transports the data between the two applications.

Connector

To be precise, to connect n databases fully requires (1+2+…+(n-1)) connectors.

Technical Note

Page 9: PC8x_Module01

PowerCenter Overview 1.5

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 5 of 32

The Problem Continued

• If you change one database, many connectors have to be upgraded

Marketing(ORCL)

Sales(SalesForce)

Manufacturing(DB2)

ResourcePlanning(PSFT)

Inventory(SQL Server)

Billing(Sybase)

Accounting(upgraded)

1

Page 10: PC8x_Module01

PowerCenter Overview 1.6

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 6 of 32

The Solution: Data Integration

• Data Integration technologies allow applications using different databases and formats to communicate with each other using one connection each

Accounting(SAP)

Marketing(ORCL)

Sales(SalesForce)

Manufacturing(DB2)

ResourcePlanning(PSFT)

Inventory(SQL Server)

Billing(Sybase)

DataIntegration!

Page 11: PC8x_Module01

PowerCenter Overview 1.7

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 7 of 32

Informatica PowerCenter

• Informatica PowerCenter is the premium data integration solution available today• “Database neutral” – will communicate with any database• Powerful data transformations convert one application’s data to

another’s format

Accounting(SAP)

Marketing(ORCL)

Sales(SalesForce)

Manufacturing(DB2)

ResourcePlanning(PSFT)

Inventory(SQL Server)

Billing(Sybase)

InformaticaPowerCenter

1

Page 12: PC8x_Module01

PowerCenter Overview 1.8

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 8 of 32

Example: Data Migration

• A company purchases a new accounts payable application

• PowerCenter can move the existing account data to the new application• Preserves data lineage for tax, accounting, and other legally

mandated purposes

Accounting(old)

InformaticaPowerCenter

Accounting(new)

Page 13: PC8x_Module01

PowerCenter Overview 1.9

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 9 of 32

Example: Application Integration

• Company A purchases Company B• To achieve the benefits of consolidation, Company B’s

billing system must be integrated into Company A’s billing system

Billing AInformatica

PowerCenter Billing B

1

Page 14: PC8x_Module01

PowerCenter Overview 1.10

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 10 of 32

Example: Data Warehousing

• Data warehouses put information from many sources together for analysis

• Data is moved from many databases to the warehouse

Accounting(SAP)

Marketing(ORCL)

Billing(Sybase)

Sales(SalesForce)Manufacturing

(DB2) ResourcePlanning(PSFT)

Inventory(SQL Server)

InformaticaPowerCenter

DataWarehouse

In addition to the examples given on these slides, PowerCenter is deployed for:

• Data Synchronization – ongoing exchange of data between disparate applications

• Data Hubs – master data management; reference data hubs; single view of customer, product, supplier, employee, etc.

• Business Activity Monitoring – business process improvement, real-time reporting

Additional Examples of Data Integration

Page 15: PC8x_Module01

PowerCenter Overview 1.11

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 11 of 32

Data Integration Methodology: ETL

•Extract the data from a source system

•Transform it as needed

•Load it to a target system

Source(s)

Target(s)

DataIntegration!

1

Page 16: PC8x_Module01

PowerCenter Overview 1.12

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 12 of 32

ETL for Data Warehousing

• Transaction data• Optimized for transaction

response time• Current• Normalized or

De-normalized data

Operational Systems

RDBMS Other

• Aggregated data• Historical data

Decision Support

Data Warehouse

ETL Load

Transform

Extract

• Aggregate data• Cleanse data• Consolidate data• Apply business rules• De-normalize data

Mainframe

Page 17: PC8x_Module01

PowerCenter Overview 1.13

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 13 of 32

ETL: Extract

• PowerCenter reads data, row by row, from a table (or group of related tables) in a database, or from a file

• This database or file is referred to as the source• The structure of the source is contained in a source

definition object.

SourceInformatica

PowerCenter

Extract

1

Page 18: PC8x_Module01

PowerCenter Overview 1.14

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 14 of 32

ETL: Transform

• PowerCenter converts the rows into a format the second (target) system will be able to use

• The logic for this conversion is defined in transformationobjects

SourceInformatica

PowerCenter

Extract Transform

Page 19: PC8x_Module01

PowerCenter Overview 1.15

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 15 of 32

ETL: Load

• PowerCenter writes data, row by row, to a table (or group of related tables) in a database, or to a file

• This database or file is referred to as the target• The structure of the target is contained in a target

definition object

SourceInformatica

PowerCenter Target

Extract LoadTransform

1

Page 20: PC8x_Module01

PowerCenter Overview 1.16

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 16 of 32

Mapping

• A set of transformations, in sequence or in parallel, that move and transform data from one or more source(s) to one or more target(s)

• Mappings exist entirely inside PowerCenter

Source(s)

Informatica PowerCenter

Target(s)

transformationsmapping

A mapping logically defines the ETL process. It reads data from sources, applies transformation logic to the data, and writes the transformed data to targets.

Mappings

Page 21: PC8x_Module01

PowerCenter Overview 1.17

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 17 of 32

Transformations

• Transformations receive data and transform it• Generate new fields• Modify data• Select and pass data

Transformations

1

Page 22: PC8x_Module01

PowerCenter Overview 1.18

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 18 of 32

Session

• The object that runs a mapping

Transformation

Mapping

Session

Page 23: PC8x_Module01

PowerCenter Overview 1.19

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 19 of 32

Workflow

• An ordered set of one or more sessions and other tasks, designed to accomplishan overall operationalpurpose

Transformation

Mapping

Session

Workflow

Other Tasks

A task is an executable set of actions, functions, or commands. A session is a task that runs a mapping. Other tasks include:

• Command – runs a shell script

• Email – sends an email

• Decision – branches a workflow conditionally

• Timer – waits for a defined period

Tasks

1

Page 24: PC8x_Module01

PowerCenter Overview 1.20

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 20 of 32

Metadata

• Defines data and processes• Examples:

• Source and target definitions• Type (flat file, database table, XML file, etc)• Datatype (character string, integer, decimal, etc)• Other attributes (length, precision, etc.)

• Mapping logic• Workflow logic

• Stored in a metadata repository

Repository

PowerCenter uses metadata to define sources, targets, transformations, mappings, and workflows

Metadata in PowerCenter

The word metadata literally means “data about data.” It is the information that describes data. Common contents of metadata include the source of a dataset, how it should be accessed, and its limitations.

Metadata

The repository is implemented as a schema, which may reside in any of a number of supported relational database management systems.

Note

Page 25: PC8x_Module01

PowerCenter Overview 1.21

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 21 of 32

PowerCenter 8.5 Architecture

Sources Targets

Repository

Integration Service

Repository Service Process

Repository Service

Domain

Administration Console

PowerCenter Client Native drivers

TCP/IP

TCP/IPODBC

Native drivers/ODBC

Native drivers/ODBC

ODBC

TCP/IP

HTTPS

Security Domain

Domain Metadata

Native drivers

Desktop tools used to populate the repository with metadata, execute workflows on the Integration Service, monitor the workflows, and manage the repository

Client Tools

Contains all the metadata needed to run the ETL processRepository

Multithreaded process that retrieves, inserts, and updates repository metadata

Repository Service Process

Manages connectivity to metadata repositories that contain mapping and workflow definitions

Repository Service

The engine which performs all the ETL logicIntegration Service

Can be relational tables or heterogeneous files (such as flat files, VSAM files, or XML)

Sources and Targets

1

Page 26: PC8x_Module01

PowerCenter Overview 1.22

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 22 of 32

PowerCenter Client Applications

Designer

Create ETL mappings

Workflow Manager

Create and start workflows

Workflow Monitor

Monitor and control workflows

Development

Repository Manager

Manage repository• connections• folders• objects• users and groups

(in PowerCenter 8.1)

Administration Console (browser-based)

Perform domain and repository service tasks:• Create/configure nodes and repository services• Upgrade/delete• Start/stop• Backup/restore• Manage users and groups

(In PowerCenter 8.5)

Administration

Administration Console

Designer and Repository Manager access the repository through the Repository Service.

Workflow Manager and Workflow Monitor connect to the Integration Service.

Each client has its own user interface. The UIs typically have toolbars, a navigation window to the left, a workspace to the right, and an output window at the bottom.

Note

Page 27: PC8x_Module01

PowerCenter Overview 1.23

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 23 of 32

Transformationtoolbar

Output Window

Navigator Window

Workspace

Status Bar

Designer Tools

Client Apps*

Designer: Create Mappings, Sources, Targets

* Single login to client applications – click on icons to open other tools – they are already connected to repository

1

Page 28: PC8x_Module01

PowerCenter Overview 1.24

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 24 of 32

Designer Tools

SourceAnalyzer:

create source objects

TargetDesigner:

create target objects

TransformationDeveloper:

create reusable transformations

MappletDesigner:

createmapplets

MappingDesigner:

create mappings

Page 29: PC8x_Module01

PowerCenter Overview 1.25

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 25 of 32

Transformation Views

Iconized – shows the transformation in relation to the rest of the mapping

Normal – shows the flow of data through the transformation

Edit – shows transformation ports and properties; allows editing

Ports represent table columns or file fields

1

Page 30: PC8x_Module01

PowerCenter Overview 1.26

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 26 of 32

Workflow Manager: Create and Run Workflows

Connections

Output Window

Navigator Window

Workspace

Status Bar

Workflow Manager

Tools

Client Applications Tasks

Page 31: PC8x_Module01

PowerCenter Overview 1.27

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 27 of 32

Workflow Manager Tools

Create reusable tasks Create worklets Create workflows

1

Page 32: PC8x_Module01

PowerCenter Overview 1.28

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 28 of 32

Session Tasks: Run a Single Mapping

Iconized view – shows the task in relation to the rest of the workflow

Edit view – shows taskproperties, allows editing

Two or more Sessions can point to a single mapping

Page 33: PC8x_Module01

PowerCenter Overview 1.29

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 29 of 32

Workflow Monitor: View Workflow Progress

NavigatorWindow

Gantt ChartView

TaskView

OutputWindow

TimeWindow

1

Page 34: PC8x_Module01

PowerCenter Overview 1.30

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 30 of 32

Repository Manager: Administer Folders

NavigatorWindow

MainWindow

OutputWindow

StatusBar

Page 35: PC8x_Module01

PowerCenter Overview 1.31

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 31 of 32

Business Function:Staging

Data format similar to OLTP, used

for populating ODS

DB Schema:STGxx

Business Function:OLTP

Data in an onlineTransaction

processing system

DB Schema:SDBU

Class Scenario

Business Function:ODS/EDW

Operational DataStore or ElectronicData Warehouse isnormalized data inan enterprise dataModel that alignsdata from various

OLTP Systems

DB Schema:ODSxx

Business Function:DDW

Dimensional DataWarehouse uses Fact

and Dimensiontables in second

normal form to speedreport generation

and allow forhistorical data

Covered in Level twoDeveloper class

In the labs for this course, we are simulating part of the creation of a (very simple) Dimensional Data Warehouse. In these labs, you will begin with data in OLTP tables and flat files, bring data to Staging, and from Staging (STG) to the Operational Data Store (ODS).

Because creation of Staging tables is fairly trivial, you will do more work on moving data from STG to ODS. This will provide more realistic uses of the capabilities of PowerCenter.

Note

1

Page 36: PC8x_Module01

PowerCenter Overview 1.32

Copyright © 2008 Informatica CorpPowerCenter 8.x Level I Developer

Module 1: PowerCenter Overview 32 of 32

Summary

This module showed you how to:• Explain the purposes of PowerCenter• Define terms used in PowerCenter• Name major PowerCenter components


Recommended