Date post: | 26-Oct-2014 |
Upload: | nitrojunk88 |
Course Introduction 0.2
Copyright © 2008 Informatica Corp
PowerCenter 8.x Level I Developer
Module 0: Course Introduction
Course Objectives
When you have completed this course, you should be able to:
• Use PowerCenter 8 Designer to build mappings that move data from sources to targets
• Use PowerCenter 8 Workflow Manager to build and run a workflow that runs a session based on a mapping
• Design simple mappings and workflows based on business needs
• Perform basic troubleshooting of PowerCenter mappings and transformations
• Use Informatica Support options to resolve questions and problems about Informatica PowerCenter 8
Course Agenda
1. PowerCenter Overview
2. Mapping Fundamentals
3. Workflow Basics
4. Expression and Filter Transformations
5. Joining and Merging Data
6. Lookup Transformations
7. Sorter and Aggregator Transformations
Course Agenda Continued
8. Using the Debugger
9. Updating Target Tables
10. Mapping Techniques
11. Mapplets and Worklets
12. Controlling Workflows
13. Mapping Design Workshop
14. Workflow Design Workshop
Module 1: PowerCenter Overview
Module Objectives
After completing this module you will be able to:
• Explain the purposes of PowerCenter
• Define terms used in PowerCenter
• Name major PowerCenter components
The Problem
• Large organizations have a lot of data
• The data can be stored in many formats, including databases and unstructured files
• This data must be collated, combined, compared, and made to work as a seamless whole
• But the different databases don’t talk to each other!
[Diagram: Accounting (SAP), Marketing (ORCL), Billing (Sybase), Sales (SalesForce), Manufacturing (DB2), Resource Planning (PSFT), Inventory (SQL Server)]
The Problem Continued
• Many organizations have implemented connectors between these databases
• Every pair of databases requires a unique connector
[Diagram: Accounting (SAP), Marketing (ORCL), Sales (SalesForce), Manufacturing (DB2), Resource Planning (PSFT), Inventory (SQL Server), Billing (Sybase), joined pairwise by connectors]
Connector: A connector is a piece of custom software that performs two functions: it converts data from the format of one application to the format of another application, and it transports the data between the two applications.
Technical Note: To be precise, fully connecting n databases requires 1 + 2 + … + (n − 1) = n(n − 1)/2 connectors.
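The connector count in the technical note can be checked with a few lines of Python. This is only an illustrative sketch: the function names are made up for this example, and the hub figure simply assumes one connection per system, as opposed to one connector per pair.

```python
def point_to_point_connectors(n: int) -> int:
    """Connectors needed to link every pair of n databases directly."""
    # 1 + 2 + ... + (n - 1) has the closed form n * (n - 1) / 2.
    return n * (n - 1) // 2


def hub_connections(n: int) -> int:
    """Connections needed if each database talks only to a central hub."""
    # Assumption for comparison: exactly one connection per database.
    return n


if __name__ == "__main__":
    for n in (2, 7, 20):
        print(n, point_to_point_connectors(n), hub_connections(n))
```

For the seven systems shown on the slide, the pairwise approach needs 21 connectors, and the number grows quadratically as systems are added.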
The Problem Continued
• If you change one database, many connectors have to be upgraded
[Diagram: Accounting (upgraded), Marketing (ORCL), Sales (SalesForce), Manufacturing (DB2), Resource Planning (PSFT), Inventory (SQL Server), Billing (Sybase)]
The Solution: Data Integration
• Data Integration technologies allow applications using different databases and formats to communicate with each other using one connection each
[Diagram: Accounting (SAP), Marketing (ORCL), Sales (SalesForce), Manufacturing (DB2), Resource Planning (PSFT), Inventory (SQL Server), and Billing (Sybase), each connected to a central Data Integration layer]
Informatica PowerCenter
• Informatica PowerCenter is the premium data integration solution available today
• “Database neutral” – will communicate with any database
• Powerful data transformations convert one application’s data to another’s format
[Diagram: Accounting (SAP), Marketing (ORCL), Sales (SalesForce), Manufacturing (DB2), Resource Planning (PSFT), Inventory (SQL Server), and Billing (Sybase), each connected to Informatica PowerCenter]
Example: Data Migration
• A company purchases a new accounts payable application
• PowerCenter can move the existing account data to the new application
• Preserves data lineage for tax, accounting, and other legally mandated purposes
[Diagram: Accounting (old), Informatica PowerCenter, Accounting (new)]
Example: Application Integration
• Company A purchases Company B
• To achieve the benefits of consolidation, Company B’s billing system must be integrated into Company A’s billing system
[Diagram: Billing A, Informatica PowerCenter, Billing B]
Example: Data Warehousing
• Data warehouses put information from many sources together for analysis
• Data is moved from many databases to the warehouse
[Diagram: Accounting (SAP), Marketing (ORCL), Billing (Sybase), Sales (SalesForce), Manufacturing (DB2), Resource Planning (PSFT), and Inventory (SQL Server) feeding through Informatica PowerCenter into the Data Warehouse]
Additional Examples of Data Integration
In addition to the examples given on these slides, PowerCenter is deployed for:
• Data Synchronization – ongoing exchange of data between disparate applications
• Data Hubs – master data management; reference data hubs; single view of customer, product, supplier, employee, etc.
• Business Activity Monitoring – business process improvement, real-time reporting
Data Integration Methodology: ETL
• Extract the data from a source system
• Transform it as needed
• Load it to a target system
[Diagram: Source(s), Data Integration, Target(s)]
ETL for Data Warehousing
Operational Systems (RDBMS, Mainframe, Other)
• Transaction data
• Optimized for transaction response time
• Current
• Normalized or de-normalized data
ETL (Extract, Transform, Load)
• Aggregate data
• Cleanse data
• Consolidate data
• Apply business rules
• De-normalize data
Decision Support (Data Warehouse)
• Aggregated data
• Historical data
ETL: Extract
• PowerCenter reads data, row by row, from a table (or group of related tables) in a database, or from a file
• This database or file is referred to as the source
• The structure of the source is contained in a source definition object
[Diagram: Source, Informatica PowerCenter; step shown: Extract]
ETL: Transform
• PowerCenter converts the rows into a format the second (target) system will be able to use
• The logic for this conversion is defined in transformation objects
[Diagram: Source, Informatica PowerCenter; steps shown: Extract, Transform]
ETL: Load
• PowerCenter writes data, row by row, to a table (or group of related tables) in a database, or to a file
• This database or file is referred to as the target
• The structure of the target is contained in a target definition object
[Diagram: Source, Informatica PowerCenter, Target; steps shown: Extract, Transform, Load]
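The three ETL steps described above can be sketched in plain Python. This is a conceptual illustration only, not how PowerCenter is implemented or configured: the sample data, column names, and function names are all hypothetical, and an in-memory CSV stands in for both the source and the target file.

```python
import csv
import io

# Hypothetical source data; in PowerCenter its structure would be
# described by a source definition object.
SOURCE_CSV = "cust_id,first_name,last_name\n1,ada,lovelace\n2,alan,turing\n"


def extract(text):
    """Extract: read the source row by row."""
    yield from csv.DictReader(io.StringIO(text))


def transform(rows):
    """Transform: convert each row into the layout the target expects."""
    for row in rows:
        yield {
            "CUSTOMER_ID": int(row["cust_id"]),
            "FULL_NAME": f"{row['first_name']} {row['last_name']}".title(),
        }


def load(rows, out):
    """Load: write rows one at a time to the target; return the row count."""
    writer = csv.DictWriter(out, fieldnames=["CUSTOMER_ID", "FULL_NAME"])
    writer.writeheader()
    count = 0
    for row in rows:
        writer.writerow(row)
        count += 1
    return count


def run_etl(source_text=SOURCE_CSV):
    """Chain the three steps and return (rows loaded, target contents)."""
    target = io.StringIO()
    loaded = load(transform(extract(source_text)), target)
    return loaded, target.getvalue()


if __name__ == "__main__":
    loaded, contents = run_etl()
    print(loaded)
    print(contents)
```

Because each step is a generator feeding the next, rows flow through one at a time, which loosely mirrors the row-by-row reading and writing described on the slides.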
Mapping
• A set of transformations, in sequence or in parallel, that move and transform data from one or more source(s) to one or more target(s)
• Mappings exist entirely inside PowerCenter
[Diagram: Source(s), Informatica PowerCenter (a mapping made up of transformations), Target(s)]
Mappings: A mapping logically defines the ETL process. It reads data from sources, applies transformation logic to the data, and writes the transformed data to targets.
Transformations
• Transformations receive data and transform it
• Generate new fields
• Modify data
• Select and pass data
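As a rough illustration of the three kinds of work listed above, the following Python sketch treats each row as a dictionary. The field names (`sku`, `quantity`, `unit_price`, `total`) and the function names are invented for this example and do not correspond to PowerCenter transformation types.

```python
def add_total(row):
    """Generate a new field from existing ones."""
    return {**row, "total": row["quantity"] * row["unit_price"]}


def normalize_sku(row):
    """Modify an existing field (on a copy of the row)."""
    return {**row, "sku": row["sku"].strip().upper()}


def keep_large_orders(rows, minimum):
    """Select and pass only the rows that meet a condition."""
    return [row for row in rows if row["total"] >= minimum]


def apply_transformations(rows, minimum=100):
    """Run the three steps in sequence over a list of rows."""
    derived = [add_total(normalize_sku(row)) for row in rows]
    return keep_large_orders(derived, minimum)
```

A call such as `apply_transformations([{"sku": " ab-1 ", "quantity": 3, "unit_price": 50}])` would return the row with a cleaned `sku` and a new `total` field, while rows whose total falls below the minimum are dropped.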
Session
• The object that runs a mapping
[Diagram labels: Transformation, Mapping, Session]
Workflow
• An ordered set of one or more sessions and other tasks, designed to accomplish an overall operational purpose
[Diagram labels: Transformation, Mapping, Session, Other Tasks, Workflow]
Tasks: A task is an executable set of actions, functions, or commands. A session is a task that runs a mapping. Other tasks include:
• Command – runs a shell script
• Email – sends an email
• Decision – branches a workflow conditionally
• Timer – waits for a defined period
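The relationship between transformations, mappings, sessions, tasks, and workflows can be modeled with a few small classes. This is a conceptual sketch under stated assumptions, not PowerCenter's object model: every class, field, and object name here is hypothetical, and the command task only records a message instead of running a shell script.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, List


@dataclass
class Mapping:
    """Logical ETL definition: an ordered list of transformations."""
    name: str
    transformations: List[Callable[[dict], dict]]

    def apply(self, row: dict) -> dict:
        for transformation in self.transformations:
            row = transformation(row)
        return row


@dataclass
class Session:
    """A task that runs one mapping against concrete source rows."""
    name: str
    mapping: Mapping
    source_rows: Iterable[dict]
    target_rows: List[dict] = field(default_factory=list)

    def run(self) -> str:
        for row in self.source_rows:
            self.target_rows.append(self.mapping.apply(row))
        return f"{self.name}: {len(self.target_rows)} rows"


@dataclass
class CommandTask:
    """Stand-in for a non-session task; it only records a message."""
    name: str
    message: str

    def run(self) -> str:
        return f"{self.name}: {self.message}"


@dataclass
class Workflow:
    """An ordered set of tasks, run one after another."""
    name: str
    tasks: list

    def run(self) -> List[str]:
        return [task.run() for task in self.tasks]


def build_demo_workflow() -> Workflow:
    """Two sessions share one mapping, followed by a command task."""
    upper_name = Mapping(
        "m_upper_name", [lambda row: {**row, "name": row["name"].upper()}]
    )
    first = Session("s_load_a", upper_name, [{"name": "ada"}])
    second = Session("s_load_b", upper_name, [{"name": "alan"}, {"name": "grace"}])
    return Workflow("wf_demo", [first, second, CommandTask("cmd_notify", "done")])
```

The design keeps the mapping free of any concrete data, so the same mapping can be reused by more than one session, each with its own source rows.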
Metadata
• Defines data and processes
• Examples:
  • Source and target definitions
    • Type (flat file, database table, XML file, etc.)
    • Datatype (character string, integer, decimal, etc.)
    • Other attributes (length, precision, etc.)
  • Mapping logic
  • Workflow logic
• Stored in a metadata repository
[Diagram label: Repository]
Metadata in PowerCenter: PowerCenter uses metadata to define sources, targets, transformations, mappings, and workflows.
Metadata: The word metadata literally means “data about data.” It is the information that describes data. Common contents of metadata include the source of a dataset, how it should be accessed, and its limitations.
Note: The repository is implemented as a schema, which may reside in any of a number of supported relational database management systems.
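To make "data about data" concrete, the sketch below describes a source with the attributes listed on the slide (type, datatype, length, precision) and uses that description to check a row. It is a hypothetical illustration: the class names, the `CUSTOMERS` definition, and the `validate` helper are invented here and are not PowerCenter repository structures.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class ColumnDefinition:
    name: str
    datatype: str       # e.g. "string", "integer", "decimal"
    length: int = 0     # 0 means "not specified" in this sketch
    precision: int = 0


@dataclass(frozen=True)
class SourceDefinition:
    name: str
    source_type: str    # e.g. "flat file", "database table", "XML file"
    columns: Tuple[ColumnDefinition, ...]


# Hypothetical metadata for a customer flat file.
CUSTOMERS = SourceDefinition(
    name="CUSTOMERS",
    source_type="flat file",
    columns=(
        ColumnDefinition("cust_id", "integer"),
        ColumnDefinition("name", "string", length=40),
        ColumnDefinition("balance", "decimal", precision=10),
    ),
)


def validate(row: dict, definition: SourceDefinition) -> List[str]:
    """Return the problems found when checking a row against its metadata."""
    problems = []
    for column in definition.columns:
        if column.name not in row:
            problems.append(f"missing column {column.name}")
        elif (
            column.datatype == "string"
            and column.length
            and len(row[column.name]) > column.length
        ):
            problems.append(f"{column.name} longer than {column.length}")
    return problems
```

The point of the sketch is that the definition holds no customer data at all; it only describes what valid data looks like, which is the role metadata plays in the repository.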
PowerCenter 8.5 Architecture
[Architecture diagram. Components: PowerCenter Client, Administration Console, Domain, Integration Service, Repository Service, Repository Service Process, Repository, Domain Metadata, Security Domain, Sources, Targets. Connection labels: TCP/IP, HTTPS, ODBC, native drivers.]
Client Tools: Desktop tools used to populate the repository with metadata, execute workflows on the Integration Service, monitor the workflows, and manage the repository
Repository: Contains all the metadata needed to run the ETL process
Repository Service Process: Multithreaded process that retrieves, inserts, and updates repository metadata
Repository Service: Manages connectivity to metadata repositories that contain mapping and workflow definitions
Integration Service: The engine which performs all the ETL logic
Sources and Targets: Can be relational tables or heterogeneous files (such as flat files, VSAM files, or XML)
PowerCenter Client Applications
Development
• Designer: create ETL mappings
• Workflow Manager: create and start workflows
• Workflow Monitor: monitor and control workflows
Administration
• Repository Manager: manage repository
  • connections
  • folders
  • objects
  • users and groups (in PowerCenter 8.1)
• Administration Console (browser-based): perform domain and repository service tasks
  • Create/configure nodes and repository services
  • Upgrade/delete
  • Start/stop
  • Backup/restore
  • Manage users and groups (in PowerCenter 8.5)
Note: Designer and Repository Manager access the repository through the Repository Service. Workflow Manager and Workflow Monitor connect to the Integration Service.
Each client has its own user interface. The UIs typically have toolbars, a navigation window to the left, a workspace to the right, and an output window at the bottom.
Designer: Create Mappings, Sources, Targets
[Screenshot of the Designer. Labeled areas: Designer Tools, Client Apps*, Transformation toolbar, Navigator Window, Workspace, Output Window, Status Bar]
* Single login to client applications – click on icons to open other tools – they are already connected to the repository
Designer Tools
• Source Analyzer: create source objects
• Target Designer: create target objects
• Transformation Developer: create reusable transformations
• Mapplet Designer: create mapplets
• Mapping Designer: create mappings
Transformation Views
Iconized – shows the transformation in relation to the rest of the mapping
Normal – shows the flow of data through the transformation
Edit – shows transformation ports and properties; allows editing
Ports represent table columns or file fields
Workflow Manager: Create and Run Workflows
[Screenshot of the Workflow Manager. Labeled areas: Workflow Manager Tools, Client Applications, Tasks, Connections, Navigator Window, Workspace, Output Window, Status Bar]
Workflow Manager Tools
• Create reusable tasks
• Create worklets
• Create workflows
Session Tasks: Run a Single Mapping
Iconized view – shows the task in relation to the rest of the workflow
Edit view – shows task properties; allows editing
Two or more Sessions can point to a single mapping
Workflow Monitor: View Workflow Progress
[Screenshot of the Workflow Monitor. Labeled areas: Navigator Window, Gantt Chart View, Task View, Output Window, Time Window]
Repository Manager: Administer Folders
[Screenshot of the Repository Manager. Labeled areas: Navigator Window, Main Window, Output Window, Status Bar]
Class Scenario
• Business Function: OLTP (DB Schema: SDBU). Data in an online transaction processing system.
• Business Function: Staging (DB Schema: STGxx). Data format similar to OLTP, used for populating ODS.
• Business Function: ODS/EDW (DB Schema: ODSxx). Operational Data Store or Electronic Data Warehouse is normalized data in an enterprise data model that aligns data from various OLTP systems.
• Business Function: DDW. Dimensional Data Warehouse uses Fact and Dimension tables in second normal form to speed report generation and allow for historical data. Covered in Level two Developer class.
Note: In the labs for this course, we are simulating part of the creation of a (very simple) Dimensional Data Warehouse. In these labs, you will begin with data in OLTP tables and flat files, bring data to Staging, and from Staging (STG) to the Operational Data Store (ODS).
Because creation of Staging tables is fairly trivial, you will do more work on moving data from STG to ODS. This will provide more realistic uses of the capabilities of PowerCenter.
Summary
This module showed you how to:
• Explain the purposes of PowerCenter
• Define terms used in PowerCenter
• Name major PowerCenter components