© 2011 IBM Corporation
Information Management
AFPOA Virtual Vendor DayTopic: Data IntegrationGregory J. Vaughan – Executive Consultant, WW Military and Defense Lead, Information Agenda Tiger Team
© 2011 IBM Corporation
Information Management
2
There’s no “easy button” for this…Data Integration is a complex
problem
A myopic view of the problem frustrates the desired end state
Scoping the problem too narrowly reduces the likelihood of success
Focusing later on data integration requires a revisit of the problem scope
Data integration presents the greatest risk to IT related business initiatives
Data Governance is required, but frequently overlooked
The complexities of data integration requires a comprehensive solution
© 2011 IBM Corporation
Information Management
3
Define & Govern
Operational
Systems/Data
APPLICATIONS
INTERNALDATABASES
EXTERNALDATABASES
BI (REPORTS, DASHBOARDS,QUERY, OLAP)
Analytics
PREDICTIVEANALYTICS
TEXTANALYTICS
OPTIMIZATION
OLAPCUBES
DATAWAREHOUSE
DATA MARTS
Analytical
Information
UNSTRUCTURECONTENT
METADATA
OPERATIONALDATA
MASTERDATA
Trusted
Information
Info. Integration Data Quality Info. Services
USERS
APPLICATIONS
INTERNAL/EXTERNALDATBASES
Solution Architecture – General View
© 2011 IBM Corporation
Information Management
4
The IBM Solution: IBM Information ServerDelivering information you can trust
Understand
Cleanse Transform Deliver
Parallel ProcessingRich Connectivity to Applications, Data, and
Content
IBM Information Server
Discover, model, and govern information
structure and content
Standardize, merge,and correct information
Combine and restructure
information for new uses
Synchronize, virtualize and move information for in-
line delivery
Unified Deployment
Unified Metadata Management
© 2011 IBM Corporation
Information Management
5
Align business and IT objectives using single platform that creates trusted information for use in key initiatives
SourcesSourcesBusiness InitiativesBusiness Initiativeslegacylegacy
appsapps
dbsdbs
Xls., xml, flat
Xls., xml, flat
warehousewarehouse
z/OSz/OS
customcustom
BIBI
SAPSAP
warehousewarehouse
mdmmdm
Business AnalystsBusiness Analysts
ExecutivesExecutivesEnterprise ArchitectsEnterprise Architects
Data Analysts & Architects
Data Analysts & Architects
Subject Matter Experts
Subject Matter Experts
ERP System Manager
ERP System Manager
DeveloperDeveloper
DBADBA
System ArchitectSystem
Architect
Data Steward
Data Steward
© 2011 IBM Corporation
Information Management
6
Align business and IT objectives using single platform that creates trusted information for use in key initiatives
SourcesSourcesBusiness InitiativesBusiness Initiativeslegacylegacy
appsapps
dbsdbs
Xls., xml, flat
Xls., xml, flat
warehousewarehouse
z/OSz/OS
customcustom
BIBI
SAPSAP
warehousewarehouse
mdmmdm
Business AnalystsBusiness Analysts
ExecutivesExecutivesEnterprise ArchitectsEnterprise Architects
Data Analysts & Architects
Data Analysts & Architects
Subject Matter Experts
Subject Matter Experts
ERP System Manager
ERP System Manager
DeveloperDeveloper
DBADBA
System ArchitectSystem
Architect
Data Steward
Data Steward
© 2011 IBM Corporation
Information Management
77
Identify data quality issues early to reduce project risks
Monitor quality metrics over time for compliance
Create business confidence with trusted information
Perform data quality assessment
Define business rules to monitor data quality
Establish stewards for governance of data quality
Requirements
Benefits
Information AnalyzerInformation Analyzer
Analyze source data quality and monitor adherence to
integration and quality rules
InfoSphere Information Analyzer
© 2011 IBM Corporation
Information Management
88 8
Context for information is available to everyone, immediately
IT projects are aligned with data governance
Collaboration increases across business and IT
Capture business terms and classifications
Link business terms and classifications to IT assets
Identify data stewards and make glossary accessible
Requirements
Benefits
Business GlossaryBusiness Glossary
Create and manage business vocabulary and relationships and
related to physical sources
InfoSphere Business Glossary
© 2011 IBM Corporation
Information Management
9
Align business and IT objectives using single platform that creates trusted information for use in key initiatives
SourcesSourcesBusiness InitiativesBusiness Initiativeslegacylegacy
appsapps
dbsdbs
Xls., xml, flat
Xls., xml, flat
warehousewarehouse
z/OSz/OS
customcustom
BIBI
SAPSAP
warehousewarehouse
mdmmdm
Business AnalystsBusiness Analysts
ExecutivesExecutivesEnterprise ArchitectsEnterprise Architects
Data Analysts & Architects
Data Analysts & Architects
Subject Matter Experts
Subject Matter Experts
ERP System Manager
ERP System Manager
DeveloperDeveloper
DBADBA
System ArchitectSystem
Architect
Data Steward
Data Steward
© 2011 IBM Corporation
Information Management
1010
QualityStage
Removes duplicates
Cross-references matching records
Survives a single, complete record
Validate and enriches data
Resolution of data quality issues
Standardization of data formats
Cleanse data
Manage duplicate data
Enable ongoing quality
Requirements
Standardize, cleanse and deduplicate data, ensuring a complete, accurate view of
information
Benefits
InfoSphere QualityStage
© 2011 IBM Corporation
Information Management
11
Align business and IT objectives using single platform that creates trusted information for use in key initiatives
SourcesSourcesBusiness InitiativesBusiness Initiativeslegacylegacy
appsapps
dbsdbs
Xls., xml, flat
Xls., xml, flat
warehousewarehouse
z/OSz/OS
customcustom
BIBI
SAPSAP
warehousewarehouse
mdmmdm
Business AnalystsBusiness Analysts
ExecutivesExecutivesEnterprise ArchitectsEnterprise Architects
Data Analysts & Architects
Data Analysts & Architects
Subject Matter Experts
Subject Matter Experts
ERP System Manager
ERP System Manager
DeveloperDeveloper
DBADBA
System ArchitectSystem
Architect
Data Steward
Data Steward
© 2011 IBM Corporation
Information Management
1212
InfoSphere Metadata Workbench
Deliver enterprise audit control information.
Mediate system disruptions.
Govern enterprise assets over time.
Ensure effective collaboration with line of business stakeholders.
Handle Change Management processes with measured impact.
Visualize and trace information flows across enterprise landscape
Access and report on operational and design metadata
Requirements
Benefits
Metadata WorkbenchMetadata Workbench
Support information governance with traceability on data movement,
modeling & BI applications
© 2011 IBM Corporation
Information Management
13
Align business and IT objectives using single platform that creates trusted information for use in key initiatives
SourcesSourcesBusiness InitiativesBusiness Initiativeslegacylegacy
appsapps
dbsdbs
Xls., xml, flat
Xls., xml, flat
warehousewarehouse
z/OSz/OS
customcustom
BIBI
SAPSAP
warehousewarehouse
mdmmdm
Business AnalystsBusiness Analysts
ExecutivesExecutivesEnterprise ArchitectsEnterprise Architects
Data Analysts & Architects
Data Analysts & Architects
Subject Matter Experts
Subject Matter Experts
ERP System Manager
ERP System Manager
DeveloperDeveloper
DBADBA
System ArchitectSystem
Architect
Data Steward
Data Steward
© 2011 IBM Corporation
Information Management
14 14
InfoSphere Data ArchitectRequirements
Benefits
Model, visualize, and relate diverse and distributed data assets
Data ArchitectData Architect
Design and manage enterprise models
Enforce model conformance to enterprise standards
Leverage industry data models for best practices
Speed design activities
Populate Business Glossary from model terms
Validate models for enterprise conformance
© 2011 IBM Corporation
Information Management
15 15
InfoSphere FastTrackRequirements
Benefits
Capture Design Specifications and accelerate translation into data
integration projects
Capture business requirements for source to target mappings
Leverage source analysis and business vocabulary
Generate candidate ETL jobs
Accelerate development of integration processes
Centralized management of specifications
Audit design decisions over time
FastTrackFastTrack
© 2011 IBM Corporation
Information Management
16
IBM InfoSphere Optim Data Masking Solution
Protect sensitive information from misuse and fraud
Prevent data breaches and associated fines
Achieve better data governance
Protect confidential data used in test, training & development systems
Implement proven data masking techniques
Support compliance with privacy regulations
Solution supports custom & packaged ERP applications
Requirements
Benefits
De-identify sensitive informationwith realistic but fictional data for testing & development purposes
Personal identifiable information is masked with realistic but fictional data for testing & development purposes.
JASON MICHAELSJASON MICHAELS ROBERT SMITHROBERT SMITH
Understand &Define
Monitor & AuditSecure &
Protect
Information Governance Core DisciplinesSecurity and Privacy
© 2011 IBM Corporation
Information Management
17
IBM InfoSphere Optim Test Data Management Solution
Create “right-size” production-like environments
for application testing
Test Data Management
Test Data Management
100 GB100 GB
25 GB
50 GB50 GB
25 GB
2TB2TB
Development
Unit Test
TrainingIntegration Test
Subset & Mask
Production or Production Clone
InfoSphere Optim TDM supports data on distributed platforms (LUW) and z/OS.
Out-of-the-box subset support for packaged applications ERP/CRM solutions as well as :
OtherOther
Understand &Define
Monitor & AuditSecure &
Protect
Information Governance Core DisciplinesSecurity and Privacy
Deploy new functionality more quickly and with improved quality
Easily refresh & maintain test environments
Reduce storage and operational costs
Create referentially intact, “right-sized” test databases
Automate test result comparisons to identify hidden errors
Shorten iterative testing cycles and accelerate time to market
Requirements
Benefits
© 2011 IBM Corporation
Information Management
18
Guardium: Full Lifecycle of Database Security & Compliance
© 2011 IBM Corporation
Information Management
19
Best Practices Capabilities & Differentiators
Single data integration platform with multiple components
Consistent and repeatable methodology for mitigating risks
Industry leading Probabilistic Matching Engine for data
standardization jobs
Native Parallel Processing Engine for scalability
Shared GUI Interface between major components of the platform
Centralized repository of critical metadata shared across the
platform
Data integration enablement in an SOA environment
© 2011 IBM Corporation
Information Management
20
IBM Information Server Federal Customers
• Agency data migrations• Authoritative source• Personnel record consolidation• System synchronization
• Personnel and recruiting analysis• Procurement system consolidation• Real-time data management• Inventory parts analysis
© 2011 IBM Corporation
Information Management
21
Questions?