© 2006 IBM Corporation
IBM Information Server – An Introduction
Richard Hedges
Program Director, Product Management
IBM Information [email protected]
2
Corporate View of Information Architecture Is Changing
Information is the key to Business Innovation
– Organizations highly effective at driving information integration are 5 times more likely to drive value creation
– Information architecture can’t exist in a vacuum – it needs to be tied to enterprise architecture
87% of CEOs believe fundamental change is required in the next two years to drive innovation
Source: 2006 IBM Global CEO Survey
Over 60% of CEOs believe their organizations need to do a better job leveraging information
3
Customer Business Issues
Too much information and not knowing what’s important
– Not using demand signals to drive supply chain
– Not using customer analysis to tailor marketing and sales
– Not leveraging valuable unstructured information
Multiple versions of the truth
– Problems managing customer, product and partner interactions
– Regulatory compliance inhibited by poor transparency
Lack of trusted information
– Incomplete, out-of-date, inaccurate, misinterpreted data
– Difficult to understand or control how information is used
Lack of agility
– Inability to take advantage of opportunities for innovation
– Escalating costs due to inflexible systems and changing needs
4
Across the Enterprise
Efficiency
Process Simplification
– Eliminate redundancy
– Drive to standardization
– Promote reuse and data quality
Compliance
– Reduce risks with conflicting sources
– Make information transparent
"Infoglut"
– Manage expanding volume and velocity
– Control unstructured content
Vendor Consolidation
– Spend less on same functionality/technology
M&A
– Reduce integration burdens
Differentiation
Enterprise Agility
– Sense and respond
– Provide consistency, accuracy
– Support continuous information flows
– Rapidly orchestrate processes
Real Time
– Enable closed-loop analytics
– Immediately integrate with partners, suppliers
Single View
– Create consistent and holistic view across all channels
– Manage relationships
Revenue Optimization
– Support top-line growth on cross-sell/upsell
– Leverage global purchasing power
Across All Content
Enterprise Information Management
Trx., Documents, Media, Customers, Employees, Partners, Databases, Orgs., Financials, Products, Web Content, Reports, E-Mail
Source: Enterprise Information Management: Getting Value From Information Assets, David Newman, Gartner Business Intelligence Summit 2006, 6-8 March 2006
What is Driving the Change? – Gartner Perspective
5
Across Structured, Semi-Structured and Unstructured Content
Across Transactional, Operational and Analytical Sources
Enterprise Data Warehouse
Customer Master Data
Product Master Data
Asset Master Data
External Data Sources
Metadata Management and Semantic Reconciliation
• Models
• Standards
• Schemas
• Repositories
• Search
• Business Rules
• Classification
Data Services
• Data Transformation
• Content Integration
• Data Access
• Data Quality
• Data Stewardship
• Data Sourcing
• Data Movement
• Data Enrichment
Customer Data Integration
Product Content & Data Management
Business Intelligence Applications
Enterprise Content Management
Custom Applications
External Services
Package Applications
Business Services Repository
Integrated Composition Technologies
Business Process Composition
How Gartner Defines the Requirement
Enterprise Information Management: Getting Value From Information Assets Gartner Business Intelligence Summit 2006 David Newman 6-8 March 2006
6
Developers
Subject Matter Experts
Data Analysts
Business Users
Architects
DBAs
Why aren’t my tools more integrated?
How do I know that I have an accurate view?
How can I actively collaborate with developers?
What about Governance, Security, Scalability?
Simplify administration, deployment, & maintenance
I want the tools to work the way I do.
7
Developers
Subject Matter Experts
Data Analysts
Business Users
Architects
DBAs
Simplify the Complexity of Information Integration
Facilitate Effective Business & IT Collaboration
Build Tools to Support the Way I Do My Job
Leverage My Existing IT Investments
Promote Reuse across Projects and the Enterprise
Major Development Themes
Customer Inspired Innovation
8
The Construction of Our Platform
2002 2003 2004 2005 2006
Metadata Integration
Integrated Cleansing
Integrated Matching
Integrated Profiling
SOA Deployment
Grid Deployment
Integrated Semi-structured Data Handling
Architectural Unification
IBM Information Server
Ascential Acquisition
Federation, Replication, Event Pub (pre-2002)
Mainframe Integration
Content Integration
Unstructured Information Mgmt
Ascential
IBM
9
“Those organizations that do nothing and continue to ignore the issue of poor data quality will at best continue to operate inefficiently, miss opportunities, and invest their resources in the wrong areas. At worst, they will experience a slow decline in their ability to compete, eventually leading to failure.”
Ted Friedman, Gartner Group
Delivering Information You Can TRUST
Auditable Data Quality in Batch and Real-time
10
Improved Productivity
Role-Optimized Tools
11
Simplify Integration
Increase trust and confidence in information
Increase compliance to standards
Facilitate change management & reuse
Design Operational
Enhanced Collaboration & Alignment Role-Optimized Tools with Integrated Metadata
Unified Metadata Management
12
Reuse Designed Into the Platform
Advanced Search & Impact Analysis
13
Linear Scalability To Support Growth
Parallel Processing & Rich Connectivity
Understand Cleanse Transform Deliver
• Design sequentially, deploy in parallel
• Codeless parallelization
• Proven linear scalability
Unified Metadata Management
Unified Parallel Processing
High Performance Connectivity - Structured, Unstructured, Applications, Mainframe
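The "design sequentially, deploy in parallel" idea can be illustrated with a minimal Python sketch. This is not how IBM Information Server implements parallelism; the step functions, the `FLOW` list, and the sample rows are hypothetical. The point is only that the flow is written once as an ordered list of steps, and the same flow is then run unchanged over several partitions of the data.

```python
from concurrent.futures import ThreadPoolExecutor


def cleanse(row):
    # Hypothetical cleansing step: trim and normalize the name.
    return {**row, "name": row["name"].strip().title()}


def transform(row):
    # Hypothetical transformation step: derive an order total.
    return {**row, "total_cents": row["quantity"] * row["unit_cents"]}


# The flow is designed once, as an ordered list of steps.
FLOW = [cleanse, transform]


def run_flow(rows):
    """Apply every step of FLOW, in order, to one partition of rows."""
    for step in FLOW:
        rows = [step(row) for row in rows]
    return rows


def run_parallel(rows, partitions=4):
    """Split rows into partitions and run the same flow on each concurrently."""
    chunks = [rows[i::partitions] for i in range(partitions)]
    with ThreadPoolExecutor(max_workers=partitions) as pool:
        results = pool.map(run_flow, chunks)
    return [row for chunk in results for row in chunk]


ROWS = [{"name": f" customer {i} ", "quantity": i, "unit_cents": 250} for i in range(8)]
SEQUENTIAL = run_flow(ROWS)
PARALLEL = run_parallel(ROWS)
```

The flow definition does not change between the two runs; only the deployment does. Threads are used here to keep the sketch short, so it shows the partitioning pattern rather than any real speed-up.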
14
IBM Information Server
Delivering information you can trust
Understand Cleanse Transform Deliver
Parallel Processing
Rich Connectivity to Applications, Data, and Content
IBM Information Server
Understand: Discover, model, and govern information structure and content
Cleanse: Standardize, merge, and correct information
Transform: Combine and restructure information for new uses
Deliver: Synchronize, virtualize, and move information for in-line delivery
Unified Deployment
Unified Metadata Management
15
Where is my information?
How do I get it when I need it?
What does it mean?
Can I trust it?
How do I get it in the form I need?
How do I get it where it needs to go?
How do I control it?
Why Is It Important to Start with Understanding?
16
Business Glossary
Business Metadata
Web-based authoring, managing & sharing of business metadata
Aligns the efforts of IT with the goals of the business
Provides business context to information technology assets
Establishes responsibility and accountability
Subject Matter Experts
Create and manage business vocabulary and relationships, while linking to physical sources
Business Glossary
Business Users
Business View
GL Account Number
The ten digit account number. Sometimes referred to as the account ID. This value is of the form L-FIIIIVVVV.
Database = DB2
Schema = NAACCT
Table = DLYTRANS
Column = ACCT_NO
data type = char(11)
Technical Business
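The link between a business term and its physical source, as in the GL Account Number example above, can be sketched as a small data structure. This is an illustrative model only, not the Business Glossary schema; the class names, field names, and the `terms_for_column` helper are assumptions.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class PhysicalColumn:
    """Technical view: where the data physically lives."""
    database: str
    schema: str
    table: str
    column: str
    data_type: str

    def qualified_name(self) -> str:
        return f"{self.schema}.{self.table}.{self.column}"


@dataclass
class BusinessTerm:
    """Business view: what the data means, linked to physical sources."""
    name: str
    definition: str
    synonyms: list = field(default_factory=list)
    linked_columns: list = field(default_factory=list)


ACCT_NO = PhysicalColumn("DB2", "NAACCT", "DLYTRANS", "ACCT_NO", "char(11)")
GL_ACCOUNT = BusinessTerm(
    name="GL Account Number",
    definition="The ten digit account number, of the form L-FIIIIVVVV.",
    synonyms=["account ID"],
    linked_columns=[ACCT_NO],
)


def terms_for_column(terms, qualified_name):
    """Return the names of business terms linked to a given physical column."""
    return [
        term.name
        for term in terms
        if any(col.qualified_name() == qualified_name for col in term.linked_columns)
    ]
```

Looking up `NAACCT.DLYTRANS.ACCT_NO` then answers "what does this column mean?" from the technical side, while the term's `linked_columns` answers "where is this stored?" from the business side.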
Understand
17
Information Analyzer
Physical Metadata
Data-centric analysis of application, database and file-based sources
Secure, detailed profiling of fields, across fields, and across sources
Creation of metadata from profiling results
Results instantly promotable across IBM Information Server
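Field-level profiling of the kind listed above can be illustrated with a short sketch. It is a generic technique, not the Information Analyzer API; the function names are hypothetical, and the sample values are borrowed from the Tax ID column shown on the cleansing slide later in this deck, plus one empty value added for illustration.

```python
from collections import Counter


def value_pattern(value: str) -> str:
    """Map digits to 9 and letters to A, keeping punctuation as-is."""
    return "".join(
        "9" if ch.isdigit() else "A" if ch.isalpha() else ch for ch in value
    )


def profile_column(values):
    """Compute basic profiling metadata for one column of string values."""
    present = [v for v in values if v not in (None, "")]
    patterns = Counter(value_pattern(v) for v in present)
    return {
        "count": len(values),
        "nulls": len(values) - len(present),
        "distinct": len(set(present)),
        "patterns": patterns.most_common(),
    }


TAX_IDS = ["228-02-1975", "025-37-1888", "34-2671434", "508-466-1200", ""]
PROFILE = profile_column(TAX_IDS)
```

The pattern frequencies are what expose "data surprises": a value shaped like `999-999-9999` in a tax ID column looks like a telephone number that landed in the wrong field.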
Analyze source data structures, and monitor adherence to integration and quality rules
IBM Information Analyzer
Data Analysts
Subject Matter Experts
Physical View
Understand
18
Rational Data Architect
Logical Metadata
Data modeling for data structures and federations
Federated data discovery
Metadata relationship discovery & mapping
Impact analysis, and synchronization across models
SQL & XML generation capabilities
Subject Matter Experts
Create and manage business vocabulary and relationships, while linking to physical sources
Data Modeling & Mapping
Architects
Rational Data Architect
Understand
19
Why Should I Care About Cleansing Information?
Lack of information standards
– Different formats & structures across different systems
Data surprises in individual fields
– Data misplaced in the database
Information buried in free-form fields
Data myopia
– Lack of consistent identifiers inhibits a single view
The redundancy nightmare
– Duplicate records with a lack of standards
Kate A. Roberts      416 Columbus Ave #2, Boston, Mass 02116
Catherine Roberts    Four sixteen Columbus APT2, Boston, MA 02116
Mrs. K. Roberts      416 Columbus Suite #2, Suffolk County 02116

Name                       Tax ID          Telephone
J Smith DBA Lime Cons.     228-02-1975     6173380300
Williams & Co. C/O Bill    025-37-1888     415-392-2000
1st Natl Provident         34-2671434      3380321
HP 15 State St.            508-466-1200    Orlando

WING ASSY DRILL 4 HOLE USE 5J868A HEXBOLT 1/4 INCH
WING ASSEMBY, USE 5J868-A HEX BOLT .25” - DRILL FOUR HOLES
USE 4 5J868A BOLTS (HEX .25) - DRILL HOLES FOR EA ON WING ASSEM
RUDER, TAP 6 WHOLES, SECURE W/KL2301 RIVETS (10 CM)

19-84-103 RS232 Cable 6' M-F CandS
CS-89641 6 ft. Cable Male-F, RS232 #87951
C&SUCH6 Male/Female 25 PIN 6 Foot Cable

90328574 IBM 187 N.Pk. Str. Salem NH 01456
90328575 I.B.M. Inc. 187 N.Pk. St. Salem NH 01456
90238495 Int. Bus. Machines 187 No. Park St Salem NH 04156
90233479 International Bus. M. 187 Park Ave Salem NH 04156
90233489 Inter-Nation Consults 15 Main Street Andover MA 02341
90345672 I.B. Manufacturing Park Blvd. Bostno MA 04106
20
QualityStage
Data Cleansing
Specialized data quality functions seamlessly integrated with DataStage
Visual tools for defining complex matching and survivorship logic
Ensures clean, standardized, de-duplicated information
Enables a single version of the truth
Cleanse
Subject Matter Experts
Standardize and correct source data fields, and match records together across sources to create a single view
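Standardizing fields and then matching on the standardized values can be sketched in a few lines. This is a deliberately simple exact-key match, not QualityStage's matching logic; the abbreviation table and function names are hypothetical, and the records are five of the company rows from the earlier cleansing slide, with the ZIP code left out of the match key because the sample ZIPs are inconsistent.

```python
import re
from collections import defaultdict

# Hypothetical abbreviation table; a real rule set would be far larger.
STREET_TOKENS = {
    "n": "north", "no": "north", "pk": "park",
    "st": "street", "str": "street", "ave": "avenue",
}


def standardize_street(raw: str) -> str:
    """Lowercase, drop punctuation, and expand known abbreviations."""
    tokens = re.sub(r"[^a-z0-9]+", " ", raw.lower()).split()
    return " ".join(STREET_TOKENS.get(token, token) for token in tokens)


RECORDS = [
    ("90328574", "IBM", "187 N.Pk. Str.", "Salem", "NH"),
    ("90328575", "I.B.M. Inc.", "187 N.Pk. St.", "Salem", "NH"),
    ("90238495", "Int. Bus. Machines", "187 No. Park St", "Salem", "NH"),
    ("90233479", "International Bus. M.", "187 Park Ave", "Salem", "NH"),
    ("90233489", "Inter-Nation Consults", "15 Main Street", "Andover", "MA"),
]


def match_groups(records):
    """Group record IDs whose standardized street, city, and state agree."""
    groups = defaultdict(list)
    for rec_id, _name, street, city, state in records:
        key = (standardize_street(street), city.lower(), state.lower())
        groups[key].append(rec_id)
    return {key: ids for key, ids in groups.items() if len(ids) > 1}


DUPLICATES = match_groups(RECORDS)
```

Three spellings of the same street collapse to one key once standardized, while "187 Park Ave" stays separate. Deciding whether that fourth record is also the same company would need probabilistic or rule-based matching on the name, which this sketch does not attempt.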
QualityStage™
Visual Match Rule Design
Data Analysts
21
What Is Important About Transformation & Delivery?
Transformation is key to enabling information to be used in new business contexts – it needs to be metadata-driven
Designed for use by information experts using the understanding imparted by the metadata
Transformation and Delivery can be reused across multiple mechanisms
– Large volume batch movement
– Real-time event-driven response
– Service-oriented architecture
– Federated query
Data Analysts
Data Architects
DBAs
Subject Matter Experts
Request Response
Logic Reuse
Query
22
DataStage
Data Transformation & Movement
Codeless visual design of data flows with hundreds of built-in transformation functions
Optimized reuse of data integration objects
Leverages parallel processing without requiring design changes
Capable of supporting batch and real-time operations
Transform
Transform and aggregate any volume of information in batch or real time through visually designed logic
Hundreds of Built-in Transformation Functions
Architects
Developers
DataStage®
23
Federation Server
Data Federation
Access diverse & distributed information as if it were in one system
Industry leading query optimization with single sign-on, unified views, and function compensation
Transactional write capabilities across heterogeneous sources
Visual tools for federated data discovery & data modeling
Federate
Federation Server
Access and integrate heterogeneous information across multiple sources as if they were a single source
Extend value of existing analytical applications by providing real-time access to integrated information
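The idea of querying several sources "as if they were a single source" can be illustrated, very loosely, with SQLite's ability to attach a second database to one connection. This is only an analogy: both sources here are SQLite, whereas a federation server spans different database engines. The table names and sample rows are hypothetical.

```python
import sqlite3

# One connection, two separate databases standing in for two source systems.
conn = sqlite3.connect(":memory:")
conn.execute("ATTACH DATABASE ':memory:' AS orders_db")

conn.execute("CREATE TABLE main.customers (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("CREATE TABLE orders_db.orders (customer_id INTEGER, amount_cents INTEGER)")
conn.executemany(
    "INSERT INTO main.customers VALUES (?, ?)", [(1, "Acme"), (2, "Globex")]
)
conn.executemany(
    "INSERT INTO orders_db.orders VALUES (?, ?)", [(1, 1200), (1, 800), (2, 500)]
)

# A single SQL statement joins across both databases.
FEDERATED_QUERY = """
SELECT c.name, SUM(o.amount_cents) AS total_cents
FROM main.customers AS c
JOIN orders_db.orders AS o ON o.customer_id = c.id
GROUP BY c.name
ORDER BY c.name
"""
TOTALS = conn.execute(FEDERATED_QUERY).fetchall()
```

The application writes one query and never copies data between the two stores, which is the property the slide describes.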
Visual Federation Design
24
Data Synchronization: Replication Server
Deliver
Distribution, consolidation or synchronization of information in different databases
Multi-directional delivery
Ease-of-use features:
– Integrated monitoring and statistics
– Changed data histories
– Configuration options:
  • Wizard-driven GUI
  • Command-line processor
  • Script-driven processor
Replication Server
Replicate and synchronize between databases at high speed based on data events for high availability & disaster recovery, data synchronization, and data distribution
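Replication "based on data events" can be sketched as replaying a log of captured changes against a target. The event format, the `apply_change` and `replicate` functions, and the sample rows below are hypothetical and do not reflect Replication Server's actual formats or interfaces.

```python
def apply_change(target: dict, event: dict) -> None:
    """Apply one captured change event to the target table, keyed by primary key."""
    op, key = event["op"], event["key"]
    if op in ("insert", "update"):
        # Merge changed columns into the existing row, or create the row.
        target[key] = {**target.get(key, {}), **event["row"]}
    elif op == "delete":
        target.pop(key, None)
    else:
        raise ValueError(f"unknown operation: {op}")


def replicate(change_log, target):
    """Replay change events in order so the target converges on the source."""
    for event in change_log:
        apply_change(target, event)
    return target


CHANGE_LOG = [
    {"op": "insert", "key": 1, "row": {"name": "Acme", "city": "Boston"}},
    {"op": "insert", "key": 2, "row": {"name": "Globex", "city": "Salem"}},
    {"op": "update", "key": 1, "row": {"city": "Andover"}},
    {"op": "delete", "key": 2},
]
TARGET = replicate(CHANGE_LOG, {})
```

Because only changes travel, the target stays in step with the source without full reloads. Keeping the log itself also gives a changed-data history.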
Easy Replication Wizard
25
Flexibly deploy and manage reusable information services without hand coding
Information Services Director
Rapid SOA Deployment
Architects
Packages information integration logic as services that insulate developers from underlying sources
Allows these services to be invoked as EJB, JMS, or Web services
Provides load balancing & fault tolerance for requests across multiple Information Servers
Provides foundation infrastructure for Information Services
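Packaging integration logic as a service that insulates developers from the underlying sources can be sketched as a small facade. The class, its method, the source names, and the sample data are all hypothetical. The sketch leaves out the EJB, JMS, or Web service transport and does not represent the Information Services Director API.

```python
class CustomerOrderHistoryService:
    """Facade that hides which sources supply customer and order data."""

    # Hypothetical source names, reported to callers as lineage.
    LINEAGE = ("crm.customers", "erp.orders")

    def __init__(self, customers, orders):
        self._customers = customers  # stand-in for one source system
        self._orders = orders        # stand-in for another source system

    def get_order_history(self, customer_id):
        """Return one customer's orders, oldest first, with lineage attached."""
        customer = self._customers[customer_id]
        history = [o for o in self._orders if o["customer_id"] == customer_id]
        return {
            "customer": customer["name"],
            "orders": sorted(history, key=lambda o: o["date"]),
            "lineage": list(self.LINEAGE),
        }


SERVICE = CustomerOrderHistoryService(
    customers={7: {"name": "Kate Roberts"}},
    orders=[
        {"customer_id": 7, "date": "2006-03-08", "item": "RS232 cable"},
        {"customer_id": 7, "date": "2006-01-15", "item": "hex bolt"},
        {"customer_id": 9, "date": "2006-02-01", "item": "rivets"},
    ],
)
RESULT = SERVICE.get_order_history(7)
```

Callers depend only on `get_order_history`. The sources behind it can be moved or replaced without changing the calling code.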
Developers
26
Actionable Information Services
IBM Information Server
Customer Order History
Customer Order History Service
Other Data Sources
Content Repositories
• Where does the information come from?
• What happens to it along the way?
• How does this fit into how the business defines things?
• How do I know I’m using the right service?
Information Services provide a basis for trust in information – providing visibility into lineage, relationships to other systems, and business definition
27
Actionable Information Services
IBM Information Server
Process Server
Order Handling Process
Receive Request
Check Account
Ship Order
Check Order Policy
Review Order
Customer Order History Customer
Order History Service
Customer Customer Info Customer
Customer Master Service
Customer Order Status Customer
Data Cleansing Service
Portal Call CenterPortal
28
Service Oriented Architecture
Information as a Service is Key
Getting the right data quickly and consistently for all applications continues to be a key challenge for many enterprises.
Forrester, January 2006
IBM Information Server
WebSphere Process Server
WebSphere Portal
WebSphere ESB
You will waste your investment in SOA unless you have enterprise information that SOA can exploit.
Gartner, March 2005
29
IBM Information Server Architecture
UNIFIED USER INTERFACE
Analysis Interface
Web Admin Interface
Development Interface
COMMON SERVICES
Metadata Services
Security Services
Logging & Reporting Services
Unified Service Deployment
UNIFIED METADATA
Design Operational
UNIFIED PARALLEL PROCESSING
Understand Cleanse Transform Deliver
COMMON CONNECTIVITY
Structured, Unstructured, Applications, Mainframe
30
WebSphere Metadata Server
MetaBrokers
Information Services Framework
Dynamic Repository
Analysis Services
Data Lineage
Impact Analysis
Object Merge
Import/Export
Operational Repository
Common Model
Access Services
CRUD
WebSphere Metadata Server
Rational Data Architect
Third Party Tools
WebSphere Business Glossary
WebSphere Information Analyzer
WebSphere Information Services Director
IBM Info. Server
Admin Reporting
WebSphere DataStage
WebSphere QualityStage
31
Information Integration Services
Providing a Roadmap to Project Success
Center of Excellence for Data Integration
Iterations® Methodology
Architecture Design/Support
Education and Mentoring
Enterprise Data Models
Virtual Consulting
Certification
32
Customers Achieve Significant Productivity Benefits (1)
Example ETL Project

Project Phase                        Approx. Project Effort    Productivity Gain
Source System Analysis               30%                       50+% gain
Data Cleansing                       20%                       20+% gain
Transformation Logic Construction    20%                       40+% gain
Data Management Services             20%                       30+% gain
Application System Connectivity      10%                       50+% gain
Total                                100%

(1) Compared to hand coding – gathered from IBM project studies
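One way to read the figures on this slide is as an effort-weighted average gain across the whole project. The sketch below assumes that the effort shares and gains pair up with the phases in the order listed, and that each "N+%" figure can be treated as a lower bound of N%. Neither assumption is stated on the slide, so the result is illustrative only.

```python
# (phase, share of project effort in %, quoted minimum gain in %)
# The pairing of rows is an assumption based on the order of the slide text.
PHASES = [
    ("Source System Analysis", 30, 50),
    ("Data Cleansing", 20, 20),
    ("Transformation Logic Construction", 20, 40),
    ("Data Management Services", 20, 30),
    ("Application System Connectivity", 10, 50),
]

TOTAL_EFFORT = sum(effort for _, effort, _ in PHASES)

# Weight each phase's gain by the share of project effort it represents.
WEIGHTED_GAIN = sum(effort * gain for _, effort, gain in PHASES) / TOTAL_EFFORT
```

Under those assumptions the effort shares sum to 100% and the weighted figure works out to at least 38% across the example project.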
33
– Helps you leverage information in an SOA for greater agility
– Delivers trusted information to key business initiatives, such as business intelligence or compliance
– Unlocks information from silos and provides a single view of the truth
Information is the key to Business Innovation
Silos of Business
Information
IBM Information Server revolutionizes information integration
34