Date post: | 18-Feb-2016 |
Category: |
Documents |
Upload: | rajesh-rai |
View: | 220 times |
Download: | 0 times |
April 13, 2011
AGENDA
Dec 16, 2013
What is BI??
BI Lifecycle
Chalenges of Building BI Data Warehouse Concepts Key Performance Indicator
Normalization and its type
Data Cleansing Slowly Changing Dimension
April 13, 2011
The term Business Intelligence (BI) represents the tools and systems that play a key role in the strategic planning process of the corporation.
These systems allow a company to gather, store, access and analyze corporate data to aid in decision-making.
It is an environment in which business users receive information that is reliable, secure, consistent, understandable, easily manipulated and timely...facilitating more informed decision making.
Dec 16, 2013
April 13, 2011October 30, 2012
Prod
Mkt
HR
Fin
Acctg
Data Sources
Transac tion D ata
IBM
IMS
VSAM
Oracle
Sybase
ETL Software Data Stores Data AnalysisTools and Applications
Users
Other In ternal D ata
ERP SAP
Clickstream Informix
W eb D ata
External Data
Demographic Harte-Hanks
STAG ING
AREA
OPERAT IONAL
DATA
STORE
Ascential
Extract
Sagent
SAS
Clean/ScrubTrans formFi rst logic
Load
Informatica
Data Mart sTeradataIBM
Data Warehouse
Meta Data
Finance
Marketing
Sales
Essbase
Microsoft
A NA L YS TS
M AN A GER S
EX EC U TIV ES
OPER A TIO NA LPER SO NN EL
C US TOM ER S/SU PPLIE RS
SQL
Cognos
SAS
Queri es,Reporting,DSS/EIS, Data Mining
Micro Strat egy
Siebel
BusinessObjects
WebBrowser
April 13, 2011
Data exists in multiple placesData is not formatted to support complex analysisDifferent kinds of workers have different data needsWhat data should be examined and in what detailHow will users interact with that data
Dec 16, 2013
April 13, 2011
Cubes Measures Key Performance Indicator Dimensions ---Attributes ---Hierarchy
Dec 16, 2013
April 13, 2011
DATA MODELING
• process that produces abstract data models for one or more database components of the data warehouse
Types of Data Modeling– Conceptual Data Model– Logical Data Model– Physical Data Model
Dec 16, 2013
April 13, 2011
KEY PERFORMANCE INDICATOR (KPI)
• A set of quantifiable measures that a company or industry uses to gauge or compare performance in terms of meeting their strategic and operational goals.
• KPIs vary between companies and industries, depending on their priorities or performance criteria
Dec 16, 2013
April 13, 2011
NORMALIZATIONDatabase normalization is the process of removing
redundant data from your tables in to improve storage efficiency, data integrity, and scalability.
Normalization Forms:
• First Normal Form(1NF)• Second Normal Form(2NF)• Third Normal Form(3NF)• Boyce-Codd Normal Form(BCNF)
Dec 16, 2013
April 13, 2011
3NF
Third normal form (3NF) requires that there are no functional dependencies of non-key attributes on something other than a candidate key. A table is in 3NF if all of the non-primary key attributes are mutually independent
There should not be transitive dependencies
Dec 16, 2013
If I know # of Pages, can I find out Author's Name? No. Can I find out Author's pseudonym? No. If I know Author's Name, can I find out # of Pages? No. Can I find out Author's pseudonym YES.
Therefore, Author's pseudonym is functionally dependent upon Author's Name, not the PK for its existence. It has to go.
Dec 16, 2013
BCNFClient Interview
-FD1 clientNo, interviewDate � interviewTime, staffNo, roomNo (Primary Key)-FD2 staffNo, interviewDate, interviewTime � clientNo (Candidate key)-FD3 roomNo, interviewDate, interviewTime � clientNo, staffNo (Candidate key)-FD4 staffNo, interviewDate � roomNo (not a candidate key)-As a consequece the ClientInterview relation may suffer from update anomalies.-For example, two tuples have to be updated if the roomNo need be changed for staffNo SG5 on the 13-May-02.
Dec 16, 2013
To transform the ClientInterview relation to BCNF, we must remove the violating
functional dependency by creating two new relations called
Interview and StaffRoom as shown below,
Interview (clientNo, interviewDate, interviewTime, staffNo),
StaffRoom(staffNo, interviewDate, roomNo)
Dec 16, 2013
April 13, 2011
SLOWLY CHANGING DIMENSION(SCD)
• Dimensions that change over time are called SCD• Example: Product Price, Customer Address etc
• SCD Types:– Type I SCD– Type II SCD– Type III SCD– Type 4 SCD
Dec 16, 2013
April 13, 2011
SCD (IMPLEMENTATION)DIMENSIONS
CATEGORYBRANDMODELSTOREVENDOR
FACTSINVENTORY_TRANSACTION_FILE
GOODS_RECEIVING
SCD IMPLEMENTED ON DIMENSION TABLES ONLY AND NOT ON FACT TABLES
CATEGORY (SCD I)
BRAND (SCD I)
MODEL (SCD II by DATE)
STORE (SCD II by FLAG)
VENDOR (SCD I)
Dec 16, 2013