+ All Categories
Home > Documents > BI Concepts

BI Concepts

Date post: 18-Feb-2016
Category:
Upload: rajesh-rai
View: 220 times
Download: 0 times
Share this document with a friend
Description:
na
32
Business Intelligence Concepts RAJASHMITA KAR [email protected]
Transcript

Business Intelligence Concepts

RAJASHMITA [email protected]

April 13, 2011

AGENDA

Dec 16, 2013

What is BI??

BI Lifecycle

Chalenges of Building BI Data Warehouse Concepts Key Performance Indicator

Normalization and its type

Data Cleansing Slowly Changing Dimension

April 13, 2011

What is BI??

Dec 16, 2013

April 13, 2011

The term Business Intelligence (BI) represents the tools and systems that play a key role in the strategic planning process of the corporation.

These systems allow a company to gather, store, access and analyze corporate data to aid in decision-making.

It is an environment in which business users receive information that is reliable, secure, consistent, understandable, easily manipulated and timely...facilitating more informed decision making.

Dec 16, 2013

April 13, 2011

BI LIFECYCLE

Dec 16, 2013

April 13, 2011October 30, 2012

Prod

Mkt

HR

Fin

Acctg

Data Sources

Transac tion D ata

IBM

IMS

VSAM

Oracle

Sybase

ETL Software Data Stores Data AnalysisTools and Applications

Users

Other In ternal D ata

ERP SAP

Clickstream Informix

W eb D ata

External Data

Demographic Harte-Hanks

STAG ING

AREA

OPERAT IONAL

DATA

STORE

Ascential

Extract

Sagent

SAS

Clean/ScrubTrans formFi rst logic

Load

Informatica

Data Mart sTeradataIBM

Data Warehouse

Meta Data

Finance

Marketing

Sales

Essbase

Microsoft

A NA L YS TS

M AN A GER S

EX EC U TIV ES

OPER A TIO NA LPER SO NN EL

C US TOM ER S/SU PPLIE RS

SQL

Cognos

SAS

Queri es,Reporting,DSS/EIS, Data Mining

Micro Strat egy

Siebel

BusinessObjects

WebBrowser

April 13, 2011

Challenges of Building BISolution

Dec 16, 2013

April 13, 2011

Data exists in multiple placesData is not formatted to support complex analysisDifferent kinds of workers have different data needsWhat data should be examined and in what detailHow will users interact with that data

Dec 16, 2013

April 13, 2011

Data WareHouse

Dec 16, 2013

April 13, 2011

Components of Data Warehouse

Dec 16, 2013

April 13, 2011

Cubes Measures Key Performance Indicator Dimensions ---Attributes ---Hierarchy

Dec 16, 2013

April 13, 2011

DATA MODELLING

Dec 16, 2013

April 13, 2011

DATA MODELING

• process that produces abstract data models for one or more database components of the data warehouse

Types of Data Modeling– Conceptual Data Model– Logical Data Model– Physical Data Model

Dec 16, 2013

April 13, 2011

DIMENSIONAL MODELLING

• Fact Table• Dimension Table

Dec 16, 2013

April 13, 2011

STAR SCHEMA AND SNOW FLAKES SCHEMA

STAR SCHEMA SNOW FLAKES SCHEMA

Dec 16, 2013

April 13, 2011

KEY PERFORMANCE INDICATOR (KPI)

• A set of quantifiable measures that a company or industry uses to gauge or compare performance in terms of meeting their strategic and operational goals.

• KPIs vary between companies and industries, depending on their priorities or performance criteria

Dec 16, 2013

April 13, 2011

NORMALIZATION

Dec 16, 2013

April 13, 2011

NORMALIZATIONDatabase normalization is the process of removing

redundant data from your tables in to improve storage efficiency, data integrity, and scalability.

Normalization Forms:

• First Normal Form(1NF)• Second Normal Form(2NF)• Third Normal Form(3NF)• Boyce-Codd Normal Form(BCNF)

Dec 16, 2013

April 13, 2011

3NF

Third normal form (3NF) requires that there are no functional dependencies of non-key attributes on something other than a candidate key. A table is in 3NF if all of the non-primary key attributes are mutually independent

There should not be transitive dependencies

Dec 16, 2013

If I know # of Pages, can I find out Author's Name? No. Can I find out Author's pseudonym? No. If I know Author's Name, can I find out # of Pages? No. Can I find out Author's pseudonym YES.

Therefore, Author's pseudonym is functionally dependent upon Author's Name, not the PK for its existence. It has to go.  

Dec 16, 2013

BCNFClient Interview

-FD1 clientNo, interviewDate � interviewTime, staffNo, roomNo (Primary Key)-FD2 staffNo, interviewDate, interviewTime � clientNo (Candidate key)-FD3 roomNo, interviewDate, interviewTime � clientNo, staffNo (Candidate key)-FD4 staffNo, interviewDate � roomNo (not a candidate key)-As a consequece the ClientInterview relation may suffer from update anomalies.-For example, two tuples have to be updated if the roomNo need be changed for staffNo SG5 on the 13-May-02.

Dec 16, 2013

To transform the ClientInterview relation to BCNF, we must remove the violating

functional dependency by creating two new relations called

Interview and StaffRoom as shown below,

Interview (clientNo, interviewDate, interviewTime, staffNo),

StaffRoom(staffNo, interviewDate, roomNo)

Dec 16, 2013

April 13, 2011

DATA CLEANSING

HOWWHAT

WHEN

WHY

Dec 16, 2013

April 13, 2011

SLOWLY CHANGING DIMENSION(SCD)

• Dimensions that change over time are called SCD• Example: Product Price, Customer Address etc

• SCD Types:– Type I SCD– Type II SCD– Type III SCD– Type 4 SCD

Dec 16, 2013

April 13, 2011

SCD (IMPLEMENTATION)DIMENSIONS

CATEGORYBRANDMODELSTOREVENDOR

FACTSINVENTORY_TRANSACTION_FILE

GOODS_RECEIVING

SCD IMPLEMENTED ON DIMENSION TABLES ONLY AND NOT ON FACT TABLES

CATEGORY (SCD I)

BRAND (SCD I)

MODEL (SCD II by DATE)

STORE (SCD II by FLAG)

VENDOR (SCD I)

Dec 16, 2013

April 13, 2011

THANK YOU

Dec 16, 2013


Recommended