+ All Categories
Home > Data & Analytics > Warehouse chapter3

Warehouse chapter3

Date post: 26-Jan-2017
Category:
Upload: lal-shaik
View: 12 times
Download: 0 times
Share this document with a friend
30
Defining Data Warehouse Concepts and Terminology Chapter 3
Transcript
Page 1: Warehouse chapter3

Defining Data Warehouse Concepts and Terminology

Chapter 3

Page 2: Warehouse chapter3

Definition of a Data Warehouse

“ An enterprise structured repository of subject-oriented, time-variant, historical data used for information retrieval and decision support. The data warehouse stores atomic and summary data.”

Oracle Data Warehouse Method

Page 3: Warehouse chapter3

Data Warehouse Properties

DataWarehouse

Integrated

Time VariantNon Volatile

SubjectOriented

Page 4: Warehouse chapter3

Subject-OrientedData is categorized and stored by business subjectrather than by application

EquityPlans Shares Customer

financialinformation

Savings

Insurance

Loans

OLTP Applications Data Warehouse Subject

Page 5: Warehouse chapter3

Integrated

OLTP Applications

Savings

Currentaccounts

Loans

Data Warehouse

Data on a given subject is defined and stored once.

Customer

Page 6: Warehouse chapter3

Time-VariantData is stored as a series of snapshots, each representing a period of time

Time DataJan-97 JanuaryFeb-97 FebruaryMar-97 March

Page 7: Warehouse chapter3

NonvolatileTypically data in the data warehouse is not updated or delelted.

Insert UpdateDelete

Read Read

Operational Warehouse

Load

Page 8: Warehouse chapter3

Changing Data

Warehouse Database

First time load

Refresh

Refresh

Refresh

Operational Database

Page 9: Warehouse chapter3

Data Warehouse Versus OLTP

PropertyResponseTimeOperationsNature of Data

Data OrganizationSize

Data Source

Activities

OperationalSub seconds to secondsDML30-60 days

ApplicationsSmall to large

Operational, Internal

Processes

Data Warehouse

Seconds to hours

Snapshots over timeSubject, time

Large to very largeOperational, Internal,External

Analysis

Primarily read only

Page 10: Warehouse chapter3

Usage CurvesOperational system is predictableData warehouse - Variable - Random

Page 11: Warehouse chapter3

User ExpectationsControl expectationsSet achievable targets for query

responseSet SLAsEducateGrowth and use is exponential

Page 12: Warehouse chapter3

Enterprisewide WarehouseLarge scale implementationScope the entire businessData from all subject areasDeveloped incrementallySingle source of enterprisewide dataSingle distribution point to

dependent data marts

Page 13: Warehouse chapter3

Data Warehouses Versus Data Marts

Property Data Warehouse Data MartScope Enterprise DepartmentSubject Multiple Single-subject, LOBData Source Many FewSize(typical) 100 GB to>1 TB <100 GBImplementation time Months to years Months

DataWarehouse

DataMart

Page 14: Warehouse chapter3

Dependent Data Mart

MarketingSales

FinanceHuman Resources

Marketing

Marketing

Marketing

External Data

DataWarehouse

OperationalSystems

Flat Files

Data Marts

Page 15: Warehouse chapter3

Independent Data Mart

OperationalSystems

External Data

Sale or Marketing

Flat Files

Page 16: Warehouse chapter3

Data Warehouse TerminologyOperational data store (ODS) Stores tactical data from production

systems that are subject-oriented and integrated to address operational needs

MetadataMetadata

Page 17: Warehouse chapter3

Data Warehouse Terminology

DataIntegration

Enterprise data warehouse

Business areawarehouse

Source data

Architecture

Page 18: Warehouse chapter3

MethodolgyEnsures a successful data warehouseEncourages incremental developmentProvides a staged approach to an

enterprisewide warehouse - Safe - Manageable - Proven - Recommended

Page 19: Warehouse chapter3

ModelingWarehouses differ from operational structures: - Analytical requirements - Subject orientationData must map to subject oriented information: - Identify business subjects - Define relationships between subjects - Name the attributes of each subjectModeling is iterativeModeling tools are available

Page 20: Warehouse chapter3

Extraction, Transformation, and Transportation

Purchase specialist tools, or develop programsExtraction-- select data using different methodsTransformation--validate, clean, integrate, and

time stamp dataTransportation--move data into the warehouse

OLTP Databases Staging File Warehouse Database

Page 21: Warehouse chapter3

Data ManagementEfficient database server and management

tools for all aspects of data managementImperatives - Productive - Flexible - Robust - EfficientHardware, operating system and network

management

Page 22: Warehouse chapter3

Data Access and Reporting

Tools that retrieve data for business analysis Imperatives - Ease of use - Intuitive - Metadata - Training More than one tool may be required

WarehouseDatabase

Simple Queries

Forecasting

Drill-down

Page 23: Warehouse chapter3

Oracle Warehouse Components

Relational / Multidimensional

Text, image Spatial

Web Audio video

Externaldata

Operational data

Relational tools

OLAP tools

Applications/Web

Any DataAny Source Any Access

Page 24: Warehouse chapter3

Oracle Data Mart Suite

Data ModelingOracle Data Mart Designer

OLTPEngines

OLTPDatabases

DataExtraction

Oracle Data MartBuilder

Ware-housingEngines

Data MartDatabase

SQL*Plus

DataManagement

Oracle EnterpriseManager

Data Access& Analysis

Discoverer &Oracle Reports

Page 25: Warehouse chapter3

Data Mart Implementation with the Oracle Data Mart Suite

Oracle Enterprise ServerOracle Enterprise ManagerOracle Data Mart BuilderOracle Data Mart DesignerOracle DiscovererOracle Web Application ServerOracle Reports

Page 26: Warehouse chapter3

Oracle Warehouse Builder Architecture

Sources

ExtractionFacilities• Loader• Remotes SQL• Gateways - OLE-DB/ODBC - Mainframe - Specialized• ERP Data - SAP - Peoplesoft - Oracle

PL/SQL, JavaTransforms

TransformDriver

PL/SQL, JavaWrapper

ExternalFunctions

TargetTablesFilter

Transform

Oracle 8i

Page 27: Warehouse chapter3

Oracle Business Intelligence Tools

Current Tactical Strategic

IS developsuser’s Views Business users Analysis

Oracle Reports Oracle Discover Oracle Express

Page 28: Warehouse chapter3

The Tool for Each TaskTool

OracleReports

OracleDiscover

OracleExpress

Productionreporting

Ad hocquery and analysis

Advancedanalysis

Question

What were sales byregion last quarter?

What is driving the increase in NorthAmerican sales?

Given the rapid increasein Web sales, what willtotal sales be for the restof the year?

Task

Page 29: Warehouse chapter3

Oracle Warehouse Services

OracleEducation

OracleConsulting

Oracle Support Services

Customers

Page 30: Warehouse chapter3

SummaryThis lesson covered the following topics:Identifying a common, broadly accepted definition

of the data warehouseDistinguishing the differences between OLTP

systems and analytical systemsDefining some of the common data warehouse

terminologyIdentifying some of the elements and processes in a

data warehouseIdentifying and positioning the Oracle Warehouse

vision, products, and services


Recommended