+ All Categories
Home > Documents > Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

Date post: 02-Jan-2016
Category:
Upload: marilynn-watson
View: 216 times
Download: 0 times
Share this document with a friend
Popular Tags:
29
Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008
Transcript
Page 1: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

Data Warehouse Design

Xintao Wu

University of North Carolina at CharlotteNov 10, 2008

Page 2: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

2

Organization

• Concepts Data Warehousing Concepts (Ch1)

• Logical Design Logical design in data warehouse (Ch2)

• Physical Design Physical design in data warehouses (Ch3) Hardware and I/O considerations Parallelism and partitioning in data warehouses Indexes (Ch6) Integrity constraints (Ch7) Basic Materialized views (Ch8) Advanced materialized views Dimensions (Ch10)

Page 3: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

3

Organization• Managing DW environment

Overview of extraction, transformation, and loading Extraction Transportation Loading and transformation Maintaining the DW Change data capture SQLAccess advisor

• DW performance Query rewrite Schema modeling techniques SQL for aggregation in DW SQL for analysis and reporting SQL for modeling OLAP and data mining Using parallel execution

Page 4: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

4

What is DW

Page 5: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

5

Page 6: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

6

Page 7: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

7

Page 8: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

8

Logical vs. physical design

• In the logical design, you look at the logical relationships among the objects.

• In the physical design, you look at the most effective way of storing and retrieving the objects as well as handling them from a transportation and backup/recovery perspective.

• Your logical design should result in a set of entities and attributes corresponding to fact tables and

dimension tables A model of operational data from your source into subject-oriented

informaiton in your target data warehouse schema.

Page 9: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

9

Page 10: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

10

Page 11: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

11

Page 12: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

12

Page 13: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

13

Physical Design• Logical design can use pen/paper/oracle warehouse builder/oracle designer

while physical design is the creation of database with SQL• Physical design decisions are mainly driven by query performance and

database maaintenance aspects.• You need to create

Tablespaces Tables and partitioned tables Views

A view takes the output of a query and treats it as a table. Views do not require any space in the database

Integrity constraints In OLTP, they prevent the insertion of invalid data while in DW, they are only used for

query rewrite. Dimensions

A schema object that defines hierarchical relationships between columns or column sets. Indexes and partitioned indexes

Bitmap indexes vs. B-tree indexes. Bitmap indexes are efficient for set-oriented operations.

Materialized views Query results that have been stored in advance .

Page 14: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

14

Partition and parallel execution

•Range partitioning•Hash partitioning•List partitioning•Composite partitioning

Page 15: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

15

Bitmap index

Page 16: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

16

Page 17: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

17

One dimension table columns joins one fact table

Page 18: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

18

Page 19: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

19

extension

Page 20: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

20

Page 21: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

21

Integrity constraints

• Unique constraints• NOT NULL constraints• FOREIGN KEY constraints

Page 22: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

22

Page 23: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

23

Basic materialized views

Page 24: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

24

Materialized views with aggregates

Page 25: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

25

Page 26: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

26

Dimension

Page 27: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

27

Page 28: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

28

Page 29: Data Warehouse Design Xintao Wu University of North Carolina at Charlotte Nov 10, 2008.

29


Recommended