School of Software, SUN YAT-SEN UNIVERSITY, Mar. 27, 2011

The Procedure of Installing SQL Server 2005

• Microsoft SQL Server 2005 Express

• Microsoft SQL Server Management Studio Express

Introduction to TPC-H and Generating lineitem.tbl

Import lineitem.tbl into SQL Server

Experiment on the Efficiency of Group By versus Group By With Cube

•Configuration requirements

•Install procedure for SQL Server 2005 Express

This situation applies only if VS2005 is already installed.

•Connect to SQL Server

•The interface of SQL Server

The TPC Benchmark™H (TPC-H) is a decision support benchmark.

The components of the TPC-H database are defined to consist of eight separate and individual tables.

•Get the tpch_2_14_0

• The DBGEN program can be downloaded at the following URL: http://www.tpc.org/tpch/spec/tpch_2_14_0.zip

• The schema of LINEITEM can be found on page 12 of tpch2.14.0.doc, which can be downloaded at the following URL: http://www.tpc.org/tpch/spec/tpch2.14.0_cb.doc

•Create lineitem.tbl (Linux)

•Create a new query

•Create database dbTPC

•Use graphical interfaces

•Create the table using SQL

use dbTPC

create table lineitem(
    orderkey int,
    partkey int,
    suppkey int,
    linenumber int,
    quantity int,
    -- decimal(15,2) keeps two fractional digits; plain decimal defaults to decimal(18,0) in SQL Server
    extendedprice decimal(15,2),
    discount decimal(15,2),
    tax decimal(15,2),
    returnflag nchar(1),
    linestatus nchar(1),
    shipdate datetime,
    commitdate datetime,
    receiptdate datetime,
    shipinstruct nchar(25),
    shipmode nchar(10),
    comment varchar(44)
)

•Create the table using the graphical interface

•Step 1: Import the file into SQL Server using BULK INSERT.

BULK INSERT Tablename
FROM 'D:\lineitem.tbl'
WITH (
    FIELDTERMINATOR = '|',
    ROWTERMINATOR = '|\r'
)
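Both terminators contain `|` because every row in a .tbl file ends with a trailing `|` before the line break. The following Python sketch splits one row the same way. The sample row is made up for illustration and is not real DBGEN output; the column names follow the create table statement above.

```python
# Hypothetical sample row in the pipe-delimited .tbl layout (not real DBGEN output).
SAMPLE_ROW = (
    "1|155190|7706|1|17|21168.23|0.04|0.02|N|O|"
    "1996-03-13|1996-02-12|1996-03-22|DELIVER IN PERSON|TRUCK|sample comment|\n"
)

COLUMNS = [
    "orderkey", "partkey", "suppkey", "linenumber", "quantity",
    "extendedprice", "discount", "tax", "returnflag", "linestatus",
    "shipdate", "commitdate", "receiptdate", "shipinstruct", "shipmode",
    "comment",
]


def parse_tbl_line(line):
    """Split one .tbl line into a dict of column name -> raw string value."""
    body = line.rstrip("\r\n")
    if not body.endswith("|"):
        raise ValueError("expected a trailing '|' before the line break")
    # Drop the trailing '|' (part of the row terminator), then split on the
    # field terminator.
    fields = body[:-1].split("|")
    if len(fields) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} fields, got {len(fields)}")
    return dict(zip(COLUMNS, fields))


row = parse_tbl_line(SAMPLE_ROW)
print(row["orderkey"], row["shipmode"], row["comment"])
```

Without stripping the trailing `|`, the split would yield an extra empty field after the comment column.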

•GROUP BY is most useful when it is combined with aggregate functions.

• When used together with GROUP BY, an aggregate function produces one value for each group rather than a single value for the whole table.

•Example: Display how many lineitems have each return status.

SQL:

SELECT returnflag, COUNT(*)

FROM lineitem

GROUP BY returnflag

•Example: Display the total quantity of the lineitems that belong to the same order and have the same return status.

SQL:

SELECT returnflag, orderkey, SUM(quantity)

FROM lineitem

GROUP BY returnflag, orderkey
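The two queries can be tried on a small table without a SQL Server installation. The sketch below uses Python's built-in sqlite3 module as a stand-in; the six rows are made-up values, and the ORDER BY clauses are added only to make the output order predictable.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE lineitem (orderkey INTEGER, returnflag TEXT, quantity INTEGER)"
)
# Hypothetical rows, chosen only to illustrate the grouping.
conn.executemany(
    "INSERT INTO lineitem VALUES (?, ?, ?)",
    [(1, "N", 17), (1, "N", 36), (1, "R", 8),
     (2, "N", 38), (3, "R", 45), (3, "A", 49)],
)

# One row per return status.
counts = conn.execute(
    "SELECT returnflag, COUNT(*) FROM lineitem "
    "GROUP BY returnflag ORDER BY returnflag"
).fetchall()

# One row per (return status, order) pair.
sums = conn.execute(
    "SELECT returnflag, orderkey, SUM(quantity) FROM lineitem "
    "GROUP BY returnflag, orderkey ORDER BY returnflag, orderkey"
).fetchall()

print(counts)  # [('A', 1), ('N', 3), ('R', 2)]
print(sums)    # [('A', 3, 49), ('N', 1, 53), ('N', 2, 38), ('R', 1, 8), ('R', 3, 45)]
```

Adding orderkey to the GROUP BY list splits each return-status group further, so the second query returns more rows than the first.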

The CUBE operator generates a result set that is a multidimensional cube.

A multidimensional cube is an expansion of fact data. The expansion is based on the columns that the user wants to analyze.

The cube is a result set that contains all the possible combinations of the dimensions.

SELECT Item, Color, SUM(Quantity) AS QtySum

FROM Inventory

GROUP BY Item, Color WITH CUBE

•These four rows report the original sums; in other words, we get four groups, each with its own sum value.

SELECT Item, Color, SUM(Quantity) AS QtySum

FROM Inventory

GROUP BY Item,Color

•These two rows report the subtotals for the Item dimension. Both have NULL in the Color dimension to show that the aggregate data came from rows having any value for the Color dimension.

SELECT Item, SUM(Quantity) AS QtySum

FROM Inventory

GROUP BY Item

•These two rows report the subtotals for the Color dimension. Both have NULL in the Item dimension to show that the aggregate data came from rows having any value for the Item dimension.

SELECT Color, SUM(Quantity) AS QtySum

FROM Inventory

GROUP BY Color

•This row reports the grand total for the cube. All values of both dimensions are summarized in the row.

SELECT SUM(Quantity) AS QtySum

FROM Inventory
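The four queries above can be combined into one result set that has the same shape as the WITH CUBE output. The sketch below is an emulation, not SQL Server itself: it runs in Python's built-in sqlite3 module, which has no WITH CUBE, so the four groupings are joined with UNION ALL. The Inventory rows are made-up values, and the row order may differ from what SQL Server returns.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Inventory (Item TEXT, Color TEXT, Quantity INTEGER)")
# Hypothetical data: two items in two colors.
conn.executemany(
    "INSERT INTO Inventory VALUES (?, ?, ?)",
    [("Chair", "Blue", 101), ("Chair", "Red", 210),
     ("Table", "Blue", 124), ("Table", "Red", 223)],
)

# 2 dimensions -> 2^2 = 4 groupings; NULL marks a dimension that was summed over.
CUBE_SQL = """
SELECT Item, Color, SUM(Quantity) AS QtySum FROM Inventory GROUP BY Item, Color
UNION ALL
SELECT Item, NULL, SUM(Quantity) FROM Inventory GROUP BY Item
UNION ALL
SELECT NULL, Color, SUM(Quantity) FROM Inventory GROUP BY Color
UNION ALL
SELECT NULL, NULL, SUM(Quantity) FROM Inventory
"""

cube_rows = conn.execute(CUBE_SQL).fetchall()
for item, color, qty_sum in cube_rows:
    print(item, color, qty_sum)
```

With this data the result has nine rows: four (Item, Color) sums, two Item subtotals, two Color subtotals, and one grand total.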

•We can extend this to n dimensions, where 2^n different combinations of the dimensions must be considered.

•Analyze the columns orderkey, partkey, suppkey, and linenumber of table lineitem using WITH CUBE.

•Use 16 GROUP BY queries to simulate the result set of GROUP BY WITH CUBE.
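The 16 queries correspond to the 2^4 subsets of the four columns, so they can be generated rather than written by hand. The Python sketch below is one possible way to do that. The helper names are hypothetical, and COUNT(*) is an assumed aggregate, since the slides do not state which aggregate the experiment used.

```python
from itertools import combinations

DIMENSIONS = ["orderkey", "partkey", "suppkey", "linenumber"]


def cube_groupings(dimensions):
    """Return every subset of the dimensions, largest subsets first."""
    return [
        list(subset)
        for size in range(len(dimensions), -1, -1)
        for subset in combinations(dimensions, size)
    ]


def group_by_query(columns, table="lineitem"):
    """Build one GROUP BY query; COUNT(*) is an assumed aggregate."""
    if not columns:
        # The empty subset is the grand total: no GROUP BY clause at all.
        return f"SELECT COUNT(*) FROM {table}"
    cols = ", ".join(columns)
    return f"SELECT {cols}, COUNT(*) FROM {table} GROUP BY {cols}"


queries = [group_by_query(cols) for cols in cube_groupings(DIMENSIONS)]
for query in queries:
    print(query)
```

The subsets break down as 1 + 4 + 6 + 4 + 1 = 16: one grouping on all four columns, four on three columns, six on two, four on one, and one with no grouping.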

GROUP BY                   Columns                                   Milliseconds

No grouping (1)                                                      16
Group with 4 columns (1)   orderkey, partkey, suppkey, linenumber    31
Group with 3 columns (4)   orderkey, partkey, suppkey                31
                           orderkey, partkey, linenumber             16
                           orderkey, suppkey, linenumber             31
                           partkey, suppkey, linenumber              16
Group with 2 columns (6)   orderkey, partkey                         16
                           orderkey, linenumber                      15
                           orderkey, suppkey                         16
                           suppkey, linenumber                       15
                           partkey, suppkey                          15
                           partkey, linenumber                       15
Group with 1 column (4)    orderkey                                  16
                           partkey                                   16
                           suppkey                                   31
                           linenumber                                16
Total                                                                302

GROUP BY WITH CUBE         orderkey, partkey, suppkey, linenumber    140

A. Use the DBGEN program of the TPC-H Benchmark to generate all eight tables of the TPC-H schema, with the Scale Factor set to 1.

B. Create a database with the eight tables, including possible constraints (you can refer to tpch2.14.0.doc), and then import the generated data.

Submit all nine queries and the time cost of importing the data.

THANK YOU!