Post on 26-Dec-2015
transcript
School of SoftwareSUN YAT-SEN UNIVERSITYMar, 27, 2011
The Procedure of Installing SQL Server 2005
• Microsoft SQL Server 2005 Express
• Microsoft SQL Server Management Studio Express
Introduction of TPC-H and Generate lineitem.tbl
Import Lineitem.tbl into SQL Server
Experiment about the Efficiency between Croup By
and Group By With Cube
•Configuration demands
•Install procedure for SQL server 2005 Express
•Install procedure for SQL server 2005 Express
•Install procedure for SQL server 2005 Express
•Install procedure for SQL server 2005 Express
•Install procedure for SQL server 2005 ExpressThis situation only for installing VS2005 already
•Install procedure for SQL server 2005 Express
•Install procedure for SQL server 2005 Express
•Connect to SQL Server
•Connect to SQL Server
•The interface of SQL Server
•The interface of SQL Server
The TPC Benchmark™H (TPC-H) is a decision support benchmark.
The components of the TPC-H database are defined to consist of eight separate and individual tables.
•Get the tpch_2_14_0
• The DBGEN program can be downloaded at the following URL:http://www.tpc.org/tpch/spec/tpch_2_14_0.zip• The schema of LINEITEM can be found at page 12in the tpch2.14.0.doc, which can be downloaded at thefollowing URL:
http://www.tpc.org/tpch/spec/tpch2.14.0_cb.doc
•Create lineitem.tbl (Linux)
•Create a new query
•Create database dbTPC
•Use graphical interfaces
•Use graphical interfaces
•Create the table use SQLuse dbTPCcreate table lineitem( orderkey int,
partkey int,suppkey int,linenumber int,quantity int,extendedprice decimal,discount decimal,tax decimal,returnflag nchar(1),linestatus nchar(1),shipdate datetime,commitdate datetime,receiptdate datetime,shipinstruct nchar(25),shipmode nchar(10),comment varchar(44)
)
•Create the table use interface
•Step 1 Import file into SQL Server Using Bulk Insert.
BULK INSERT TablenameFROM 'D: \lineitem.tbl'WITH ( FIELDTERMINATOR = '|', ROWTERMINATOR = '|\r')
•When GROUP BY and Aggregate Functions are used together, the practical meaning is significant.
• The Aggregate Functions generate a value for each group whenused together with GROUP BY, other than for the whole table.
•Example: Display the how many lineitems are at each
returning status.
SQL:
SELECT returnflag, COUNT(*)
FROM lineitem
GROUP BY returnflag
•Example: Display the quantity of lineitems which come from
the same order and at the same returning status.
order and they.
SQL:
SELECT returnflag, orderkey, SUM(quantity)
FROM lineitem
GROUP BY returnflag, orderkey
The CUBE operator generates a result set that is a
multidimensional cube.
A multidimensional cube is an expansion of fact data,
The expansion is based on columns that the user wants to analyze
The cube is a result set that contains all the possible combinations
of the dimensions.
SELECT Item, Color, SUM(Quantity) AS QtySum
FROM Inventory
GROUP BY Item, Color WITH CUBE
SELECT Item, Color, SUM(Quantity) AS QtySum
FROM Inventory
GROUP BY Item, Color WITH CUBE
•These four rows report the the original sum, in another wordsthis time we get four groups with their sum value.
SELECT Item, Color, SUM(Quantity) AS QtySum
FROM Inventory
GROUP BY Item,Color
•These two rows report the subtotals for the Item dimension. both have null in the Color dimension to show that aggregate date came from rows having any value for the Color dimension.
SELECT Item, SUM(Quantity) AS QtySum
FROM Inventory
GROUP BY Item
•These two rows report the subtotals for the Color dimension. both have null in the Item dimension to show that aggregate date came from rows having any value for the item dimension.
SELECT Color, SUM(Quantity) AS QtySum
FROM Inventory
GROUP BY Color
•This row reports the grand total for the cube. All values of bothdimensions are summarized in the row.
SELECT SUM(Quantity) AS QtySum
FROM Inventory
•Then we can extend this situation to n dimensions. 2n different combinations of the dimensions should be considered.
•Analysis the column orderkey, partkey, suppkey, linenumber of Table LineItem using WITH CUBE.
•Using 16 GROUP BY clauses simulate the result set of GROUP BY WITH CUBE.
GROUP BY millisecond
No grouping(1) 16
Group with 4 column(1) orderkey, partkey, suppkey, linenumber 31
Group with 3 column(4) orderkey, partkey, suppkey 31
orderkey, partkey,linenumber 16
orderkey, suppkey, linenumber 31
partkey, suppkey, linenumber 16
Group with 2 column(6) orderkey, partkey 16
orderkey, linenumber 15
orderkey, suppkey 16
suppkey, linenumber 15
partkey, suppkey 15
partkey, linenumber 15
Group with 1 column(4) orderkey 16
partkey 16
suppkey 31
linenumber 16
Total 302
GROUP BY WITH CUBE orderkey,partkey,suppkey,linenumber 140
A. Use the DBGEN program of the TPC-H Benchmark to generate all the eight tables of the TPC-H schema, with the Scale Factor set to 1.
B. Create a database with eight tables including possible constrains(You can refer to tpch2.14.0.doc), and then import the generated data.
Submit all the nine queries and the time cost for importing data.
THANK YOU!