+ All Categories
Home > Documents > MBWU: Benefit Quantification for Data Access Function Ooading

MBWU: Benefit Quantification for Data Access Function Ooading

Date post: 05-Dec-2021
Category:
Upload: others
View: 3 times
Download: 0 times
Share this document with a friend
30
MBWU: Benefit Quantification for Data Access Function Offloading Jianshen Liu 1 , Philip Kufeldt 2 , Carlos Maltzahn 1 1 University of California, Santa Cruz, 2 Seagate Technology June 20, 2019 1 This project is funded in part by NSF OAC-1836650, CNS-1764102, CNS-1705021, and the Center for Research in Open Source Soſtware (cross.ucsc.edu).
Transcript
Page 1: MBWU: Benefit Quantification for Data Access Function Ooading

MBWU: Benefit Quantification for Data Access Function Offloading

Jianshen Liu 1, Philip Kufeldt 2, Carlos Maltzahn 11 University of California, Santa Cruz, 2 Seagate Technology

June 20, 2019

1

This project is funded in part by NSF OAC-1836650, CNS-1764102,

CNS-1705021, and the Center for Research in Open Source Software

(cross.ucsc.edu).

Page 2: MBWU: Benefit Quantification for Data Access Function Ooading

Is It Worth to Offload? 12

Page 3: MBWU: Benefit Quantification for Data Access Function Ooading

3

Page 4: MBWU: Benefit Quantification for Data Access Function Ooading

4

Cost/Benefit of Offloading

Offloading

...

Host Platform

Page 5: MBWU: Benefit Quantification for Data Access Function Ooading

5

Cost/Benefit of Offloading

Possible Benefits

◉ Data Translation Reduction◉ Data Transmission Size Reduction◉ Software Layer Reduction◉ Power Consumption Reduction◉ Application Performance Increment◉ Resource Utilization Increment◉ ….

Offloading

⬆ storage device cost ⟹ ⬆overall platform cost

...

Compute Platform

Embedded Platform

Page 6: MBWU: Benefit Quantification for Data Access Function Ooading

6

Placement of Data Access FunctionsDifferent storage media, different workloads ⟹ different cost-optimal placements of functions

Slow media

Move data access function close to DRAM to hide latency

Fast media

network

InterconnectCPU/DRAM

Move data access function close to data to save bandwidth cost

CPU/DRAM

Host Platform

Embedded PlatformCompute Platform

Page 7: MBWU: Benefit Quantification for Data Access Function Ooading

7

Placement of Data Access FunctionsDifferent workloads, different storage media ⟹ different cost-optimal placements of functions

Slow media

Move data access function close to DRAM to hide latency

Fast media

I/O fabric

Data access function

Examples: ● GET/PUT in K/V Store● read/write in File System● SELECT/PROJECT in DBMS● H5Sselect in HDF5

Workload: ● data access function calls

Throughput: ● data access function calls per second

(aka ops/sec, IOPS, OPS)

Problem: How to quantify cost/benefit?

InterconnectCPU/DRAM

Move data access function close to data to save bandwidth cost

CPU/DRAM

Host Platform

Embedded PlatformCompute Platform

Page 8: MBWU: Benefit Quantification for Data Access Function Ooading

Measurement Methodology 28

Page 9: MBWU: Benefit Quantification for Data Access Function Ooading

9

Efficiency Comparison for Different Platforms

We need a normalization that is

◉ Platform-independentReference point across host and embedded platformsBased on amount of work measured in workload operations (as opposed to CPU cycles)

Different storage media, different workloads ⟹ different cost-optimal placements of functions

Page 10: MBWU: Benefit Quantification for Data Access Function Ooading

10

Efficiency Comparison for Different Platforms

We need a normalization that is

◉ Platform-independentReference point across host and embedded platformsBased on amount of work measured in workload operations (as opposed to CPU cycles)

◉ Workload-dependentWorkload operations are implemented by the data access function under studyExamples: GET/PUT K/V ops, read/write FS ops, db transactions

◉ Media-dependentCost-optimal placement of data access function sensitive to types of storage mediaExamples: Spinning media (slow), flash media (fast)

Different workloads, different storage media ⟹ different cost-optimal placements of functions

Page 11: MBWU: Benefit Quantification for Data Access Function Ooading

11

WU

Platform A : 8 units of work

Unitof work

WU WUWU WUWU WUWU WU

Efficiency Normalized by Work Performed

Platform B : 3 units of work

Cost efficiency $/WU

Power efficiency kWh/WU

Space efficiency m3/WU

Platform Efficiency

WUWUWU

Platform

Workload

Throughput:

Page 12: MBWU: Benefit Quantification for Data Access Function Ooading

d

12

MBWU

Platform A: 8 meda-based units of work

Media-based unit of work

MBWU MBWUMBWU MBWUMBWU MBWUMBWU MBWU

Efficiency Normalized by Work Performed Limited by Media

Platform B: 3 media-based units of work

Cost efficiency $/MBWU

Power efficiency kWh/MBWU

Space efficiency m3/MBWU

Platform Efficiency

MBWUMBWUMBWU

Platform

Workload

Throughput:

Media

Page 13: MBWU: Benefit Quantification for Data Access Function Ooading

13

How to Construct a MBWU(workload, media)

Construct a MBWU

1. Run workload on platform that is only limited by storage media, with all external caching effects eliminated/disabled

2. Determine maximum steady-state throughput

3. 1 MBWU ← that throughput

● MBWU construction is fully repeatable● Intended for all workloads, storage media● Not: online method during production

workloads

Measure MBWUs of a platform

1. Run workload on platform under study2. Determine maximum steady-state

throughput of platform under study using the same workload

3. Divide throughput by constructed MBWU

Compare platforms

1. Measure MBWUs for each platform2. Determine $, kWh (under workload), volume

of each platform3. Normalize by MBWU:

$/MBWU, kWh/MBWU, m3/MBWU

Page 14: MBWU: Benefit Quantification for Data Access Function Ooading

Example Evaluation 314

Page 15: MBWU: Benefit Quantification for Data Access Function Ooading

15

The Workload

Key-value data management as an example workload to be offloaded.

6x traffic amplification

◉ RocksDB as the key-value store engine

◉ YCSB as the workload generator

Why this workload?

Page 16: MBWU: Benefit Quantification for Data Access Function Ooading

16

The Workload

◉ Key-value data management is a typical high-selectivity data access function.

◉ 6x amplification means more than 5x extra expense on the I/O fabric to support the bandwidth that is not directly relevant to user applications.

◉ There is nothing to prevent the MBWU-based measurement methodology from being applied to other workloads, such as database operations workload.

Page 17: MBWU: Benefit Quantification for Data Access Function Ooading

17

Evaluation Process

Pre-condition Storage Devices

Start RocksDB daemons

Run YCSB load process

Run YCSB transactions

process

Plat Results for Analysis

If Results Are Not in Steady State

Monitor and record utilization of CPU, memory, device I/O, network, and power for the platform during the whole evaluation process.

Page 18: MBWU: Benefit Quantification for Data Access Function Ooading

18

Follow the SNIA Performance Test Specification

Steady state results ensure reproducibility

All these steps have been automated.

Repeatable Evaluation Process

Page 19: MBWU: Benefit Quantification for Data Access Function Ooading

19

RocksDB RMI Server

◉ It exposes all public interfaces (e.g., open(), close(), get(), put(), delete() of a RocksDB object to network securely by binding this object to an RMI registry.

◉ A YCSB process looks up the corresponding RocksDB object from a specified RMI registry.

◉ YCSB passes down I/O operations to the exposed RocksDB insterfaces.

Page 20: MBWU: Benefit Quantification for Data Access Function Ooading

Prototype Evaluation 420

Page 21: MBWU: Benefit Quantification for Data Access Function Ooading

21

Infrastructure Setup

This is our storage media!

Page 22: MBWU: Benefit Quantification for Data Access Function Ooading

22

The Key-value Workload in Experiment

◉ The key size is 16 bytes, and the value size is 4 KiB.

◉ The read/write ratio is 50/50 following a Zipf distribution for data accessing.

◉ The total size of dataset is 40 GiB.

Page 23: MBWU: Benefit Quantification for Data Access Function Ooading

23

The Value of An MBWU

By running the evaluation prototype on our host platform, we got the value of a single MBWU for this workload:

1 MBWU = 7314.6 ops/sec

Now, we can evaluate efficiency of different platforms!

Page 24: MBWU: Benefit Quantification for Data Access Function Ooading

24

Three-stage Test Setup◉ Integrated Tests

Evaluate the benefits of leveraging cost-efficient hardware to provide key-value data store.

◉ Network TestsEvaluate how the introduction of the front-end network affects the benefit results.

◉ Disaggregated TestsEvaluate the benefits of removing the back-end network requirement for data management traffic.

Page 25: MBWU: Benefit Quantification for Data Access Function Ooading

25

Results From Integrated Tests◉ Integrated Tests

Evaluate the benefits of leveraging cost-efficient hardware to provide key-value data store.

◉ Network TestsEvaluate how the introduction of the front-end network affects the benefit results.

◉ Disaggregated TestsEvaluate the benefits of removing the back-end network requirement for data management traffic.

64% less $/MBWU40% less kWh/MBWU

Host Embedded

Our host platform can generate 6 MWBUs.

Our embedded platform can generate 0.5 MBWUs.

Offload

Page 26: MBWU: Benefit Quantification for Data Access Function Ooading

26

Results From Network Tests◉ Integrated Tests

Evaluate the benefits of leveraging cost-efficient hardware to provide key-value data store.

◉ Network TestsEvaluate how the introduction of the front-end network affects the benefit results.

◉ Disaggregated TestsEvaluate the benefits of removing the back-end network requirement for data management traffic.

58% less $/MBWU46% less kWh/MBWU

Our host platform can generate 5.2 MWBUs.

Our embedded platform can generate 0.37 MBWUs.

Offload

Page 27: MBWU: Benefit Quantification for Data Access Function Ooading

27

Results From Disaggregated Tests◉ Integrated Tests

Evaluate the benefits of leveraging cost-efficient hardware to provide key-value data store.

◉ Network TestsEvaluate how the introduction of the front-end network affects the benefit results.

◉ Disaggregated TestsEvaluate the benefits of removing the back-end network requirement for data management traffic.

74% less $/MBWU

Our host platform can generate 3.2 MWBUs.

Our embedded platform can generate 0.37 MBWUs.

Offload

Page 28: MBWU: Benefit Quantification for Data Access Function Ooading

28

Conclusion

The MBWU Measurement Methodology

◉ provides an instruction to answer the following question:

➪ How efficient is a platform to run a given workload over a specific storage media?

◉ apple-to-apple efficiency comparisons for different platforms.

◉ benefits quantification for functions offloading from traditional host platforms to embedded platforms.

Page 29: MBWU: Benefit Quantification for Data Access Function Ooading

29

Conclusion

The Evaluation Prototype

◉ automates the evaluation process for quantifying benefits of offloading customized key-value workloads.

Target users: storage device/systems designers

Page 30: MBWU: Benefit Quantification for Data Access Function Ooading

THANKS!Any questions?

Carlos [email protected]

Cross.ucsc.edu (Eusocial Storage Devices)

30


Recommended