+ All Categories
Home > Documents > Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing...

Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing...

Date post: 17-Sep-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
37
Caching less for better performance: Balancing cache size and update cost of flash memory cache in hybrid storage systems 1 Yongseok Oh Jongmoo Choi Donghee Lee Sam H. Noh Hongik University [email protected] University of Seoul {ysoh,dhl_express}@uos.ac.kr Dankook University [email protected]
Transcript
Page 1: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

Caching less for better performance: Balancing cache size and update cost

of flash memory cache in hybrid storage systems"

1

Yongseok Oh" Jongmoo Choi! Donghee Lee! Sam H. Noh!

Hongik [email protected]

University of Seoul!{ysoh,dhl_express}@uos.ac.kr

Dankook [email protected]

Page 2: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Harness benefits of SSDs and HDDs!§  High performance, large capacity, affordable cost!

•  SSDs used as flash cache (NVCache)!§  Seagate Momentus XT(SLC 4GB), OCZ RevoDrive Hybrid (MLC 100GB)!

•  Our focus: issue of managing flash cache!

2

Hybrid Storage Systems

+High performance!Low power consumption

Large Capacity!Low cost!!

SSD HDD Hybrid Storage System!

Page 3: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Maintain Over-Provisioned Space (OPS)!§  Reserved space for Garbage Collection (GC)!§  Greatly influence GC performance!

•  Typical SSDs!§  OPS size is fixed!§  Optimal size is unknown!§  Cannot adapt to workload changes!

3

Important Characteristics of Flash based SSDs

Flash based SSD!

Caching Space" OPS"

Fixed Size OPS!

Page 4: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) ! 4

Our Goal: Find Optimal OPS Size

Less!

Perfo

rman

ce!

Overall cost!(e.g., response time of hybrid storage)!

GC cost!

Cache miss rate!

Over-Provisioned Space More

Point of Optimal Performance!(Our Goal)!

Caching Space" OPS"Caching

Space" OPS"

Page 5: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Periodically adjust OPS size to maximize the performance!§  Based on hit ratio and garbage collection cost!

•  Question: how to find optimal OPS size?!§  Solution: Hybrid Storage Cost Model (Dynamically adjusted

according to workload)!5

Workload Dependent Optimal Partitioning

Proposed Hybrid Storage!

HDD"

Flash Cache

Workload Dependent Optimal Partitioning!

OPS"Caching Space"

Idea"

OPS"Cachi

ngSpace"

Flash Cache

Perf.!

Hit!

Caching Space Size!

Optimal OPS size changes!

OPS"Caching Space"

Flash Cache

Perf.!

Hit!

Caching Space Size!

Page 6: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Introduction!

•  Hybrid Cost Model!

•  Implementation!

•  Evaluation!

•  Conclusion!

6

Outline

Page 7: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) ! 7

OS 101: Access Cost Model (ACM)

CACM = Hit Rate x Cache Cost + (1-Hit Rate) x Miss Penalty!

HDD"

Storage Hierarchy!

Hit!

Request!

Miss!

Performance!HDD"

Buffer Cache Capacity!

Expected I/O cost!

Page 8: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) ! 8

Hybrid Storage: Access Cost Model

CACM(u)= Hit Rate(u) x Flash Cache Cost(u) + (1-Hit Rate(u)) x Miss Penalty(u)!

Hit!Request!

Miss!

HDD"

•  CACM(u) represents expected I/O cost based on u!§  Incorporating u into the access cost model!

•  Flash Cache is divided based on u (tunable)!§  u is fraction of caching space in flash cache (e.g., 0 ≤ u ≤ 1.0)!§  u influences hit ratio and access cost of flash cache!

Caching Space OPS

u! 1-u!Flash Cache!

Page 9: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Hybrid cost model represents expected I/O cost!§  Combines hybrid read cost model and hybrid write cost model!§  Caching space divided into read and write spaces!

•  For this talk we derive hybrid read cost model !

9

Overview of Hybrid Cost Model

Read OPS

HDD"

Hybrid Read Cost Model!Read OPS Write!

HDD"

Hybrid Cost Model!

Hybrid Read Cost Model!

Hybrid Write Cost Model!

Flash Cache!

Page 10: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Hybrid cost model represents expected I/O cost!§  Combines hybrid read cost model and hybrid write cost model!§  Caching space divided into read and write spaces!

•  For this talk we derive hybrid read cost model !

10

Overview of Hybrid Cost Model

Read OPS

HDD"

Hybrid Read Cost Model!Read OPS Write!

HDD"

Hybrid Cost Model!

Hybrid Read Cost Model!

Read OPS

HDD"

Hybrid Cost Model!

Hybrid Read Cost Model!

Flash Cache!

Page 11: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) ! 11

OPS Aware Hybrid Read Cost Model

CHR(u) = Hit Rate(u) x Flash Cache Cost(u) + (1-Hit Rate(u)) x Miss Penalty(u)!

Read Hit Rate!

•  Requirements for derivation!§  Read Hit Rate Function!§  HDD Cost Model!§  Flash Cache Cost Model!

Flash Read!Hit!

HDD"

Read Cache OPS

u! 1-u!

Maintain read data from HDD! Flash Cache!

HDD Read + Flash Write!Miss!

Page 12: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Read Hit Rate Function!

•  HDD Cost Model!

•  Flash Cache Cost Model!

•  Finding Optimal Point!

12

Hybrid Read Cost Model

CHR(u) = Hit Rate(u) x Flash Cache Cost(u) + (1-Hit Rate(u)) x Miss Penalty(u)!

Flash Cache Read! HDD Read + Flash Cache Write!Read Hit Rate!

Page 13: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) ! 13

Read Hit Rate Function

Read Space (%)!

Rea

d H

it R

ate!

Read OPS

Read Hit! Read Miss!

HR(45%)

•  Read Hit rate function: HR(u), miss rate: 1-HR(u)!§  Related to workload pattern!

§  Depends on u!

u!

Hit

Rat

e!

HR(0.1) Read

Cache OPS

Hit! Miss!

u=0.1!

Flash Cache!

Low!

u H

it R

ate

HR(0.9) Read Cache OPS

Hit! Miss!

u=0.9!

Flash Cache!

High!

Page 14: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Read Hit Rate Function!

•  HDD Cost Model!

•  Flash Cache Cost Model!

•  Finding Optimal Point!

14

Hybrid Read Cost Model

CHR(u) = Hit Rate(u) x Flash Cache Cost(u) + (1-Hit Rate(u)) x Miss Penalty(u)!

Flash Cache Read! HDD Read + Flash Cache Write!HR(u)!

Page 15: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  HDD I/O requires positioning cost + bus transfer cost [Hylog]!§  HDD Read: CDR = CD_RPOS + P/B!

§  HDD Write: CDW = CD_WPOS + P/B!

•  Independent from u!

HDD"

Read Cache OPS

u! 1-u!

Hit! Miss!

15

HDD Cost Model

Notation Description CD_RPOS Read positioning Cost CD_WPOS Write positioning Cost

P Page size (in bytes) B Bandwidth

Read

Page 16: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Read Hit Rate Function!

•  HDD Cost Model!

•  Flash Cache Cost Model!§  Read Cost Model!§  Write Cost Model!

•  Finding Optimal Point!

16

Hybrid Read Cost Model

CHR(u) = Hit Rate(u) x Flash Cache Cost(u) + (1-Hit Rate(u)) x Miss Penalty(u)!

Flash Cache Read! CDR + Flash Cache Write!HR(u)!

Page 17: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Hit request requires flash page read: CPR !§  Near constant cost (e.g., 25us)!

§  Regardless of garbage collection cost!§  Independent from u!

Flash Cache Read Cost Model

17

Read Cache OPS

u! 1-u!

Hit! Miss!Flash Read!

HDD"

Page 18: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Read Hit Rate Function!

•  HDD Cost Model!

•  Flash Cache Cost Model!§  Read Cost Model!§  Write Cost Model!

•  Finding Optimal Point!

18

Hybrid Read Cost Model

CHR(u) = Hit Rate(u) x Flash Cache Cost(u) + (1-Hit Rate(u)) x Miss Penalty(u)!

CPR! CDR + Flash Cache Write!HR(u)!

Page 19: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Miss request requires flash page write: CPW(u)!§  Write cost + GC cost(u)!§  GC cost(u) varies depending on u [LFS, Janus-FTL]!§  As u increases, GC cost(u) increases CPW(u) increases!

Flash Cache Write Cost Model

19

Flash Cache"

Caching Space" OPS"

CPW(u)!CPR!

u 1-u

Read Cache OPS

u! 1-u!

Hit! Miss!Flash Write!

HDD"

CPW (u) =CGC (u)(1−u) ⋅NP

+CPROG

CGC (u) = u ⋅NNP ⋅CCP +CE

Detailed Derivation!

See the paper for derivation!

Page 20: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Read Hit Rate Function!

•  HDD Cost Model!

•  Flash Cache Cost Model!

•  Finding Optimal Point!

20

Hybrid Read Cost Model

CHR(u) = Hit Rate(u) x Flash Cache Cost(u) + (1-Hit Rate(u)) x Miss Penalty(u)!

CHR(u) = HR(u) * CPR + (1-HR(u)) * (CDR+CPW(u))!Derive!

Page 21: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

(b) Read access cost

3. Find Optimal Point!

Optimal Point"u=0.92"

21

Finding Optimal Point

(a) Read hit ratio

1. Observe Hit Ratio!

2. Calculate for all values of u !

CHR(u) = HR(u) * CPR + (1-HR(u)) * (CDR+CPW(u))!Partition based on optimal u = 0.92!

Flash Cache: e.g., 4GB

OPS0.32GB

Caching Space 3.68GB

4. Adjust!

Page 22: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  CHY(u, r) represents expected I/O cost based on u and r !§  Caching space divided into read and write spaces based on r !§  r is fraction of read space in caching space (e.g., 0 ≤ r ≤ 1.0)!§  Modification: CHR(u) CHR(u, r), CHW(u) CHW(u, r)!

•  Used to find optimal values: u and r !

22

Hybrid Cost Model: Distinguishing Read and Write

Read OPS

HDD"

Hybrid Read Cost Model!

HDD"

Hybrid Cost Model: CHY(u, r)!

Hybrid Read Cost Model: CHR(u, r)!

Hybrid Write Cost Model: CHW(u, r)!

Read OPS Write r 1-r

u 1-u

See the paper for derivation!

Page 23: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) ! 23

Calculate Hybrid Cost Model

Optimal Point!

CHR (u, r) = HR (u, r) ⋅CPR + (1−HR (u, r)) ⋅ (CDR +CPW (u))CHW (u, r) = HW (u, r) ⋅CWH + (1−HW (u, r)) ⋅ (CPR +CDW +CPW (u))

WHWRHRHY IOruCIOruCruC ⋅+⋅= ),(),(),(Hybrid Cost:!

Hybrid Read Cost:!

Hybrid Write Cost:!

2. Calculate based on u and r!

(a) Read hit ratio (b) Write hit ratio

1. Observe Hit Ratio! 3. Draw Access Cost Graph!

(c)Expected access cost

Better!

Page 24: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) ! 24

Optimal Partitioning Algorithm with Hybrid Cost Model

op_u! op_r!

•  Periodically Execute Optimal Partitioning Algorithm !

320 FAST ’12: 10th USENIX Conference on File and Storage Technologies USENIX Association

Algorithm 1 Optimal Partitioning Algorithm

1: procedure OPTIMAL PARTITIONING

2: step! segment size/total cache size

3: INIT PARMS(op cost, op u, op r)

4: for u! step; u < 1.0; u! u+ step do

5: for r! 0.0; r " 1.0; r! r+ step do

6: cur cost!CHY

(u, r) ! Call Eq. 6

7: if cur cost < op cost then

8: op cost! cur cost

9: op u! u, op r! r

10: end if

11: end for

12: end for

13: ADJUST CACHE SIZE(op u, op r)

14: end procedure

new period starts. Then, with the hit rate curves gen-erated by the Workload Tracker in the previous period,the Partition Resizer gradually adjusts the sizes of the

three spaces, that is, the read and write cache space andthe OPS for the next period. To make the adjustment,

the Partition Resizer determines the optimal u and r asdescribed in Section 4, and those optimal values in turn

decide the optimal size of the three spaces.

To obtain the optimal u and r, we devise an iterative al-

gorithm presented in Algorithm 1. Starting from u=step,the outer loop iterates the inner loop increasing u in ‘step’

increments while u is less than 1.0. The two extremeconfigurations that we do not consider are where OPS is

0% and 100%. These are unrealistic configurations asOPS must be greater than 0% to perform garbage collec-tion, while OPS being 100% would mean that there is no

space to cache data. The inner loop starting from r=0iterates, calculating the access cost of the hybrid stor-

age system as derived in Equation 6, while increasing r

in ‘step’ increments until r becomes greater or equal to

1.0. The ‘step’ value can be calculated as the segmentsize divided by the total cache size, as shown in the sec-

ond line of Algorithm 1. The nested loop iterates N#M

times to calculate the costs, where N is the outer loop

count, 1/step-1, and M is the inner loop count, 1/step+1.A single cost calculation consists of 10 ADD, 4 SUB, 11MUL, and 4 DIV operations. Finer ‘step’ values may be

used resulting in finer u and r values, but with increasedcost calculation overhead. However, computational over-

head for executing this algorithm is quite small becausethey run once every period and the calculations are just

simple arithmetic operations.

Once the optimal u and r and, in turn, the optimal sizes

are determined, the Partition Resizer starts to progres-sively adjust the sizes of the three spaces. To increase

OPS size, it gradually evicts data in the read or writecaches. To increase cache space, that is, decrease OPS,

GC is performed to produce empty blocks. These emptyblocks are then used by the read and/or write caches.

The key role of our Mapping Manager is translating

the logical address to a physical location in the flashcache layer. For this purpose, it maintains a mapping ta-

ble that keeps the translation information. In our imple-mentation, we keep the mapping information at the last

page of each block. As we consider flash memory blockswith 64 pages, the overhead is roughly 1.6%. Moreover,we implement a crash recovery mechanism similar to

that of LFS [27]. If a power failure occurs, it searchesfor the most up-to-date checkpoint and goes through a

recovery procedure to return to the checkpoint state.

6 Performance Evaluation

In this section, we evaluate OP-FCL. For comparison, wealso implement two other schemes. The first is the Fixed

Partition-Flash Cache Layer (FP-FCL) scheme. This isthe simplest scheme where the read and write cache isnot distinguished, but unified as a single cache. The OPS

is available with a fixed size. This scheme is used tomimic a typical SSD of today that may serve as a cache

in a hybrid storage system. Normally, the SSD would notdistinguish read and write spaces and it would have some

OPS, whose size would be unknown. We evaluate thisscheme as we vary the percentage of the caching space

set aside for the (unified) cache. The best of these resultswill represent the most optimistic situation in real life

deployment.

The other scheme is the Read and Write-Flash Cache

Layer (RW-FCL) scheme. This scheme is in line with theobservation made by Kgil et al. [11] in that read and write

caches are distinguished. This scheme, however, goes astep further in that while the sum of the two cache sizes

remain constant, the size between the two are dynami-cally adjusted for best performance according to the costmodels described in Section 4. For this scheme, the OPS

size would also be fixed as the total read and write cachesize is fixed. We evaluate this scheme as we vary the per-

centage of the caching space set aside for the combinedread and write cache. Initial, all three schemes start with

an empty data cache. For OP-FCL, the initial OPS sizeis set to 5% of the total flash memory size.

The experiments are conducted using two sets oftraces. We categorize them based on the size of requests.

The first one, ‘Small Scale’, are workloads that requestless than 100GBs of total data. The other set, ‘Large

Scale’, are workloads with over 100GBs of data requests.Details of the characteristics of these workloads are in

Table 1.

The first two subsections discuss the performance as-pects of the two class of workloads. Then, in the next

8

Read 0.64GB

OPS1.44GB

Write 1.92GB!

Flash Cache: e.g., 4GB

op_u = 0.64!

op_r = 0.25!

0.36!

0.75!

•  Adjust Flash Cache partition!

Optimal Pointat op_u=0.64,

op_r=0.25!

•  Find u and r resulting in Optimal I/O Cost !

u!

r!

Page 25: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Introduction!

•  Hybrid Cost Model!

•  Implementation!

•  Evaluation!

•  Conclusion!

25

Outline

Page 26: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) ! 26

Optimal Partitioning Flash Cache Layer (OP-FCL)

HDD Flash Cache

I/O request arrives

Page Replacer Cache Miss!

Cache Hit!

Read LRU!

Write LRU!

Mapping Manager"Translation Table

Logical to Physical!

If identify,!go to HDD !

Seq. I/O Detector"

If non-seq. I/O,!go to Flash Cache!

Workload Tracker"Hit!

Curves!

Partition Resizer Workload

Dependent Optimal

Partitioning!

Periodically Execute!

Shrink!Enlarge!

Page 27: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) ! 27

Adapt to Workload Pattern

Read OPS Write

u!

Rea

d H

it!

u!W

rite

Hit!

Flash Cache!

Invalidate!

Destage!

u!

Rea

d H

it!

u!

Writ

e H

it!Workload changes!

Resize!

Enlarge!

u!

Rea

d H

it!

u!

Writ

e H

it!

Workload changes!

Resize!

Page 28: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Introduction!

•  Hybrid Cost Model!

•  Implementation!

•  Evaluation!

•  Conclusion!

28

Outline

Page 29: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Hybrid Storage Simulator!§  CMU DiskSim 4.0 and MSR SSD extension!

•  Flash Cache Layers (FCLs)!§  Fixed Partitioning (FP-FCL) - Fixed size OPS

- Typical SSD product !§  Read Write (RW-FCL) - Fixed size OPS

- Distinguishes read and write!§  Optimal Partitioning (OP-FCL) – Dynamically adjusted

! ! based on workload !

•  Configurations !§  Config. 1: 4GB flash cache + 10K RPM HDD!§  Config. 2: 16GB flash cache + three 10K RPM HDDs!

29

Evaluation Setup

Page 30: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

Financial [UMass] with Config. 1!§  Random write dominant!§  OLTP application running at a financial institutions !

Search Engine [UMass] with Config. 1!§  Random read dominant!§  Web search engine !

Exchange [SNIA] with Config. 2!§  Random read/write mixed!§  Microsoft employee e-mail server!

Home [FIU] with Config. 1!§  Development, testing, and plotting in NFS Server!

MSN [SNIA] with Config. 2!§  MSN storage back-end file store!

30

Workload Traces

Page 31: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) ! 31

Response Time Results

0 0.2 0.4 0.6 0.8

1 1.2

0 20 40 60 80 100

Mea

n R

esp.

Tim

e (m

s)

Caching Space (%) in SSD

FP-FCLRW-FCLOP-FCL

0 2 4 6 8

10 12 14

0 20 40 60 80 100

Mea

n R

esp.

Tim

e (m

s)

Caching Space (%) in SSD

FP-FCLRW-FCLOP-FCL

0 2 4 6 8

10 12 14

0 20 40 60 80 100

Mea

n R

esp.

Tim

e (m

s)

Caching Space (%) in SSD

FP-FCLRW-FCLOP-FCL

(a) Financial! (b) Search Engine! (c) Exchange!

•  OP-FCL shows near-optimal performance!

•  Optimal performance depends on workload characteristics!

Page 32: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) ! 32

Response Time Results

0 0.2 0.4 0.6 0.8

1 1.2

0 20 40 60 80 100

Mea

n R

esp.

Tim

e (m

s)

Caching Space (%) in SSD

FP-FCLRW-FCLOP-FCL

0 2 4 6 8

10 12 14

0 20 40 60 80 100

Mea

n R

esp.

Tim

e (m

s)

Caching Space (%) in SSD

FP-FCLRW-FCLOP-FCL

0 2 4 6 8

10 12 14

0 20 40 60 80 100

Mea

n R

esp.

Tim

e (m

s)

Caching Space (%) in SSD

FP-FCLRW-FCLOP-FCL

(a) Financial! (b) Search Engine! (c) Exchange!

•  OP-FCL shows near-optimal performance!

•  Optimal performance depends on workload characteristics!

50%! 95%! 70%!

Page 33: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  OP-FCL dynamically adjusts cache spaces according to workloads !

•  Financial and Exchange!§  Considerable OPS is used to lower garbage collection cost"

•  Search Engine!§  Most caching space is used to maintain read data!

33

Dynamic Adjustment

0

1

2

3

4

Cac

he S

ize

(GB)

Logical Time 0

1

2

3

4

Cac

he S

ize

(GB)

Logical Time 0 2 4 6 8

10 12 14 16

Cac

he S

ize

(GB)

Logical Time(a) Financial! (b) Search Engine! (c) Exchange!

OPS!

Write!

Read!

OPS!

Read!

OPS!

Write!Read!

Page 34: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  OP-FCL dynamically adjusts cache spaces according to workloads !

•  Financial and Exchange!§  Considerable OPS is used to lower garbage collection cost!

•  Search Engine!§  Most caching space is used to maintain read data"

34

Dynamic Adjustment

0

1

2

3

4

Cac

he S

ize

(GB)

Logical Time 0

1

2

3

4

Cac

he S

ize

(GB)

Logical Time 0 2 4 6 8

10 12 14 16

Cac

he S

ize

(GB)

Logical Time(a) Financial! (b) Search Engine! (c) Exchange!

OPS!

Write!

Read!

OPS!

Read!

OPS!

Write!Read!

Page 35: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) ! 35

Effect on Lifetime of Flash Cache

0 2 4 6 8

10 12 14

0 20 40 60 80 100

Aver

age

Eras

e C

ount

Caching Space (%) in SSD

FP-FCLRW-FCLOP-FCL

0

1

2

3

4

5

0 20 40 60 80 100Av

erag

e Er

ase

Cou

ntCaching Space (%) in SSD

FP-FCLRW-FCLOP-FCL

0 20 40 60 80

100 120

0 20 40 60 80 100

Aver

age

Eras

e C

ount

Caching Space (%) in SSD

FP-FCLRW-FCLOP-FCL

(a) Financial! (b) Search Engine! (c) Exchange!

•  Lifetime of flash cache is an important issue!•  Optimal point of lifetime differs from that of performance!

•  Our focus is to improve the performance of flash cache !

•  Optimizing lifetime of flash cache left as future work!

Lifetime Optimal!

Performance Optimal!

Page 36: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) !

•  Trade-off exists!§  Caching benefit vs update cost !

•  We proposed OP-FCL for Hybrid Storage Systems !§  Use workload dependent cost model!§  Adjust read, write, and OPS sizes based on proposed cost model!§  Show near-optimal performance compared to others !

•  Future direction!§  Develop better destaging and replacement algorithm!§  Make SSD lifetime aware hybrid storage system!

36

Conclusion

Page 37: Caching less for better performance: Balancing cache size and … · 2019. 2. 25. · Balancing cache size and update cost of flash memory cache in hybrid storage systems" 1 Yongseok

10th USENIX Conference on File and Storage Technologies (FAST’12) ! 37

Yongseok Oh! Jongmoo Choi! Donghee Lee! Sam H. Noh!

Hongik [email protected]

University of Seoul!{ysoh,dhl_express}@uos.ac.kr

Dankook [email protected]

Caching less for better performance: Balancing cache size and update cost of flash memory cache in hybrid storage systems


Recommended