+ All Categories
Home > Documents > 5. SDC EMEA 2018 ZUFS Golander - snia.org File System in User Space ... 5 SW reuse Performance...

5. SDC EMEA 2018 ZUFS Golander - snia.org File System in User Space ... 5 SW reuse Performance...

Date post: 27-Mar-2018
Category:
Upload: dangkhanh
View: 214 times
Download: 1 times
Share this document with a friend
11
ZUFS How to Develop an Efficient PM-based File System in User Space Dr. Golander Amit, NetApp
Transcript

ZUFS How to Develop an Efficient

PM-based File System in User SpaceDr. Golander Amit, NetApp

© 2018 SNIA SDC. All Rights Reserved.

Storage Media Generations

3

PM

PM marries the best of both worlds: + StoragePersistency

MemorySpeed

HDD FLASH

IOPS(even if random…)

Latency(even under load…)

NVDIMM / PM

© 2018 SNIA SDC. All Rights Reserved.

PM Hardware

4

© 2018 SNIA SDC. All Rights Reserved.

PM-based SW Approached

5

SW reuse Performance

Application

SWInfrastructure

HW PM PMPM

© 2018 SNIA SDC. All Rights Reserved.

Background

6

Block wrapper

PM-based FS

Applications

DAX-enabled FS

Storage semantics

Memorysemantics

Block-based FS

Page Cache

bio

SPDKPlexistor (acquired by NetApp)

PM-based FS pioneer since 2013Contributing some of our IP

Our PM-based FS approach:Support legacy applications& Enable NPM (e.g. SPDK)Feature richIntegrate with NetApp product portfolio

e.g. M1FS

PM

© 2018 SNIA SDC. All Rights Reserved.

Kernel Vs. User Space FS Implementation

7

User space

Portable

Resilient (contained)

Simpler to add functionality & Debug

Fewer licensing restrictions

Kernel

Fast (shortest path)

The gap: Near-memory speed Kernel-to-User bridge

Kernel FS

K-U Bridges

© 2018 SNIA SDC. All Rights Reserved.

Why not extend FUSE to PM?FUSE architecture is great for HDDs and ok(ish) for SSDs, but not suitable for PM

8

FlashHDD Memory

FUSE

PM

?RDMATCP

Latency$/GB

FUSE ZUFSTypical medias Built for HDDs & extended to Flash Built for PM/NVDIMMs and DRAM

SW Perf. goals • Secondary (High latency media)• Async I/O Throughput

• SW is the bottleneck • Latency is everything

SW caching Slow media ->Rely on OS Page Cache

Near-memory speed media ->Bypass OS Page Cache

Access method I/O only I/O and mmap (DAX)

Cost of redundant copy / context switch

Negligible The bottleneck ->Avoid copies, queues & remain on core

Latency penalty under load

100s of µs 3-4 µs

Des

ign

Ass

umpt

ions

© 2018 SNIA SDC. All Rights Reserved.

ZUFS Features & Architecture

Low latency & EfficientCore & L1 cache affinityZero data copy

Manages devicesOptimal pmem accessNUMA awareData mover to lower tier devices

Page table mapping supportsI/O & DAX semantics Misc

Async hook availableSystem service

9

© 2018 SNIA SDC. All Rights Reserved.

Preliminary Results (for PM)

10

Measured on Dual socket, XEON 2650v4 (48HT)

DRAM-backed PMEM type

Random 4KB DirectIO write access

© 2018 SNIA SDC. All Rights Reserved.

Summary

ZUFS is a Kernel-to-User bridge designed for PMEnables NetApp solutions

Open Source, being contributed upstreamHope to accelerate PM adoption and innovationhttps://github.com/NetApp/zufs-zus & zufs-zufYou’re welcome to use, review and contribute code

11

© 2018 SNIA SDC. All Rights Reserved.

Thank you

12


Recommended