Native pNFS Client for Windows HPC Server 2008 (citi.umich.edu/projects/nfsv4/windows/SC09talk.pdf)

Date post: 13-Jul-2020
Transcript
Page 1:

Native pNFS Client for Windows HPC Server 2008

Page 2:

NFS origins
• NFSv2 and NFSv3
  – Proprietary (Sun Microsystems) client/server protocols for distributed filing
  – “Open system”
    • Protocol published
    • Interoperability promoted
  – Stateless, usually UDP-based
  – Other protocols for mount, locks, quotas

Page 3:

NFSv4
• Control ceded to IETF
• Goals
  – Internet performance
  – Interoperability, internationalization
  – Security, reliability, availability
  – Extensible
• Stateful protocol
  – Open, locks, oplocks, secure channels, callbacks
• RFC 3010, December 2000; RFC 3530, April 2003

Page 4:

NFSv4 and HPC

[Figure: HPC system architecture from ASCI Technology Prospectus, July 2001. Labeled components: FS nodes, compute nodes, visualization cluster, capability platform, studio displays, system data and control networks, login net, NFS, internal users (NFS, etc), gateway, archive, Infiniband™ I/O network.]

Page 5:

Single server bottleneck

[Figure: many clients (Client, Client, …, Client) all connected to one Server.]
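A rough way to see the bottleneck is to bound each client's throughput by its share of the one server link. The sketch below is an idealized model (fair sharing, no protocol overhead); the function name and the bandwidth figures are hypothetical and do not come from the talk.

```python
def per_client_bandwidth(num_clients, server_link, client_link):
    """Upper bound on one client's throughput when every client shares one server.

    All bandwidths use the same (arbitrary) unit, for example MB/s.
    Idealized: the server link is shared evenly and nothing else limits I/O.
    """
    if num_clients <= 0:
        raise ValueError("num_clients must be positive")
    # A client is limited by its own link or by its share of the server link.
    return min(client_link, server_link / num_clients)


# Hypothetical numbers: a 1000 MB/s server link and 100 MB/s client links.
for n in (1, 10, 100, 1000):
    print(n, per_client_bandwidth(n, server_link=1000, client_link=100))
```

In this model the per-client bound starts to fall once the number of clients exceeds the ratio of server link to client link, which is the situation that parallel access to multiple storage nodes is meant to avoid.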

Page 6:

Meeting HPC storage needs
• HPC demands for scalable storage are met through innovative, proprietary, non-interoperable solutions
  – Lustre, GPFS, PanFS, PVFS2 dominate
  – Investment in a proprietary solution has high risk of lock-in or loss
• pNFS insulates storage architects from these risks
  – Neutral ground through standardization
  – Continues to admit vendor innovation
  – Pools customer investment
  – Spreads investment across more vendors

Page 7:

Parallel file systems

Asymmetric
• Direct access to storage
• Separate metadata servers
• File, object, or block access

Symmetric
• Direct access to storage
• Each node is a fully capable client and metadata server
• File access

Page 8:

NFS advantages and obstacles

[Figure: NFS clients, with the labels Performance, Scalability, Security, Heterogeneity, Transparency.]

Page 9:

pNFS worldview
• pNFS extends NFSv4
  – parallel, multi-path transfers
  – complex topologies
• A layout associates a file with a device ID
• LAYOUTGET returns the device ID for a given file
  – The handle for a specific storage device topology
• GETDEVICEINFO returns the storage device topology for a given device ID
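The lookup chain on this slide (file to layout, layout to device ID, device ID to topology) can be sketched with a toy in-memory model. This is not the NFSv4.1 wire protocol or any real client API: the class `ToyMetadataServer`, its methods, and the file handle, device ID, and server names are all hypothetical, chosen only to mirror the two operations named above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Layout:
    """Associates a file with a device ID (simplified, hypothetical)."""
    file_handle: str
    device_id: int


@dataclass(frozen=True)
class DeviceInfo:
    """Storage device topology behind one device ID (hypothetical)."""
    data_servers: tuple


class ToyMetadataServer:
    """In-memory stand-in for a pNFS metadata server."""

    def __init__(self):
        self._layouts = {}  # file handle -> device ID
        self._devices = {}  # device ID -> DeviceInfo

    def add_file(self, file_handle, device_id, data_servers):
        self._layouts[file_handle] = device_id
        self._devices[device_id] = DeviceInfo(tuple(data_servers))

    def layoutget(self, file_handle):
        # Mirrors LAYOUTGET: return the device ID for a given file.
        return Layout(file_handle, self._layouts[file_handle])

    def getdeviceinfo(self, device_id):
        # Mirrors GETDEVICEINFO: return the topology for a device ID.
        return self._devices[device_id]


def resolve_data_servers(mds, file_handle):
    """Client side: file -> layout -> device ID -> storage topology."""
    layout = mds.layoutget(file_handle)
    return mds.getdeviceinfo(layout.device_id).data_servers


mds = ToyMetadataServer()
mds.add_file("fh-example", device_id=7,
             data_servers=["ds1.example", "ds2.example"])
print(resolve_data_servers(mds, "fh-example"))
```

Keeping the device ID as an indirection means many layouts can point at one topology, so a client could plausibly cache the result of `getdeviceinfo` and reuse it across files.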

Page 10:

pNFS 

Page 11:

pNFS I/O in Windows

[Figure: Windows pNFS client components, split across kernel and user space: cache manager, RDBSS-based mini-RDR driver, callback handler, NFS ops, layout manager, session manager, credentials manager, RPC/XDR, Net.]

Page 12:

The path to pNFS in Linux
• Convince people it will work
• Get on the IETF agenda
• Draft a protocol standard
  – Make sure it addresses HPC issues
• Convince people to fund implementation
• Gather frequently to test interoperability
• Convince Linux maintainers to accept patches
• Convince Linux distributions to support pNFS

[Slide overlay: the first five items, through “Gather frequently to test interoperability”, are marked on the slide.]

Page 13:

Building the Case

[Figure: four panels titled Write one file, Write files, Read files, Read one file.]

Page 14:

Roadmap
• Standardization
  – Target: 2008
• Implementation
  – Target: 2009
• Distribution
  – Target: 2010 (HPC, other early adopters)
• Enterprise distribution
  – Target: 2011

Page 15:

Implementations
• CITI, Sun, StorSpeed, Seagate, Panasas, Ohio Supercomputing Center, NetApp, LSI, IBM, EMC, Carnegie Mellon, DESY, BlueArc
• Frequent interoperability testing
  – Connectathon, Bake-a-thons
• Functionally correct and interoperable
  – Linux, Solaris clients
  – DESY, EMC, IBM, Linux, NetApp, Panasas, Solaris servers

Page 16:

Windows client status
• Passing most “Basic Connectathon” tests
  – Interoperability testing began at last Bake-a-thon
• Layout implementation begins in 4Q09
• Open source distribution (more hands, more eyes) to begin 1H10
• Functional completeness 4Q10
  – With continuing development and tuning by open source developers

Page 17:

Linux implementation status
• Maintainers work with developers to engineer kernel patches
  – Linux kernel version increments approximately quarterly
  – Ultimately Linus Torvalds decides
• NFSv4.1 is more than pNFS
  – Sessions communication layer, required for pNFS
  – Directory delegation
• Client and server sessions fore and back channels are in the Linux 2.6.32 kernel

Page 18:

There is much left to do
• Administration tools
  – Metadata server management
  – Volume management
• Performance at scale
  – Instrumentation, measurement, tuning
  – Small-scale file striping performance under way at CITI (fewer than 20 nodes)
• Metadata striping
• Windows HPC Server 2008 metadata server
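The file striping mentioned above spreads a file's bytes across several storage nodes. A minimal sketch of the arithmetic, assuming plain round-robin striping with a fixed stripe unit; the function name and parameters are hypothetical and are not taken from the talk or from any pNFS implementation.

```python
def locate_stripe(offset, stripe_unit, num_servers):
    """Map a file byte offset to (server index, byte offset on that server).

    Assumes round-robin striping: stripe unit 0 goes to server 0,
    unit 1 to server 1, and so on, wrapping around after the last server.
    """
    if offset < 0 or stripe_unit <= 0 or num_servers <= 0:
        raise ValueError("offset must be >= 0; stripe_unit and num_servers must be > 0")
    stripe_number = offset // stripe_unit
    server_index = stripe_number % num_servers
    # Full stripe units already stored on this server, plus the position
    # inside the current unit.
    server_offset = (stripe_number // num_servers) * stripe_unit + offset % stripe_unit
    return server_index, server_offset


# Hypothetical layout: 4-byte stripe units over 3 servers.
print(locate_stripe(13, stripe_unit=4, num_servers=3))  # (0, 5)
```

With this mapping, consecutive stripe units land on different servers, so a large sequential read can be issued to several servers at once.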

