+ All Categories
Home > Documents > datacenter networking - NANOG Archive · datacenter networking david swafford network engineer...

datacenter networking - NANOG Archive · datacenter networking david swafford network engineer...

Date post: 26-Jun-2020
Category:
Upload: others
View: 11 times
Download: 0 times
Share this document with a friend
44
datacenter networking david swafford network engineer [email protected] 8-OCT-2013 – NANOG 59
Transcript
Page 1: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

datacenter networking

david swafford network engineer

[email protected] 8-OCT-2013 – NANOG 59

Page 2: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

1.15B people (MAUs)

350M+ photos uploaded per day

(on average in Q4 2012)

+ 7 PB each month for photos alone

(as of Oct. 2012)

Source: Facebook internal data, June 2013

Page 3: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

traffic growth

Page 4: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

clusters

a unit of compute

Page 5: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

our clusters

web

photos

messaging

news feed

advertising

cache database

Page 6: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

1st generation clusters

cluster switch

backbone

rack switch

switched uplinks

n x 1Gb / first-gen 10Gb

cluster switch

Layer 2

Layer 3

MPLS domain

servers 1Gb servers servers

Page 7: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

early challenges

usable capacity

Layer 2 scaling

Page 8: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

2nd generation

cluster switch

backbone

rack switch

routed uplinks

10Gb fiber

cluster switch

Layer 2

Layer 3

MPLS domain

cluster switch

cluster switch

servers 10Gb copper servers servers

Page 9: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

BGP to the rack

control

scale

no IGP

Page 10: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

backbone

Internet lots of intra-cluster traffic!

web photos

cluster switch

servers servers

servers servers servers

servers

cluster switch

rack switch

cluster switch

cluster switch

cluster switch

cluster switch

rack switch

cluster switch

cluster switch

Page 11: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

the primary role of our backbone

connects datacenters

private / transit peering

Page 12: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

growing pains

backbone devices are too

powerful for intra-DC needs

Page 13: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

looking for a better way to scale…

Page 14: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

evolving

cluster switch

rack switch

cluster switch

web

cluster switch

cluster switch

servers

cluster switch

rack switch

cluster switch

cluster switch

cluster switch

servers

backend datacenter network

backbone

news feed

servers servers

servers servers

Page 15: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

it works, why change?

chassis break in obscure ways

efficiency

Page 16: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

3rd generation

fabric switch

rack switch

fabric switch

fabric switch

fabric switch

servers

fabric switch

fabric switch

fabric switch

fabric switch

servers

servers

Folded Clos datacenter-wide fabric

Page 17: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

3rd generation

spine switch

spine switch

spine switch

spine switch

spine switch

spine switch

spine switch

spine switch

spin

e

fabric switch

rack

fabric switch

fabric switch

fabric switch

servers

fabric switch

fabric switch

fabric switch

fabric switch

serv

er

po

d

Page 18: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

3rd generation

spine switch

spine switch

spine switch

spine switch

spine switch

spine switch

spine switch

spine switch

spin

e

rack servers

serv

er

po

d

rack servers

serv

er

po

d

Page 19: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

pod

pod pod

pod

spine

spine

pla

ne

scaling to a full datacenter

Page 20: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

managing everything

Page 21: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

approaching networking from a software mindset

configuration

auditing alerting

remediating

C Programing

Page 22: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

frees up engineers to make greater impact

learning

automating

sharing

Page 23: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

deploying a cluster switch

step engineer computer

planning the port map

physical installation

generating configuration

applying

validating

enabling for live traffic

Page 24: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

FBAR

engineers create audits

and remediation scripts

audits trigger alarms

FBAR reacts to alarms

Page 25: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

IX peering dropped BGP

session

syslog alarm

attempt remediation,

still down? No

Yes

discard

escalate

For more information, see “Making Facebook Self-Healing” https://www.facebook.com/note.php?note_id=10150275248698920

Page 26: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

understanding the black box

SWITCH ASIC

PHY CTLR

SFP

SFP

SFP

SFP

CP

CPU

L3 Tables

L2 Tables

FABRIC INTERCONNECT

PHY CTLR

SFP

SFP

SFP

SFP

PHY CTLR

SFP

SFP

SFP

SFP

Page 27: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

troubleshooting the black box

when a line-card goes crazy

how do you even know about it?

never trust the box!

Page 28: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

monitoring everything

all links / BGP sessions

FIBs

TCP retransmit stats

Page 29: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

a culture of automation

filter noise in software

automate the repetitive

engineers focus on real problems

Page 30: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

our rack switches

Page 31: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

the problem

install and forget

configuration drift

inconsistent IPv6 support

Page 32: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

IPv6 everywhere! for real!

every rack in 2013

all services in early 2014

why?

•  IPv4 won’t last forever

•  Band-Aids are not fun to troubleshoot

•  and it’s really cool !

2a03:2880:2110:df07:face:b00c:0:1

Page 33: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

rolling out IPv6

dual-stacked backbone

and cluster switches

rack switch upgrades

service migration

Page 34: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

the old way to upgrade a cluster

coordinate an outage window

with affected service owners

drain traffic

upgrade

Page 35: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

the difficulty – lots of service owners

web

photos

messaging

news feed

advertising

cache database

Page 36: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

we needed a change….

why drain?

why a single window?

why is NetEng so heavily involved?

Page 37: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

looking at an early attempt

blacklist racks by hostnames,

upgrade the rest

Page 38: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

how we solved it

dedicated racks? •  shift the responsibility to the service owner

shared racks? •  schedule every rack under full automation

Page 39: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

dedicated racks

shift the responsibility?

•  real world – less time on all sides!

•  empowered service owners – moved from user to customer

•  we’re friends now!

Page 40: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

focused on a smooth user experience

single button upgrades

detailed reporting from the start

easy job management

Page 41: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

shared racks

schedule every rack?

•  accurate timing and notification

•  some services need to be drained

•  why not? software will do the work anyway

Page 42: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

where are we now?

service owners handling

rack switch upgrades!

a network that constantly upgrades itself!

Page 43: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

how? Aggiornamento!

client / server model based on Thrift

runs on Linux, written in Python,

backed by MySQL

integrates across all internal systems:

▪  impact analysis and notification

•  scheduling

•  job management

Page 44: datacenter networking - NANOG Archive · datacenter networking david swafford network engineer dswafford@fb.com 8-OCT-2013 – NANOG 59 . 1.15B ... servers servers servers servers

automate your day job! focus on the impossible!


Recommended