Modern Network Monitoring for the Rest of Us - AusNOG · Modern Network Monitoring for the Rest of...

Post on 19-Apr-2018

216 views 2 download

transcript

CC-BY AusNOG 2017

Modern Network Monitoring for

the Rest of Us

AusNOG 2017Tim Raphael

1

CC-BY AusNOG 2017

Intro•Why?

• The current state, emerging trends and technologies

• Barriers for smaller ISPs and enterprises

• An effective monitoring strategy

• How?2

CC-BY AusNOG 2017

Why?

• React to failure situations

• Capacity and service utilisation analysis

• Discover new and emerging trends

• Drive continual improvement

3

CC-BY AusNOG 2017

Current State

• Off-the-shelf software and its limitations

• Larger and more innovative organisations tending to build their own due to scale and complexity

• Differences between Silicon Valley-like organisations and “the rest of us”

4

CC-BY AusNOG 2017

Technologies

• Push vs Pull

• Time-Series Data

• Event Stream Processing

5

CC-BY AusNOG 2017

DeviceCollector

DeviceCollector

Request (50ms)

Response (50ms)

Pushed (50ms)

Push vs Pull

6

CC-BY AusNOG 2017

Time Series

7

CC-BY AusNOG 2017

Event Stream Processing

Source Aggregation Threshold Alert

8

CC-BY AusNOG 2017

Limitations on the rest of us

• Time / $

• On-device computing power

• Vendor feature support

• Previous generation monitoring interfaces

9

CC-BY AusNOG 2017

Monitoring for the rest of us

• Top-down approach

• Business goals measured by metrics

• Focus on alert conditions to reduce noise and improve impact

• Continual improvement

10

CC-BY AusNOG 2017

if out.bits.rate > 90% AND out.discards.rate > 0:

alert: high priority:

else if out.bits.rate > 85%: alert: low priority

CC-BY AusNOG 2017

Services Affected

Identify Cause

Implement Change& Confirm

Re-evaluateMonitoring Strategy

Continual Improvement

12

CC-BY AusNOG 2017

Monitoring for the rest of us

• Use technology available to you

• Design for resilience

• Put collectors close to the data source

• Use buffers

• Store raw values

• Meta Monitoring

13

CC-BY AusNOG 2017

Monitoring for the rest of us

Use technology available to you.

14

CC-BY AusNOG 2017

Monitoring for the rest of usDesign for Resilience

Device Collector

Device Collector

Central Datastore

15

CC-BY AusNOG 2017

Monitoring for the rest of usMeta Monitoring is important

DeviceCollector

Device Collector

16

CC-BY AusNOG 2017

Software

• Storage: • InfluxDB, Graphite, Prometheus

• Collection: • Telegraf, Collectd, Sensu, Statd, SnmpCollector

• Alert Evaluation: • Riemann, Kapacitor, Sensu

• Display: • Grafana, Chronograf

17

CC-BY AusNOG 2017

Questions

@timraphael215 linkedin.com/in/timraphael215/ www.timraphael.com

18