A Heartbeat Mechanism and its Application in Gigascope Johnson, Muthukrishnan, Shkapenyuk,...

A Heartbeat Mechanism and its Application in Gigascope

Johnson, Muthukrishnan, Shkapenyuk, Spatscheck

Presented by:

Joseph Frate and John Russo

Introduction Unblocking aggregation, join and union

operators Limit scope of output tuples that an input tuple

can affect. Two techniques

Define queries over window of input stream Applicable to continuous query systems for monitoring

Timestamp mechanism Applicable to data reduction applications

Gigascope A high-performance streaming database for network

monitoring Some fields of input stream behave as timestamps Locality of input tuple

Aggregation must have timestamp as one of group by fields

Join query relates timestamp fields of both inputs Merge operator

Union operator Preserves timestamp property of one of the fields

Timestamps and Gigascope Effective as long as progress is made in all

input streams Join or merge operators can stall if one input

stream stalls, causing systems failure

Example Sites monitored by Gigascope have multiple gigabit

connections to the Internet. One or more is a backup link through which traffic

can be diverted if primary link fails All links are monitored simultaneously Since primary link has gigabit traffic and backup

link has no traffic, merge operator will quickly overflow.

Presence of tuples carries timestamps, absence does not

Focus of Paper Authors propose use of heartbeats or punctuation to

unblock operators Heartbeats originate at source query operators and

propagate throughout query Timestamp punctuations are generated at source query

nodes and inferred at every other operator Punctuated heartbeats unblock operators that would

otherwise be blocked Focus is on multi-stream operators (joins and merges) Significantly reduces memory load for join and merge

operators

Related Work Heartbeat

Widely used for fault-tolerance in distributed systems Remote nodes send periodic heartbeats to let other nodes

know that they are alive Heartbeats are also used in distributed DSMS

Stream Punctuations Embed special marks in stream to indicate end of subset

of data Previous work in heartbeat mechanisms with

streaming data have focused on enforcing ordering of timestamps of tuples before query processing

Gigascope Architecture Stream-only database No continuous queries, thus no windows to

unblock queries Streams are labeled with timestampness, such

as monotone increasing. Used by query planner to unblock blocking operators

Gigascope Architecture (Continued) Aggregation query

One of the group by attributes must have timestampness. When this attribute changes, all groups and aggregates are

flushed to the operator’s output We can define this as an epoch. Flush occurs at the end

of each epoch. Example: here time is labeled as monotone increasing

select tb, srcIP, count(*) from TCPgroup by time/60 as tb, srcOP

In this query, we count the packets from each source IP address during 60 second epochs

Gigascope Architecture (Continued) Merge Operator

Union of two streams: A and B A and B must have same schema Both streams must have a timestamp field, such

as t, on which to merge If tuples on A have a larger value of t than those

on B, tuples on A are buffered until B catches up

Gigascope Architecture (Continued) Join Query

Must contain a join predicate such as A.tr=B.ts or A.tr/3 = B.ts+2 Relates timestamp field from A with B

Input streams are buffered so that streams match up on the timestamp.

Two-Level Query Architecture Low level used for data

reduction High level performs

complex processing Controls high streaming

rates Data streams from NICs

are placed in a ring buffer. These are called source

streams

Two-Level Query Architecture (continued)

Since volumes are too large to provide copy to each query, Gigascope creates a subquery

For example: Query Q to be executed over source stream S Gigascope creates a subquery q which directly

accesses S Q is transformed into Q0 which is executed over

the stream output of q


Low-level subqueries are called LFTAs Fast, lightweight data reduction queries Objective is to quickly process high volume data

stream in order to minimize buffer requirements Expensive processing is performed on output of low

level queries Smaller volume Easily buffered


Much of subquery processing can be performed on the NIC itself

Low-level aggregation uses a fixed-size hash table for maintaining groups in group by

If a collision occurs in hash table, old group is ejected as a tuple and new group replaces it in its slot

Similar methodology as subaggregates and superaggregates in data cube computations Higher level queries complete aggregation


Traffic shaping policies are implemented in order to spread out processing load Aggregation operator uses a slow flush to emit

tuples when aggregation epoch changes

Heartbeats to Unblock Streaming Operators

Gigascope heartbeats are produced by low-level query operators Propagated throughout query DAG Incur same queuing delays as tuples System performance monitoring Aids in detecting failed nodes

Stream punctuation mechanism is implemented Injects special temporal update tuples into operator output streams Informs receiving operator of end of subset of data Authors first attempted on-demand generation of temporal update

tuples Authors settled upon approach of using heartbeats to carry temporal

update tuples

Schema of Temporal Update Tuple Identical to regular tuple All attributes marked as temporal are initialized with

values that will not violate the ordering properties i.e. timebucket is marked as temporal increasing Temporal update tuple with timebucket=t is received by

an operator All future tuples will have timebucket >=t Non-temporal attributes are ignored by receiving

operators Goal is to generate temporal attribute values aggressively

and set them to highest possible value

Heartbeat Generation at LFTA LevelLow-level Streaming Operators

Read data directly from source data streams (packets sniffed from NICs)

Use filtering, projection and aggregation to reduce amount of data in a stream before passing it to higher-level nodes

Two types of streaming operators: selection and aggregation

Multi-query optimization through prefilters

Heartbeat Generation at LFTA LevelLow-level Streaming Operators

Normal mode of operation is to block waiting for new tuples to be posted to NICs ring buffer.

Once a tuple is in the buffer, it is processed Wait is periodically interrupted to generate a

punctuated heartbeat

Heartbeats in select LFTAs Selection, projection and transformation is

performed by LFTAs on packets arriving from data stream source.

If predicate is satisfied, output tuple is generated according to projection list.

A few additions Modify accept_tuple to save all temporal attribute values

referenced in select clause Whenever a request is received to generate a temporal

update tuple, use maximum of saved value of temporal attributes and a value saved by prefilter to infer value of temporal update tuple

Heartbeats in Aggregation LFTAs Group by and aggregation functionality is implemented using direct-

mapped hash table Collision results in ejected tuple being sent to output stream

High-level aggregation node completes aggregation of partial results produced by LFTA

Flushing occurs whenever incoming tuple advances the epoch. Slow-flush is used to avoid overflow of buffers

Gradually emits tuples as new tuples arrive Last seen temporal values in input stream are saved, similar to select. This value is used to generate temporal update tuples Also maintains value of last temporal attribute of last tuple flushed

Heartbeats in Aggregation LFTAs Whenever a request for a heartbeat is received, uses

following formula in order to insure that heartbeat does not violate temporal attribute ordering properties:

if we have unflushed tuples:use the value of last flushed tuple

elseuse max of saved value of the temporal attributes and the value saved by theprefilter

Using System Time for Temporal Values

When a link has no traffic for a long time, heartbeat is generated using inference from system time.

Skew between system clock and buffering has to be accounted for when setting up Gigascope system

High Level Query Nodes (HFTA) Second level of query execution (two levels) LFTA for low-level filtering and sub-queries HFTA for complex query processing

Selection Multiple types of aggregation Stream merge Inner and outer join

Data from multiple streams (HFTAs, LFTAs)

HFTA Execution Normal mode of execution:

Block while waiting for new tuples to arrive Determines which operator to apply to that stream Process the incoming tuple for that operator Route output (if operator required) Also regularly receives temporal update tuples

Operators interpret temporal update tuples Operators use these tuples to unblock themselves

HFTA: Heartbeats in selection Mostly identical to selection LFTAs

Unpack field values referenced in query predicate Check if predicate is satisfied If satisfied, generate projection list

Difference: can receive temporal update tuples (in addition to regular data tuples) Temporal update tuple is received Operator updates the saved values of all temporal

attributes referenced in query Select Generates new temporal update tuple Rest of normal tuple processing is bypassed

HFTA: Heartbeats in aggregation and sampling

Non-blocking operator Aggregates data within a time window (epoch) Epoch defined by values of temporal group-by attributes

Required to keep all groups and corresponding aggregates until end of epoch Then flushes to output stream “Slow flush” to avoid overflow

One output tuple for each input tuple Change in epoch immediate flush

HFTA: Heartbeats in stream merge Union of two streams R and S in a way that

preserves the ordering properties of the temporal attributes

R and S must have same schema Both must have a temporal field to merge on Stream buffers until the other catches up (if

difference in time fields)

HFTA: Heartbeats in join Relate two data streams by timestamp

e.g., R.tr = 2 * S.ts INNER, LEFT, RIGHT, and FULL OUTER Minimum timestamps for R and S maintained Buffers input streams to ensure they match up

by timestamp predicate May include temporal attributes

Other heartbeat applications Initial goal of heartbeats was to collect

statistics about nodes in a distributed setting Main focus of paper is to use heartbeats to

carry stream punctuation Found that there are other uses for heartbeats

Fault tolerance Heartbeat mechanism used to detect node failures in

distributed systems Nodes must “ping” to indicate still alive Gigascope slightly different:

Heartbeats periodically generated by low-level queries Propagated upward through query execution DAG

Constant flow of heartbeat tuples indicates if low-level sub-query is responding or not responding No heartbeat over specified amount of time query has

failed and recovery is initiated

System performance analysis Heartbeat message subject to same routing as

other data in network Heartbeat visit all nodes Statistics gathered on every node visit Full trace of all nodes Can be used to identify poor-performing

nodes

Distributed query optimization

Automated tools to utilize collected statistics Re-optimize query execution plans (eliminate

identified bottlenecks) Other statistics:

Predicate selectivities Data arrival rates Tuple processing costs

Performance evaluation Experiments conducted on live network feed Queries monitor three network interfaces

main1 and main2: DAG4.3GE Gigabit Ethernet interfaces (see bulk of traffic)

control: 100Mbit interface Approximately 100,000 packets per second

400 Mbits/sec Dual-processor 2.8 GHz P4, 4GB RAM, FreeBSD 4.10 Focus is to unblock streaming operators

high-rate main links low-rate backup links (control)

Evaluation: Unblocking stream merge using heartbeats

Effect of heartbeats on memory usage of queries that use stream merge operator

Used the following query:SELECT tb, protocol, srcIP, destIP, srcPort,

destPort, count(*)FROM DataProtocolGROUP BY time/10 as tb, protocol, srcIP,

destIP, srcPort, destPort


Query computes the number of packets observed in different flows in 10 second time buckets

Query planner merges three streams:


When control link has no traffic, large amount of data buffered (from other streams)

Varied heartbeat interval From 1 second to 30 seconds 5 second increments


Heartbeats successfully unblock the stream merge operators Higher intervals more state maintained Also results in more data being flushed when an

epoch advances (could cause query failure when system lacks traffic sharing, such as slow flush)

Evaluation: Unblocking join operators using heartbeats

How effectively do heartbeats… Unblock join queries Reduce overall query memory requirements

Evaluation: Unblocking join operators using heartbeats (continued)

Query flow1:SELECT tb,protocol,srcIP,destIP, srcPort,destPort,count(*)

as cntFROM [main0_and_control].DataProtocolGROUP BY time/10 as tb,protocol,srcIP, destIP, srcPort, destPort;

Query flow2:SELECT tb,protocol,srcIP,destIP, srcPort,destPort,count(*) as cntFROM main1.DataProtocolGROUP BY time/10 as tb,protocol,srcIP, destIP, srcPort, destPort;


Query full_flow:SELECT flow1.tb,flow1.protocol, flow1.srcIP, flow1.destIP,

flow1.srcPort,flow1.destPort,flow1.cnt, flow2.cnt

OUTER_JOIN FROM flow1, flow2WHERE flow1.srcIP=flow2.srcIP and

flow1.destIP=flow2.destIP andflow1.srcPort=flow2.srcPort andflow1.destPort=flow2.destPort andflow1.protocol=flow2.protocol andflow1.tb = flow2.tb



Two sub-queries compute the flows aggregated in 10 second time-buckets main1+control main2

Full query results combined using full outer join for final output


Varied interval of generated heartbeats from 1 second to 60 seconds 10 second increments


Less memory usage with greater heartbeat interval (linear growth) Avoid accumulating large state of blocking

operators Decrease bursty-ness of output (since less data is

dumped at the end of the heartbeat interval when the interval is low)

Evaluation: CPU overhead of heartbeat generation

Measure CPU overhead from heartbeats Measure average CPU load of a merge query:

SELET tb, protocol, srcIP, destIP, srcPort, destPort, count(*)FROM DataProtocolGROUP BY time/10 as tb, protocol, srcIP, destIP, srcPort, destPort

Run on two high-rate interfaces Compared 1 second heartbeat interval vs. identical

system with heartbeats disabled

Evaluation: CPU overhead of heartbeat generation (continued)

Both monitored links have moderately high load, so merge operators are naturally unblocked

Heartbeats disabled: 37.3% CPU load Heartbeats enabled: 37.5% CPU load Insignificant difference

Explained by variations in traffic load Conclusion: overhead of heartbeat mechanism is

immeasurably small

Conclusion Heartbeats regularly generated

Propagated to all nodes Attach temporal update tuples to unblock operators

Heartbeat mechanism can be used for other applications in distributed setting Detecting node failure Performance monitoring Query optimization

Performance evaluation Capable of working at multiple Gigabit line speeds Significantly decrease query memory utilization

Date post:	21-Dec-2015
Category:	Documents
View:	216 times
Download:	0 times

A Heartbeat Mechanism and its Application in Gigascope Johnson, Muthukrishnan, Shkapenyuk,...

Documents