+ All Categories
Home > Documents > Network Summarization - UCSBmedya/papers/Poster1.pdfNetwork Summarization A Comparison of Methods...

Network Summarization - UCSBmedya/papers/Poster1.pdfNetwork Summarization A Comparison of Methods...

Date post: 30-Jul-2020
Category:
Upload: others
View: 0 times
Download: 0 times
Share this document with a friend
1
Network Summarization A Comparison of Methods Networks are ubiquitous – they model numerous complex structures and processes. In order to analyze such networks, the size of the network data must be reduced; thus, network summarization becomes important. We experimented with three methods of compression: Slice Tree, Spectral Graph Fourier, and Spectral Graph Wavelets. We examine each method for its scalability and accuracy on real and synthetic datasets. We discover that the Slice Tree algorithm is scalable and outperforms the Spectral Graph methods when the network values change smoothly across the topology. A network is a set of nodes and a set of edges that represent relationships between nodes. Networks generated by real-world complex systems are extremely large. For example, Twitter has 40 million nodes and 1.4 billion edges. We are summarizing the dynamic states (or attributes) of the nodes. For example, in Traffic networks, the average speed in a location/node varies with time. A smooth network contains states that do not change rapidly across the network. Spectral Graph Fourier Slice Tree (ST) Scalability Datasets Acknowledgments: Special thanks to Prof. Ambuj Singh, Arlei Silva, Sourav Medya, and Tim Robinson. We would also like to thank the National Science Foundations and University of California Santa Barbara for funding our research. This work was supported in part by the National Science Foundation through grant numbers DGE-1258507 and IIS-1219254. References: Silva, Arlei, Petko Bogdanov, and Ambuj Singh. “Network State Summarization via In-Graph Compression.” D. I. Shuman, S. K. Narang, P. Frossard, A. Ortega, and P. Vandergheynst. The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains. Signal Processing Magazine, IEEE, 30(3):83–98, 2013. Hammond, David K., Pierre Vandergheynst, and Rémi Gribonval. Wavelets on Graphs via Spectral Graph Theory. Applied and Computational Harmonic Analysis: 129-50. Print. Ali Hajimirza, U. Oklahoma. Jason White, CSU San Bernardino. Mentors: Sourav Medya Arlei Silva Prof. Ambuj Singh Introduction Summarizing a Network Background Experiments and Results Conclusion Future Work 2 6 1 8 5 4 3 7 3 7 3 3 Let W be an adjacency matrix and f an array of node values. Compute the Laplacian: Compute the eigenvalues ( ) and eigenvectors ( ) of the Laplacian. Forward transform Inverse transform Store the high signal values. Each signal value and its position occupy 12 bytes. Use the stored signal values to approximate the original values. 0 2 4 6 8 10 12 0 0.438 1 3 3 4.562 Absolute Signal Value λ l ˆ f (λ l ) L = D W 0 2 4 6 8 10 12 100 500 900 1300 1700 Seconds Number of nodes Fourier Slice Tree 0 5000 10000 15000 20000 25000 30000 0 2412 6012 9612 13212 16812 20412 Error Compressed Size (Bytes) Fourier Slice Tree 1 100 10000 1000000 100000000 1E+10 1E+12 DBPL Human Synthetic Trac Twitter Wikipedia Error Datasets Fourier Slice Tree This experiment is designed to measure the scalability of the Fourier and Slice Tree algorithms. The compressed network size is kept constant (50 bytes) while the number of nodes are varied. Due to the time complexity of eigenvector decomposition, the time taken by Fourier grows exponentially. We measure the error between the original and compressed network states. Each network is reduced to 1000 nodes and compressed to 600 bytes. The error is computed by the following formula: Slice Tree partitions a network into smooth regions such that each region can be compactly represented by the mean of the node values in that region. Each slice occupies 10 bytes and is represented by a center node, radius, and mean value. Finding the optimal partitioning given a budget is NP-hard. Slice Tree greedily computes the best slice at each iteration. The greedy algorithm is computationally expensive. Therefore, an importance sampling method is introduced to improve the runtime for very large networks. Implement the Spectral Graph Wavelets and compare against the implemented algorithms. Use Slice Tree to detect outlier node values in a network. Explore the possibilities of using the Slice Tree method to predict the values of missing nodes in a graph. Figure 1: A snapshot of a Facebook friendship network. λ l u l Figure 2: The Slice Tree of the network in Figure 1. The Slice Tree algorithm scales much better than the Fourier algorithm. For smooth networks, Slice Tree gives better compression results. For “non-smooth” networks, the accuracy of Fourier is comparable to Slice Tree. (II) f (i)= N 1 l=0 ˆ f (λ l )u l (i) (I) (I) ˆ f (λ l ) := N 1 i=1 f (i)u l (i) (II) Figure 4: A comparison of runtimes for each algorithm on a traffic network. Accuracy The accuracy of the algorithms are determined by measuring the loss of information for different compressed sizes. Due to the smoothness of the network, Slice Tree gives a more accurate compressed network state. Figure 6: Accuracy of each algorithm with respect to the compressed network state using the traffic network. Figure 5: A comparison of error for each algorithm on networks with different value distributions (in logarithmic scale). Error (V )= v V (V (v ) orig V (v ) final ) 2 Figure 3: Fourier transformation from network to signal and back.
Transcript
Page 1: Network Summarization - UCSBmedya/papers/Poster1.pdfNetwork Summarization A Comparison of Methods Networks are ubiquitous – they model numerous complex structures and processes.

Network Summarization A Comparison of Methods

Networks are ubiquitous – they model numerous complex structures and processes. In order to analyze such networks, the size of the network data must be reduced; thus, network summarization becomes important. We experimented with three methods of compression: Slice Tree, Spectral Graph Fourier, and Spectral Graph Wavelets.  We examine each method for its scalability and accuracy on real and synthetic datasets. We discover that the Slice Tree algorithm is scalable and outperforms the Spectral Graph methods when the network values change smoothly across the topology.

A network is a set of nodes and a set of edges that represent relationships between nodes. Networks generated by real-world complex systems are extremely large. For example, Twitter has 40 million nodes and 1.4 billion edges. We are summarizing the dynamic states (or attributes) of the nodes. For example, in Traffic networks, the average speed in a location/node varies with time. A smooth network contains states that do not change rapidly across the network.

Spectral Graph Fourier Slice Tree (ST)

Scalability Datasets

Acknowledgments: •  Special thanks to Prof. Ambuj Singh, Arlei Silva, Sourav Medya, and Tim Robinson. •  We would also like to thank the National Science Foundations and University of California Santa Barbara for funding our research. •  This work was supported in part by the National Science Foundation through grant numbers DGE-1258507 and IIS-1219254.

References: •  Silva, Arlei, Petko Bogdanov, and Ambuj Singh. “Network State Summarization via In-Graph Compression.” •  D. I. Shuman, S. K. Narang, P. Frossard, A. Ortega, and P. Vandergheynst. The emerging field of signal processing on graphs: Extending

high-dimensional data analysis to networks and other irregular domains. Signal Processing Magazine, IEEE, 30(3):83–98, 2013. •  Hammond, David K., Pierre Vandergheynst, and Rémi Gribonval. Wavelets on Graphs via Spectral Graph Theory. Applied and

Computational Harmonic Analysis: 129-50. Print.

Ali Hajimirza, U. Oklahoma. Jason White, CSU San Bernardino.

Mentors: Sourav Medya Arlei Silva Prof. Ambuj Singh

Introduction

Summarizing a Network

Background

Experiments and Results

Conclusion Future Work

2 6

1

8

5

4

3 7

3

7

3

3

•  Let W be an adjacency matrix and f an array of node values. •  Compute the Laplacian: •  Compute the eigenvalues ( ) and eigenvectors ( ) of the Laplacian. •  Forward transform çè Inverse transform •  Store the high signal values. •  Each signal value and its position occupy 12 bytes. •  Use the stored signal values to approximate the original values.

0  

2  

4  

6  

8  

10  

12  

0   0.438   1   3   3   4.562  

Absolute  Signal  Value  

λl

f̂(λ

l)

L = D −W

0

2

4

6

8

10

12

100 500 900 1300 1700

Seco

nds

Number of nodes

Fourier Slice Tree

0

5000

10000

15000

20000

25000

30000

0 2412 6012 9612 13212 16812 20412

Erro

r

Compressed Size (Bytes)

Fourier Slice Tree

1

100

10000

1000000

100000000

1E+10

1E+12

DBPL Human Synthetic Traffic Twitter Wikipedia

Erro

r

Datasets

Fourier Slice Tree

•  This experiment is designed to measure the scalability of the Fourier and Slice Tree algorithms.

•  The compressed network size is kept constant (50 bytes) while the number of nodes are varied.

•  Due to the time complexity of eigenvector decomposition, the time taken by Fourier grows exponentially.

•  We measure the error between the original and compressed network states.

•  Each network is reduced to 1000 nodes and compressed to 600 bytes.

•  The error is computed by the following formula:

•  Slice Tree partitions a network into smooth regions such that each region can be compactly represented by the mean of the node values in that region.

•  Each slice occupies 10 bytes and is represented by a center node, radius, and mean value.

•  Finding the optimal partitioning given a budget is NP-hard. Slice Tree greedily computes the best slice at each iteration.

•  The greedy algorithm is computationally expensive. Therefore, an importance sampling method is introduced to improve the runtime for very large networks.

•  Implement the Spectral Graph Wavelets and compare against the implemented algorithms.

•  Use Slice Tree to detect outlier node values in a network. •  Explore the possibilities of using the Slice Tree method to predict the values of

missing nodes in a graph.

Figure 1: A snapshot of a Facebook friendship network.

λl ul

Figure 2: The Slice Tree of the network in Figure 1.

•  The Slice Tree algorithm scales much better than the Fourier algorithm. •  For smooth networks, Slice Tree gives better compression results. •  For “non-smooth” networks, the accuracy of Fourier is comparable to Slice Tree.

(II) f(i) =N−1!

l=0

f̂(λl)ul(i)

(I)

(I) f̂(λl) :=N−1!

i=1

f(i)ul(i)

(II)

Figure 4: A comparison of runtimes for each algorithm on a traffic network.

Accuracy •  The accuracy of the algorithms are determined by

measuring the loss of information for different compressed sizes.

•  Due to the smoothness of the network, Slice Tree gives a more accurate compressed network state.

Figure 6: Accuracy of each algorithm with respect to the compressed network state using the traffic network.

Figure 5: A comparison of error for each algorithm on networks with different value distributions (in logarithmic scale).

Error(V ) =!

v∈V

(V (v)orig − V (v)final)2

Figure 3: Fourier transformation from network to signal and back.

Recommended