Decentralised Coordination of Mobile Sensors using the Max-Sum Algorithm Ruben Stranders, Alex...

transcript

Decentralised Coordination of Mobile Sensors using the Max-Sum Algorithm

Ruben Stranders, Alex Rogers, Nick JenningsSchool of Electronics and Computer ScienceUniversity of Southampton{rs06r, acr, nrj}@ecs.soton.ac.uk

This presentation focuses on the use of Max-Sum to coordinate mobile sensors

Sensor Architecture & Max-Sum

Empirical Evaluation

Speeding up Max-Sum

Coordinate

This work can be applied to improve situational awareness in dynamic scenarios

Disaster Response

Military Surveillance

Climate Research

Our contribution is a coordination mechanism for a team of autonomous mobile sensors

These mobile sensors continuously monitor spatial phenomena

The main challenge is to coordinate the sensors in order to the state of these spatial phenomena

Limited Communication

No centralised control

To solve this coordination problem, we had to address three challenges

1. How to model the phenomena?2. How to value potential samples?3. How to coordinate to gather

samples of highest value?

The three central challenges are clearly reflected in the architecture of our sensing agents

Samples sent toneighbouring agents

Samples received fromneighbouring agents

Information processing

Model of Environment

Outgoing negotiation messages

Incomingnegotiation messages

Value of potential samples Action

Selection

Samples from own sensor

SensingAgent

Rawsamples

Coordinate

These three challenges are clearly reflected in the architecture of our sensing agents

Selection

SensingAgent

Rawsamples

The sensors model the spatial phenomenon using the Gaussian Process

Weak Strong

Spatial Correlations

Areas of Rapid Change

Weak Strong

Temporal Correlations

0 0.5 1 1.5 2 2.5 3 3.5 4

The value of a sample is determined how much it reduces uncertainty

Selection

SensingAgent

Rawsamples

The value of a sample is based on how much it reduces uncertainty

But how to determine uncertainty reduction before collecting a sample?

0 1 2 3 4 5 6 7 8 9 100

PredictionConfidence IntervalCollected Sample

Gaussian Process not only gives predictions, but also confidence intervals

0 1 2 3 4 5 6 7 8 9 100

Potential Sample Location

0 1 2 3 4 5 6 7 8 9 100

Measure of uncertainty

0 1 2 3 4 5 6 7 8 9 100

Specifically, we can use information metrics such as Entropy ,or Mutual Information

);( YXMI

The sensor agents coordinate using the Max-Sum algorithm

Selection

SensingAgent

Rawsamples

Coordinate

Max-Sum is a powerful algorithm for solving DCOPs

Complete Algorithms

DPOPOptAPOADOPT

Communication Cost

Iterative Algorithms

Best Response (BR)Distributed Stochastic

Algorithm (DSA) Fictitious Play (FP)

Max-SumAlgorithm

Optimality

Max-Sum solves the social welfare maximisation problem in a decentralised way

Mobile Sensors

Movement Parameters

Utility Functions

)( 33 xU

Localised Interaction

},,,{ 54313 xxxxx

)(maxarg xx

Social welfare:

Mobile Sensors

The input for the Max-Sum algorithm is a graphical representation of the problem: a Factor Graph

Variable nodes Function nodes

Agent 1Agent 2

Agent 3

Max-Sum solves the social welfare maximisation problem by message passing

Variable nodes Function nodes

Agent 1Agent 2

Agent 3

Max-Sum solves the social welfare maximisation problem by message passing

jiadjk

iikiji xrxq\)(

ijadjk

kjkjji

iij xqUxrj \)(\

)()(max)( xx

From variable i to function j

From function j to variable i

To use Max-Sum, we encode the mobile sensor coordination problem as a factor graph

Sensor 1Sensor 2

Sensor 3

Sensor 1

Sensor 2

Sensor 3

Variables represent the sensors’ movements

Sensor 1Sensor 2

Sensor 3

Functions represent the uncertainty reduction that results from collecting a sample

Sensor 1Sensor 2

Sensor 3

Sensor 1Sensor 2

Sensor 3

Sensor 1Sensor 2

Sensor 3

Sensor 1Sensor 2

Sensor 3

Sensor 1Sensor 2

Sensor 3

ijadjk

kjkjji

iij xqUxrj \)(\

)()(max)( xx

Unfortunately, the straightforward application of Max-Sum is too computationally expensive

jiadjk

iikiji xrxq\)(

ijadjk

kjkjji

iij xqUxrj \)(\

)()(max)( xx

Unfortunately, the straightforward application of Max-Sum is too computationally expensive

jiadjk

iikiji xrxq\)(

Bottleneck!

ijadjk

kjkjji

iij xqUxrj \)(\

)()(max)( xx

Therefore, we developed two general pruning techniques that speed up Max-Sum

Goal: Make as small as possible

ijadjk

kjkjji

iij xqUxrj \)(\

)()(max)( xx

Therefore, we developed two general pruning techniques that speed up Max-Sum

1. Try to prune the action spaces of individual sensors

2. Try to prune joint actions

The first pruning technique prunes individual actions by identifying dominated actions

1. Neighbours send bounds

↑ [2, 2]↓ [1, 1]

↑ [5, 6]↓ [0, 1]

↑ [1, 2]↓ [3, 4]

2. Bounds are summed[8, 10]

[4, 7]

3. Dominated actions are pruned [8, 10]

[4, 7]X

ijadjk

kjkjji

iij xqUxrj \)(\

)()(max)( xx

We developed two general pruning techniques that speed up Max-Sum

1. Try to prune the action spaces of individual sensors

2. Try to prune joint actions

The second pruning technique reduces the joint action space because exhaustive enumeration is too costly

Action

Sensor 1 Sensor 2 Sensor 3

Action

Etcetera…

The second pruning technique prunes the joint action space using branch and bound

Sensor 1

Sensor 2

Sensor 3

[7, 13][0, 4] [2, 6]

Sensor 1

Sensor 2

Sensor 3

[7, 13][0, 4] [2, 6]XX

Sensor 1

Sensor 2

Sensor 3

9 10 7 8

[7, 13][0, 4] [2, 6]XX

Sensor 1

Sensor 2

Sensor 3

9 10 7 8

[7, 13][0, 4] [2, 6]XX

X X XO

Sensor 1

Sensor 2

Sensor 3

This demonstration shows four sensors monitoring a spatial phenomenon

Sensors

UncertaintyContours

To empirically evaluate our algorithm, we measured speed up and prediction error

UncertaintyContours

9 10 7 8

[7, 13][0, 4] [2, 6]XX

The two pruning techniques combined prune 95% of the action space with 6 neighbouring sensors

2 2.5 3 3.5 4 4.5 5 5.5 60

Number of neighbouring sensors

Our Algorithm reduces Root Mean Squared Error of predictions up to 50% compared to Greedy

Our Al-gorithm

Greedy Random Fixed0.0

In conclusion, the use of Max-Sum leads to an effective coordination algorithm for mobile sensors

1. Decentralised

2. Fast

1. Decentralised

2. Fast

3. Accurate predictions

Greedy

Random

For future work, we wish to extend the algorithm to do non-myopic planning

References

• R. Stranders, A. Farinelli, A. Rogers and N.R. Jennings (2009): Decentralised Coordination of Mobile Sensors Using the Max-Sum Algorithm. In: Proc 21st Int. Joint Conf on AI (IJCAI), Pasadena, USA. (In Press)

• R. Stranders, A. Farinelli, A. Rogers and N.R. Jennings (2009): Decentralised Coordination of Continuously Valued Control Parameters using the Max-Sum Algorithm. 8th Proc. Int. Conf. on Autonomous Agents and Multiagent Systems (AAMAS), Budapest. (In Press)

1. Decentralised

2. Fast

3. Accurate predictions

Greedy

Random

Questions?

Decentralised Coordination of Mobile Sensors using the Max-Sum Algorithm Ruben Stranders, Alex...

Documents