+ All Categories
Home > Documents > The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From...

The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From...

Date post: 22-Jul-2021
Category:
Upload: others
View: 2 times
Download: 0 times
Share this document with a friend
32
The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data Martin Ringwelski , Christian Renner, Andreas Reinhardt (TUD), Andreas Weigel, Volker Turau ENERGYCON 2012 12 th September, 2012 TUHH TUHH Institute of Telematics Institute of Telematics Hamburg University of Technology Hamburg University of Technology
Transcript
Page 1: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

The Hitchhiker’s Guide to Choosingthe Compression Algorithm for your

Smart Meter DataMartin Ringwelski, Christian Renner, Andreas Reinhardt (TUD),

Andreas Weigel, Volker Turau

ENERGYCON 201212th September, 2012

TUHHTUHHInstitute of TelematicsInstitute of TelematicsHamburg University of TechnologyHamburg University of Technology

Page 2: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

IntroductionIntroduction

Motivation

� Emerging renewable energy resources� Grids to be relieved by tariffs for the current supply� Smart Metering helps the customer

� Shifting tasks to times with cheaper tariffs� Identifying heavy electricity consumers

� Meter data needs to be transmitted, but� Bad connectivity for GSM/UMTS in cellars� Powerline communication is not available in all areas

� Wireless mesh network� Hop-to-Hop data transport

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 11

Page 3: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

IntroductionIntroduction

Motivation

� Pro:� Wireless sensor nodes are cheap

� Contra:� Low Bandwidth� Bottleneck near Sink

� Reducing network load allows more nodes in network� Data formats are regulated and mandatory

⇒ Lossless data compression is needed

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 22

Page 4: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

IntroductionIntroduction

Smart Meter DataASCII-coded (EN-62056-21)

1 F . F(00000000)2 0.0.0(00617827)3 0.0.1(00000000)4 0.9.2(1070813)5 1.8.1(00000247*kWh)6 1.8.1*04(00000172*kWh)7 1.8.1*03(00000103*kWh)8 C.51.6*03(0000000000000)9 C.51.7(00000000)

10 C.77 .2 (0 A78422F69654D2D4952481FBF16 )11 C.86.0(20001010)12 31.25(0.008*A)13 51.25(0.009*A)14 71.25(0.045*A)15 32.25(0.66*V)16 52.25(0.57*V)17 72.25(225.04*V)18 1.35.0 .01(0.000*kW)19 P.01(0080213003000)(00000000)(15) (1) (1 .5) (kW)20 (00 .00 )21 (00 .02 )22 (00 .01 )23 (02 .00 )24 (00 .01 )

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 33

Page 5: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

IntroductionIntroduction

Motivation

� Trade-off: Compression gain ⇔ Hardware requirements� Sensor nodes have limited hardware resources

(RAM, ROM, speed)

� Modern algorithms concentrate on compression gain

Question:Which compression algorithm gives best results for the recordeddata by using as few resources as possible?

What we did:� Improve compression algorithms for wireless sensor nodes� Analyze the strengths and weaknesses

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 44

Page 6: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

IntroductionIntroduction

Agenda

1 Introduction

2 Compression Algorithms

3 Evaluation

4 Conclusion

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 55

Page 7: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

Compression AlgorithmsCompression Algorithms

Compression Types

� Entropy Coding: ◁ MoreUse variable Bit-length codes for symbols, depending on theirprobabilities. e.g.: Huffman-, Arithmetic-, Range-Encoding

� Wordbook Compression: ◁ MoreRecognize repetitions of strings and use references to encode them. e.g.:LZ77, LZ78

� Block-sorting: ◁ MoreRearrange the symbols in a way that they are easier to compressafterwards. e.g.: Burrows-Wheeler-Transformation, Move-To-Front

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 66

Page 8: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

Compression AlgorithmsCompression Algorithms

Reference Compression Algorithms

� Lempel Ziv Storer Szymanski (LZSS)Dictionary compression

� Lempel Ziv Welch (LZW)Dictionary compression (patent expired, used in GIF)

� DEFLATE (ZLIB, GZIP)Dictionary compression with Huffman Encoding (used in HTTP, PDF, ...)

� BZIP2Block sorting with Range-encoding

� Lempel Ziv Markov Chain Algorithm (LZMA)Dictionary compression with Reference History and subsequentRange-encoding

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 77

Page 9: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

Compression AlgorithmsCompression Algorithms

Entropy-based Compression

Adaptive Trimmed Huffman Coding (ATH) ◁ More� Developed for energy-constrained wireless sensor nodes� Adaptive Entropy Coding Scheme� Huffman tree is trimmed to reduce memory consumption� Prefix determines the encoding of next symbol

Adaptive Markov Chain Huffman Coding (AMCH) ◁ More� Uses probabilities of successive symbols� Each symbol has Huffman tree of following symbols� Trees are built during compression� Each tree has Escape symbol for not yet encountered (NYE) symbols

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 88

Page 10: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

Compression AlgorithmsCompression Algorithms

Wordbook-based Compression

tiny Lempel Ziv Markov Chain Algorithm (tLZMA)� Adaption of the LZMA Scheme� History window constrained to 128 Byte� No Range-encoding step

Lempel Ziv Markov Chain Huffman Coding (LZMH)� Combination of tLZMA and ATH� Dictionary compression with 128 Byte History� Compression of symbols with ATH method

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 99

Page 11: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

EvaluationEvaluation

Evaluation MethodologyData

� 3 500 ASCII-coded (EN-62056-21) datasets� From real smart meter installation� Size range from 76 to 3 100 Byte

� 95 power consumption measures of household devices� Sampled about once per second� Data in binary format� Sizes between 18.8 and 171.0 KByte

Methodology and Metrics� All Methods implemented in C, no use of heap memory

ZLIB, BZ2 and LZMA implementations use heap memory� Compression rate benchmarked on a desktop PC� Compression rate = 1 − compressed data size

uncompressed data size� Processing time benchmarked on ATmega 1281 (8 kB RAM, 7.37 MHz)� Processing time over size = consumed time

uncompressed data size

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 1010

Page 12: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

EvaluationEvaluation

Memory Consumption

Algorithm ROM (Byte) RAM (Byte)

static stack

LZSS 544 129 19

LZW 550 12 416 16

ZLIB 27 960 2 690 ca. > 1 000*

BZ2 28 332 1 564 ca. > 100 000*

LZMA 34 442 110 ca. > 6 000 000*

ATH 592 170 15

AMCH 1 680 1 820 21

tLZMA 992 133 27

LZMH 1 428 378 29

(*) Heap Memory

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 1111

Page 13: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

EvaluationEvaluation

Memory Consumption

Algorithm ROM (Byte) RAM (Byte)

static stack

LZSS 544 129 19

LZW 550 12 416 16

ZLIB 27 960 2 690 ca. > 1 000*

BZ2 28 332 1 564 ca. > 100 000*

LZMA 34 442 110 ca. > 6 000 000*

ATH 592 170 15

AMCH 1 680 1 820 21

tLZMA 992 133 27

LZMH 1 428 378 29

ZLIB, BZ2, LZMA andLZW use too muchmemory

⇒ can not be tested on

ATmega 1281

(*) Heap Memory

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 1111

Page 14: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

EvaluationEvaluation

Compression RateASCII encoded smart meter datasets

0

20

40

60

80

100

LZSSLZW

ZLIBBZ2

LZMAATH

AMCH

tLZMA

LZMH

com

pres

sion

rate

(%)

binary encoded daily device reports

0

20

40

60

80

100

LZSSLZW

ZLIBBZ2

LZMAATH

AMCH

tLZMA

LZMHcompression

rate

(%)

generic

entropy

wordbook

hybrid

� LZMH achieves compression rates comparable to ZLIB, BZ2 and LZMA� ATH performs good for small, but poor for bigger datasets

◁ Histogram

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 1212

Page 15: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

EvaluationEvaluation

Processing Time

� ATH is the fastest� ATH and AMCH are mostly

unaffected by the dataset� Other methods depend on the

compressability of the datasets

0

100

200

300

400

LZSSATH

AMCH

tLZMA

LZMHco

mpr

essi

ontim

e(m

s/K

Byt

e)

◁ Histogram

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 1313

Page 16: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

ConclusionConclusion

Selection Guidelines

LZSS ATH AMCH tLZMA LZMH

ROM

RAM

Performance

Compression Gain

Scenarios:� Very limited resources

� Focus on performance → ATH� Focus on compression → LZSS

� Limited resources, focus on compression → tLZMA� Moderate resources, best compression → LZMH� Good compression with predictable performance → AMCH

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 1414

Page 17: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

ConclusionConclusion

Resume

� Lossless data compression to reduce:� bandwidth� energy consumption� transmission costs

� Trade-off between compression gains, time, and resources� ATH, LZMH, LZSS and tLZMH fulfill the resource constraints� LZMH gives best compression results with moderate

execution times and resources� ATH is the fastest method, but has lowest compression rates

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 1515

Page 18: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

The Hitchhiker’s Guide to Choosingthe Compression Algorithm for your

Smart Meter DataMartin Ringwelski, Christian Renner, Andreas Reinhardt (TUD),

Andreas Weigel, Volker Turau

ENERGYCON 201212th September, 2012

Martin RingwelskiResearch Assistant

Phone +49 / (0)40 42878 3387

e-Mail [email protected]

http://www.ti5.tu-harburg.de/staff/ringwelski

TUHHTUHHInstitute of TelematicsInstitute of TelematicsHamburg University of TechnologyHamburg University of Technology

Page 19: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

Appendix

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 1616

Page 20: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

References� Frost & Sullivan, "Smart Meter Market – Frost & Sullivan Forecasts 109%

Growth in the UK," Online:http://www.frost.com/prod/servlet/press-release.pag?docid=238393168,2011.

� J. Vasconcelos, "Survey of Regulatory and Technological DevelopmentsConcerning Smart Metering in the European Union Electricity Market,"EUI RSCAS PP 2008/01, Florence School of Regulation, 2008.

� U. Greveler, B. Justus, and D. Loehr, "Multimedia Content IdentificationThrough Smart Meter Power Usage Profiles," in Proceedings of the 5thInternational Conference on Computers, Privacy, and Data Protection(CPDP), 2012.

� N. Zouba, F. Brémond, and M. Thonnat, "Multisensor Fusion forMonitoring Elderly Activities at Home," in Proceedings of the 6th IEEEInternational Conference on Advanced Video and Signal BasedSurveillance (AVSS), 2009.

� R. C. Kryter and H. D. Haynes, "Condition Monitoring of Machinery usingMotor Current Signature Analysis," in Proceedings of the 7th Power PlantDynamics, Control and Testing Symposium, 2003.

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 1717

Page 21: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

References� G. W. Hart, "Residential Energy Monitoring and Computerized

Surveillance via Utility Power Flows," IEEE Technology and SocietyMagazine, vol. 8, no. 2, 1989.

� J. Ziv and A. Lempel, "A Universal Algorithm for Sequential DataCompression," IEEE Transactions on Information Theory, vol. 23, no. 3,pp. 337-343, 1977.

� T. A. Welch, "A Technique for High-Performance Data Compression,"IEEE Computer, vol. 17, no. 6, pp. 8-19, 1984.

� G. J. Pottie and W. J. Kaiser, "Wireless Integrated Network Sensors,"Communications of the ACM, vol. 43, no. 5, pp. 51-58, 2000.

� K. Barr and K. Asanovic, "Energy Aware Lossless Data Compression," inProceedings of the 1st International Conference on Mobile Systems,Applications, and Services (MobiSys), 2003, pp. 231-244.

� N. Tsiftes, A. Dunkels, and T. Voigt, "Efficient Sensor NetworkReprogramming through Compression of Executable Modules," inProceedings of the 5th Annual IEEE Communications SocietyConference on Sensor, Mesh and Ad Hoc Communications andNetworks (SECON), 2008, pp. 359-367.

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 1818

Page 22: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

References

� C. M. Sadler and M. Martonosi, "Data Compression Algorithms forEnergy-Constrained Devices in Delay Tolerant Networks," in Proceedingsof the 4th ACM Conference on Embedded Networked Sensor Systems(SenSys), 2006, pp. 265-278.

� T. Schoellhammer, B. Greenstein, E. Osterweil, M. Wimbrow, and D.Estrin, "Lightweight Temporal Compression of Microclimate Datasets," inProceedings of the 29th IEEE Conference on Local Computer Networks(LCN), 2004, pp. 516-524.

� E.-O. Blass, L. Tiede, and M. Zitterbart, "An Energy-Efficient and ReliableMechanism for Data Transport in Wireless Sensor Networks," inProceedings of the 3rd International Conference on Networked SensingSystems (INSS), 2006, pp. 211-216.

� A. Reinhardt, "Designing Sensor Networks for Smart Spaces – UnifiedInterfacing and Energy-Efficient Communication between WirelessSensor and Actuator Nodes," Ph.D. dissertation, Technische UniversitätDarmstadt, Multimedia Communications Lab, 2011.

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 1919

Page 23: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

References

� J. Z. Kolter and M. J. Johnson, "REDD: A Public Data Set for EnergyDisaggregation Research," in Proceedings of the SustKDD Workshop onData Mining Applications in Sustainability, 2011.

� A. Reinhardt, D. Christin, M. Hollick, J. Schmitt, P. Mogre, and R.Steinmetz, "Trimming the Tree: Tailoring Adaptive Huffman Coding toWireless Sensor Networks," in Proceedings of the 7th EuropeanConference on Wireless Sensor Networks (EWSN ’10), no. LNCS 5970,Coimbra, Portugal, Feb. 2010.

� D. Huffman, "A Method for the Construction of Minimum-RedundancyCodes," Proceedings of the IRE, vol. 40, no. 9, pp. 1098-1101, 1952.

� J. S. Vitter, "Design and Analysis of Dynamic Huffman Codes," Journal ofthe Association for Computing Machinery, vol. 34, no. 4, pp. 825-845,1987.

� D. Salomon and G. Motta, Handbook of Data Compression, 5th ed.Springer, 2010.

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 2020

Page 24: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

Entropy

� Measure for the average self-information and theinformation density of a code system

� n: Number of symbols, pi : Probability of a symbol

H = −n∑

i=1pi · log2 pi

� The entropy is the average number of needed bits toencode one symbol in a message

◁ Back

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 2121

Page 25: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

Move-To-Front Example

Symbol Sequence Alphabet

b 1 abcdefghijklmnopqrstuvwxyz

a 1 bacdefghijklmnopqrstuvwxyz

n 13 abcdefghijklmnopqrstuvwxyz

a 1 nabcdefghijklmopqrstuvwxyz

n 1 anbcdefghijklmopqrstuvwxyz

a 1 nabcdefghijklmopqrstuvwxyz

◁ Back

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 2222

Page 26: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

LZSS Example

Hello you there, yea you there, yea you there,Hello!

Becomes:Hello you there, yea(5,11)(18,15)(0,5)!

The Numbers in brackets determine the offset and length of theprevious occurrence of the following string.

◁ Back

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 2323

Page 27: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

Range-encoding Example

Message: AABA$Intervals: A: 0 to 0,6 - B: 0,6 to 0,8 - $: 0,8 to 1A - 0 to 0,6A - 0 to 0,36B - 0,216 to 0,288A - 0,216 to 0,2591$ - 0,25046 to 0,2591⇒ 0,251 is the shortest number that lies in the end interval andthereby encodes the message AABA$.

◁ Back

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 2424

Page 28: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

The used ATH tree

1

23

456789

10111213

P

W

◁ Back

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 2525

Page 29: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

Histogramda

tasi

ze(K

Byt

e) com

pres

sion

rate

(%)

dataset

ZLIB ATH LZMH

0.0

0.5

1.0

1.5

2.0

0 500 1000 1500 2000 2500 3000

0

10

20

30

40

50

60

70

80

90

100

◁ Back

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 2626

Page 30: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

Histogram of Times

LZSS

0100200300400

proc

essi

ngtim

e(m

s/K

Byt

e)

ATH AMCH

0100200300400

tLZMA LZMH

0100200300400

data

size

(KB

yte)

dataset

0.00.51.01.52.0

0 500 1000 1500 2000 2500 3000

◁ Back

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 2727

Page 31: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

Symbol Histogram

0

3

6

9

12

15

18

21

24

27

30

nl cr & ( ) * - . 0 1 2 3 4 5 6 7 8 9 A B C D E F O P R V W h k

Symbol

Pro

bab

iliti

es

[%]

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 2828

Page 32: The Hitchhiker's Guide to Choosing the Compression ......3500ASCII-coded (EN-62056-21) datasets From real smart meter installation Size range from76to3100Byte 95power consumption measures

Symbol-transition Heatmap

Symbolübergänge

lf cr& ( ) * - . 0 1 2 3 4 5 6 7 8 9 A BCDE FOPRVWh k

Symbol

lfcr&()*-.0123456789ABCDEFOPRVWhk

Folg

esym

bol

0

20

40

60

80

100

Wah

rsch

einl

ichk

eit[

%]

◁ Back

Martin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter DataMartin Ringwelski et al. The Hitchhiker’s Guide to Choosing the Compression Algorithm for your Smart Meter Data 2929


Recommended