+ All Categories
Home > Technology > EDF2012 Peter Boncz - LOD benchmarking SRbench

EDF2012 Peter Boncz - LOD benchmarking SRbench

Date post: 04-Jul-2015
Category:
Upload: european-data-forum
View: 1,020 times
Download: 3 times
Share this document with a friend
18
Benchmarking Linked Open Data technology SRbench: A Benchmark for Streaming RDF Storage Engines Ying Zhang, Peter Boncz (CWI, Amsterdam)
Transcript
Page 1: EDF2012   Peter Boncz - LOD benchmarking SRbench

Benchmarking

Linked Open Data technology

SRbench: A Benchmark for Streaming RDF Storage Engines

Ying Zhang, Peter Boncz (CWI, Amsterdam)

Page 2: EDF2012   Peter Boncz - LOD benchmarking SRbench

What is Database Benchmarking?

Standard test to measure and understand how technology performs

Dataset definition

at various scales (100GB, 300GB, 1TB, 3TB, etc)

mimicks a recognizable relevant usage scenario

Database Queries

often between 10-100 queries, with parameters

+ rules/programs that specify how these queries are posed

Result Metrics

a number to understand the result

tps = “transactions/second”

$/QphH@size = “price per query per hour”

Audit Rules

allow results to be checked by independent auditors

prevent/limit cheating

Ying Zhang, Peter Boncz – Benchmarking Linked Open Data Technology June 7, 2012 @EDF Copenhagen

Page 3: EDF2012   Peter Boncz - LOD benchmarking SRbench

make competing products comparable

accelerate progress, make technology viable

Why Benchmarking?

Ying Zhang, Peter Boncz – Benchmarking Linked Open Data Technology June 7, 2012 @EDF Copenhagen

© Jim Gray, 2005

Page 4: EDF2012   Peter Boncz - LOD benchmarking SRbench

LOD = Linked Open Data

web addressable data RDF data format ( )

lots of useful data on the web (“LOD cloud”)

LOD technology (SPARQL) benchmarks:

BSBM, DBpedia Benchmark, SIB

SRbench topic of this talk

New industry cooperation:

Benchmarking LOD Technology

June 7, 2012 @EDF CopenhagenYing Zhang, Peter Boncz – Benchmarking Linked Open Data Technology

Page 5: EDF2012   Peter Boncz - LOD benchmarking SRbench

6/9/2012 5

LDBC: FP7 2012-2015

vendor cooperation to establish accepted RDF/Graph database benchmarks and benchmark results

* tentative/expected project

Page 6: EDF2012   Peter Boncz - LOD benchmarking SRbench

6/9/2012 6

LDBC Goals

1. Create the LDBC Foundation of graph and RDF DB vendors

2. Equip de LDBC Foundation with a good initial set of benchmarks, and benchmark results

spin-off

Page 7: EDF2012   Peter Boncz - LOD benchmarking SRbench

Benchmarking

Linked Open Data technology

SRbench: A Benchmark for Streaming RDF Storage Engines

Ying Zhang, Peter Boncz (CWI, Amsterdam)

Page 8: EDF2012   Peter Boncz - LOD benchmarking SRbench

SRbench: Streaming RDF Benchmark

Traditional Database System vs.

Stream Database System

Ying Zhang, Peter Boncz – Benchmarking Linked Open Data Technology June 7, 2012 @EDF Copenhagen

Persistent

Data

stream

of

queries

stream

of

queries

stream

of

queries

“pull” based

query answering

Persistent Queries

“continuous queries”“push” based

query answering

data

stream

Page 9: EDF2012   Peter Boncz - LOD benchmarking SRbench

Data Streams (1/4): Stock Market

Page 10: EDF2012   Peter Boncz - LOD benchmarking SRbench

Data Streams (2/4): Social Chatter

Ying Zhang, Peter Boncz – Benchmarking Linked Open Data Technology June 7, 2012 @EDF Copenhagen

Detect breaking news

Analyze Marketing campaigns

Page 11: EDF2012   Peter Boncz - LOD benchmarking SRbench

Data Streams (3/4): Car Traffic

Ying Zhang, Peter Boncz – Benchmarking Linked Open Data Technology June 7, 2012 @EDF Copenhagen

monitor positions and speeds of cars detect accidents, traffic jams

Applications: better safety, improved logistics

Page 12: EDF2012   Peter Boncz - LOD benchmarking SRbench

Data Streams (4/4): Tele Health

Monitor health of elderly in their homes

Ying Zhang, Peter Boncz – Benchmarking Linked Open Data Technology June 7, 2012 @EDF Copenhagen

Why?

- Difficult to reach locations

- Make health care more affordable

How?

Who are the users?

Page 13: EDF2012   Peter Boncz - LOD benchmarking SRbench

SRbench: Streaming RDF Benchmark

Streaming RDF data benefits:

apply Linked Open Data (LOD) principles to streaming data

Link streaming data to data on the web (enrichment)

Publish data streams on the web

support (simple) reasoning semantics in stream queries

Richer semantics than relational streaming database systems

Ying Zhang, Peter Boncz – Benchmarking Linked Open Data Technology June 7, 2012 @EDF Copenhagen

Page 14: EDF2012   Peter Boncz - LOD benchmarking SRbench

SRbench: Streaming RDF Benchmark

Streaming RDF data challenges:

Proper benchmark dataset

use real-world datasets from LOD

No standard query language

natural language query definition +

three implementations (SPARQLStream, CQELS, C-SPARQL)

Limited systems support

evaluate on the strRS system (UPM)

Ying Zhang, Peter Boncz – Benchmarking Linked Open Data Technology June 7, 2012 @EDF Copenhagen

Page 15: EDF2012   Peter Boncz - LOD benchmarking SRbench

SRbench: used Datasets

Ying Zhang, Peter Boncz – Benchmarking Linked Open Data Technology June 7, 2012 @EDF Copenhagen

Feature

LinkedSensorData

Observation

ResultData

MeasureData TruthDataInstant

om-owl:result

om-owl:samplingTime

LinkedObservationData

System

LocatedNearRel

om-owl:procedure

Point

om-owl:processLocation

om-owl:hasLocatedNearRel

LinkedSensorMetaData

GeoNames om-owl:hasLocation

Airport

owl:sameAs

DBpedia

Use case: wheather information application

Page 16: EDF2012   Peter Boncz - LOD benchmarking SRbench

SRBench Queries

Page 17: EDF2012   Peter Boncz - LOD benchmarking SRbench

Summary

the importance of

Database System Benchmarking

RDF Database System Benchmarking ( )

Streaming RDF Database System Benchmarking

SRbench

Developed in PlanetData (CWI, UPM)

First dedicated streaming RDF/SPARQL benchmark

SRbench future work:

performance evaluation

results verification (not easy!)

Ying Zhang, Peter Boncz – Benchmarking Linked Open Data Technology June 7, 2012 @EDF Copenhagen

Page 18: EDF2012   Peter Boncz - LOD benchmarking SRbench

Thank You!

June 7, 2012 @EDF CopenhagenYing Zhang, Peter Boncz – Benchmarking Linked Open Data Technology

Questions?

Ying Zhang ([email protected])

Peter Boncz ([email protected])


Recommended