Thegraphrevolution

Date post: 18-Jul-2015
Upload: rickard-oeberg
Neo Technology, Inc. The graph revolution: Intro to graphs and Neo4j. Rickard Öberg, [email protected], @rickardoberg. Sunday, 28 April 2013.
Transcript
Page 1: Thegraphrevolution

Neo Technology, Inc

The graph revolution

Intro to graphs and Neo4j

Rickard Öberg

[email protected] @rickardoberg

Sunday, 28 April 2013

Page 2: Thegraphrevolution

Agenda

1. The Graph Mega Trend

2. Graph Databases in the Enterprise

3. Neo4j Customers

4. About Neo Technology

Page 3: Thegraphrevolution

The Graph Mega Trend

Connected Data: Survival of the Fittest

Page 4: Thegraphrevolution

Early Adopters of Graph Tech

Page 7: Thegraphrevolution

Evolution of Web Search: Survival of the Fittest

• Pre-1999: WWW Indexing. Atomic Data
• 1999-2012: Google Invents PageRank. Connected Data (Simple)
• 2012-?: Google Knowledge Graph, Facebook Graph Search. Connected Data (Rich)

Page 9: Thegraphrevolution

Evolution of Online Job Search: Survival of the Fittest

• 2010-11: Resume Scoring. Atomic Data
• 2011-12: Social Job Search. Connected Data

Page 16: Thegraphrevolution

Emergent Graph in Other Industries (Actual Neo4j Graphs)

• Content Management & Access Control
• Network Asset Management
• Network Cell Analysis
• Geo Routing (Public Transport)
• BioInformatics
• Insurance Risk Analysis

Page 17: Thegraphrevolution

Graph Databases in the Enterprise

Page 18: Thegraphrevolution

The Neo4j Graph Database

• Neo4j is the leading graph database in the world today
• Most widely deployed: 500,000+ downloads
• Largest ecosystem: active forums, code contributions, etc.
• Most mature product: in development since 2000, in 24/7 production since 2003

Page 20: Thegraphrevolution

Neo4j Adoption: Latest Snapshot*

* Select Commercial Customers (Community Users Not Included)

Core Industries:
• Web / ISV
• Finance & Insurance
• Communications (Datacom / Telecom)
• Logistics
• Life Sciences
• Media & Publishing
• Education, Not-for-Profit
• Government, Aerospace, Gaming, Other

Use Cases:
• Network Management
• MDM
• Social
• Geo
• Authorization & Access Control
• Content Management
• Recommendations
• Fraud Detection, Other

[Customer logo grid by industry and use case; the only customer name that survives as text is Accenture.]

Page 21: Thegraphrevolution

NOSQL in the Enterprise

The Graph Database Story: Selected Customers

Page 24: Thegraphrevolution

Industry: Logistics
Use case: Parcel Routing

Background
• One of the world’s largest logistics carriers
• Projected to outgrow capacity of old system
• New parcel routing system
• Single source of truth for entire network
• B2C & B2B parcel tracking
• Real-time routing: up to 5M parcels per day

Business problem
• 24x7 availability, year round
• Peak loads of 2500+ parcels per second
• Complex and diverse software stack
• Need predictable performance & linear scalability
• Daily changes to logistics network: route from any point, to any point

Solution & Benefits
• Neo4j provides the ideal domain fit: a logistics network is a graph
• Extreme availability & performance with Neo4j clustering
• Hugely simplified queries vs. relational for complex routing
• Flexible data model can reflect real-world data variance much better than relational
• “Whiteboard friendly” model easy to understand
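The point that a logistics network is a graph can be illustrated with a small sketch. The Python below is not Neo4j code and not the carrier's routing system; the hub names, the leg costs, and the `cheapest_route` helper are all hypothetical. It treats hubs as nodes and transport legs as weighted, directed edges, and uses Dijkstra's algorithm to answer a "route from any point, to any point" query.

```python
import heapq


def cheapest_route(legs, start, goal):
    """Return (total_cost, [hub, ...]) for the cheapest route, or None if unreachable.

    `legs` is an iterable of (source_hub, target_hub, cost) tuples; the cost
    could stand for hours, distance, or money (an assumption of this sketch).
    """
    # Adjacency list: hub -> [(next_hub, cost), ...]
    graph = {}
    for src, dst, cost in legs:
        graph.setdefault(src, []).append((dst, cost))
        graph.setdefault(dst, [])

    queue = [(0, start, [start])]  # (cost so far, current hub, path taken)
    settled = set()                # hubs whose cheapest cost is final
    while queue:
        cost, hub, path = heapq.heappop(queue)
        if hub == goal:
            return cost, path
        if hub in settled:
            continue
        settled.add(hub)
        for nxt, leg_cost in graph.get(hub, []):
            if nxt not in settled:
                heapq.heappush(queue, (cost + leg_cost, nxt, path + [nxt]))
    return None


# Hypothetical network: two depots connected through two hubs.
LEGS = [
    ("Depot A", "Hub North", 4),
    ("Depot A", "Hub South", 2),
    ("Hub South", "Hub North", 1),
    ("Hub North", "Depot B", 5),
    ("Hub South", "Depot B", 8),
]

if __name__ == "__main__":
    print(cheapest_route(LEGS, "Depot A", "Depot B"))
```

With the sample data the cheapest route from Depot A to Depot B goes via Hub South and Hub North at a total cost of 8. A daily network change is just an edit to the list of legs: adding a direct `("Depot A", "Depot B", 6)` leg makes that the cheapest route on the next query, with no schema change.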

Page 27: Thegraphrevolution

Industry: Communications
Use case: Resource Authorization & Access Control
Oslo, Norway

Background
• 10th largest Telco provider in the world, leading in the Nordics
• Online self-serve system where large business admins manage employee subscriptions and plans
• Mission-critical system whose availability and responsiveness are critical to customer satisfaction

Business problem
• Degrading relational performance. User login taking minutes while system retrieved access rights
• Millions of plans, customers, admins, groups. Highly interconnected data set w/massive joins
• Nightly batch workaround solved the performance problem, but meant data was no longer current
• Primary system was Sybase. Batch pre-compute workaround projected to reach 9 hours by 2014: longer than the nightly batch window

Solution & Benefits
• Moved authorization functionality from Sybase to Neo4j
• Modeling the resource graph in Neo4j was straightforward, as the domain is inherently a graph
• Able to retire the batch process, and move to real-time responses: measured in milliseconds
• Users able to see fresh data, not yesterday’s snapshot
• Customer retention risks fully mitigated

[Diagram: resource graph. Nodes: Subscription, Account, Customer (two instances), User. Relationships: SUBSCRIBED_BY, CONTROLLED_BY, PART_OF, USER_ACCESS.]
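To show why access rights are a traversal rather than a chain of joins, here is a minimal in-memory sketch in Python. It is not Neo4j code and not the telco's implementation. The relationship names come from the slide's diagram, but the directions are assumptions (User -USER_ACCESS-> Customer, Customer -PART_OF-> parent Customer, Account -CONTROLLED_BY-> Customer, Subscription -SUBSCRIBED_BY-> Account), and the `ResourceGraph` class, `accessible_subscriptions` function, and sample data are hypothetical.

```python
from collections import defaultdict


class ResourceGraph:
    """Tiny directed graph with typed relationships (illustration only)."""

    def __init__(self):
        self._outgoing = defaultdict(set)  # (rel, source) -> targets
        self._incoming = defaultdict(set)  # (rel, target) -> sources

    def relate(self, source, rel, target):
        self._outgoing[(rel, source)].add(target)
        self._incoming[(rel, target)].add(source)

    def targets(self, source, rel):
        return self._outgoing.get((rel, source), set())

    def sources(self, target, rel):
        return self._incoming.get((rel, target), set())


def accessible_subscriptions(graph, user):
    """Return the subscriptions a user may manage, under the assumed directions."""
    # 1. Customers the user was granted access to directly.
    frontier = list(graph.targets(user, "USER_ACCESS"))
    customers = set()
    # 2. Walk PART_OF downwards: access to a parent covers its sub-customers.
    while frontier:
        customer = frontier.pop()
        if customer in customers:
            continue
        customers.add(customer)
        frontier.extend(graph.sources(customer, "PART_OF"))
    # 3. Accounts controlled by those customers, then subscriptions on them.
    subscriptions = set()
    for customer in customers:
        for account in graph.sources(customer, "CONTROLLED_BY"):
            subscriptions |= graph.sources(account, "SUBSCRIBED_BY")
    return subscriptions


def example_graph():
    """Hypothetical data: a corporate group with one subsidiary."""
    g = ResourceGraph()
    g.relate("alice", "USER_ACCESS", "Acme Group")
    g.relate("bob", "USER_ACCESS", "Acme Norway")
    g.relate("Acme Norway", "PART_OF", "Acme Group")
    g.relate("acct-1", "CONTROLLED_BY", "Acme Group")
    g.relate("acct-2", "CONTROLLED_BY", "Acme Norway")
    g.relate("acct-3", "CONTROLLED_BY", "Other Corp")
    g.relate("sub-100", "SUBSCRIBED_BY", "acct-1")
    g.relate("sub-200", "SUBSCRIBED_BY", "acct-2")
    g.relate("sub-300", "SUBSCRIBED_BY", "acct-3")
    return g


if __name__ == "__main__":
    print(sorted(accessible_subscriptions(example_graph(), "alice")))
```

The work done per login depends on the size of the user's own neighbourhood, not on the total number of customers or subscriptions, which is the property that lets a lookup like this run at request time instead of in a nightly batch.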

Page 30: Thegraphrevolution

Industry: Online Job Search
Use case: Social / Recommendations
Sausalito, CA

Background
• Online jobs and career community, providing anonymized inside information to job seekers

Business problem
• Wanted to leverage known fact that most jobs are found through personal & professional connections
• Needed to rely on an existing source of social network data. Facebook was the ideal choice.
• End users needed to get instant gratification
• Aiming to have the best job search service, in a very competitive market

Solution & Benefits
• First-to-market with a product that let users find jobs through their network of Facebook friends
• Job recommendations served real-time from Neo4j
• Individual Facebook graphs imported real-time into Neo4j
• Glassdoor now stores > 50% of the entire Facebook social graph
• Neo4j cluster has grown seamlessly, with new instances being brought online as graph size and load have increased

[Diagram: Person nodes connected to each other by KNOWS relationships, and Person nodes connected to Company nodes by WORKS_AT relationships.]
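The Person / KNOWS / WORKS_AT model on this slide lends itself to a short illustration. The Python below is a sketch only: it is not Neo4j code and not Glassdoor's recommendation logic, and the function name `companies_via_network`, the `max_hops` parameter, and the sample people and companies are hypothetical. It walks KNOWS relationships breadth-first up to a hop limit and counts how many connections work at each company other than the person's own.

```python
from collections import Counter, defaultdict, deque


def companies_via_network(person, knows, works_at, max_hops=2):
    """Rank companies by how many of `person`'s connections work there.

    `knows` is an iterable of (a, b) pairs treated as undirected;
    `works_at` maps a person to a company.
    """
    neighbours = defaultdict(set)
    for a, b in knows:
        neighbours[a].add(b)
        neighbours[b].add(a)

    # Breadth-first search, recording the hop count at which each person is reached.
    hops = {person: 0}
    queue = deque([person])
    while queue:
        current = queue.popleft()
        if hops[current] == max_hops:
            continue  # do not expand beyond the hop limit
        for friend in neighbours[current]:
            if friend not in hops:
                hops[friend] = hops[current] + 1
                queue.append(friend)

    own_company = works_at.get(person)
    counts = Counter(
        works_at[p]
        for p in hops
        if p != person and p in works_at and works_at[p] != own_company
    )
    return counts.most_common()


# Hypothetical sample data.
KNOWS = [("ann", "bo"), ("ann", "cy"), ("bo", "dee"), ("cy", "eve"), ("eve", "fay")]
WORKS_AT = {
    "ann": "Acme", "bo": "Initech", "cy": "Globex",
    "dee": "Initech", "eve": "Acme", "fay": "Umbrella",
}

if __name__ == "__main__":
    print(companies_via_network("ann", KNOWS, WORKS_AT))
```

For `ann` with the default two hops, Initech ranks first with two connections and Globex second with one; `fay` is three hops away and is ignored until `max_hops` is raised.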

Page 31: Thegraphrevolution

Top Reasons for Choosing Neo4j

1. Problems with join performance

2. Continuously evolving data set (often involves wide and sparse tables)

3. Problem domain is associated with graphs (e.g. social)

4. The shape of the data is naturally a graph

5. Open-ended business requirements necessitating fast, iterative development

Page 32: Thegraphrevolution

About Neo Technology

Page 33: Thegraphrevolution

Our Mission:

Help the world to make sense of data

Page 34: Thegraphrevolution

Neo Technology Fact Sheet

• Numbers:
  • 50 people / Ten countries / Four continents
  • $25M raised
  • 30,000+ downloads per month

• Team:
  • Selected Leadership: Emil Eifrem (CEO), Johan Svensson (CTO), Rod Johnson (Chairman - SpringSource/VMW), Lars Nordwall (COO - ex SugarCRM, Pentaho), Philip Rathle (Sr Dir of Products - ex Accenture, Embarcadero), Jim Webber (Chief Scientist - ex ThoughtWorks)

• Product:
  • Development of Neo4j started in 2000 in Sweden
  • Put in 24/7 production in 2003
  • Open sourced in 2007
  • Today the leading graph database

Page 35: Thegraphrevolution

Thank you!
