Date post: | 18-Jul-2015 |
Category: |
Documents |
Upload: | rickard-oeberg |
View: | 352 times |
Download: | 0 times |
Neo Technology, Inc
The graph revolution
Intro to graphs and Neo4jRickard Öberg
[email protected]@rickardoberg
söndag 28 april 13
Neo Technology, Inc
Agenda
1. The Graph Mega Trend
2. Graph Databases in the Enterprise
3. Neo4j Customers
4. About Neo Technology
söndag 28 april 13
Neo Technology, Inc
The Graph Mega Trend
Connected Data:Survival of the Fittest
söndag 28 april 13
Neo Technology, Inc
Early Adopters of Graph Tech
söndag 28 april 13
Neo Technology, Inc
Evolution of Web SearchSurvival of the Fittest
Pre-1999WWW Indexing
Atomic Data
söndag 28 april 13
Neo Technology, Inc
Evolution of Web SearchSurvival of the Fittest
Pre-1999WWW Indexing
Atomic Data
1999 - 2012Google Invents
PageRank
Connected Data(Simple)
söndag 28 april 13
Neo Technology, Inc
Evolution of Web SearchSurvival of the Fittest
Pre-1999WWW Indexing
Atomic Data
1999 - 2012Google Invents
PageRank
Connected Data(Simple)
2012-?Google Knowledge Graph, Facebook Graph Search
Connected Data(Rich)
söndag 28 april 13
Neo Technology, Inc
Evolution of Online Job Search
2010-11Resume Scoring
Atomic Data
Survival of the Fittest
söndag 28 april 13
Neo Technology, Inc
Evolution of Online Job Search
2010-11Resume Scoring
Atomic Data
Survival of the Fittest
2011-12Social Job Search
Connected Data
söndag 28 april 13
Neo Technology, Inc
Emergent Graph in Other Industries(Actual Neo4j Graphs)
söndag 28 april 13
Neo Technology, Inc
Content Management & Access Control
Emergent Graph in Other Industries(Actual Neo4j Graphs)
söndag 28 april 13
Neo Technology, Inc
Content Management & Access Control
Emergent Graph in Other Industries(Actual Neo4j Graphs)
Insurance Risk Analysis
söndag 28 april 13
Neo Technology, Inc
Content Management & Access Control
Geo Routing(Public Transport)
Emergent Graph in Other Industries(Actual Neo4j Graphs)
Insurance Risk Analysis
söndag 28 april 13
Neo Technology, Inc
Content Management & Access Control
Network Cell Analysis
Geo Routing(Public Transport)
Emergent Graph in Other Industries(Actual Neo4j Graphs)
Insurance Risk Analysis
söndag 28 april 13
Neo Technology, Inc
Content Management & Access Control
Network Asset Management
Network Cell Analysis
Geo Routing(Public Transport)
Emergent Graph in Other Industries(Actual Neo4j Graphs)
Insurance Risk Analysis
söndag 28 april 13
Neo Technology, Inc
Content Management & Access Control
Network Asset Management
Network Cell Analysis
Geo Routing(Public Transport)
BioInformatics
Emergent Graph in Other Industries(Actual Neo4j Graphs)
Insurance Risk Analysis
söndag 28 april 13
Neo Technology, Inc
Graph Databasesin the Enterprise
söndag 28 april 13
Neo Technology, Inc
• Neo4j is the leading graph database in the world today
• Most widely deployed: 500,000+ downloads
• Largest ecosystem: active forums, code contributions, etc
• Most mature product: in development since 2000, in 24/7 production since 2003
The Neo4j Graph Database
söndag 28 april 13
Neo Technology, Inc
Core Industries & Use Cases:
Web / ISVFinance & Insurance
Datacom / Telecom
Network Management
MDM
Social
Geo
Select Commercial Customers (Community Users Not Included)
Neo4j Adoption: Latest Snapshot*
söndag 28 april 13
Neo Technology, Inc
Core Industries & Use Cases:
Web / ISVFinance & Insurance
Datacom / Telecom
Network Management
MDM
Social
Geo
Core Industries & Use Cases: Web / ISV Finance &
InsuranceCommuni-
cationsLogistics Life
SciencesMedia &
Publishing
Education, Not-for-
Profit
Government, Aerospace,
Gaming, Other
Network Management
MDM
Social
Geo
Authorization & Access Control
Content Management
Recommend-ations
Fraud Detection,
Other
Accenture
Select Commercial Customers (Community Users Not Included)
Neo4j Adoption: Latest Snapshot*
söndag 28 april 13
Neo Technology, Inc
NOSQL in the Enterprise
The Graph Database Story:Selected Customers.
söndag 28 april 13
Neo Technology, Inc
Background
• One of the world’s largest logistics carriers
• Projected to outgrow capacity of old system
• New parcel routing system• Single source of truth for entire network
• B2C & B2B parcel tracking
• Real-time routing: up to 5M parcels per day
Industry: LogisticsUse case: Parcel Routing
söndag 28 april 13
Neo Technology, Inc
Background
• One of the world’s largest logistics carriers
• Projected to outgrow capacity of old system
• New parcel routing system• Single source of truth for entire network
• B2C & B2B parcel tracking
• Real-time routing: up to 5M parcels per day
Business problem• 24x7 availability, year round• Peak loads of 2500+ parcels per second
• Complex and diverse software stack• Need predictable performance & linear
scalability
• Daily changes to logistics network: route from any point, to any point
Industry: LogisticsUse case: Parcel Routing
söndag 28 april 13
Neo Technology, Inc
Background
• One of the world’s largest logistics carriers
• Projected to outgrow capacity of old system
• New parcel routing system• Single source of truth for entire network
• B2C & B2B parcel tracking
• Real-time routing: up to 5M parcels per day
Business problem• 24x7 availability, year round• Peak loads of 2500+ parcels per second
• Complex and diverse software stack• Need predictable performance & linear
scalability
• Daily changes to logistics network: route from any point, to any point
Solution & Benefits• Neo4j provides the ideal domain fit:
• a logistics network is a graph
• Extreme availability & performance with Neo4j clustering
• Hugely simplified queries, vs. relational for complex routing
• Flexible data model can reflect real-world data variance much better than relational
• “Whiteboard friendly” model easy to understand
Industry: LogisticsUse case: Parcel Routing
söndag 28 april 13
Neo Technology, Inc
Industry: CommunicationsUse case: Resource Authorization & Access Control
Background• 10th largest Telco provider in the world, leading in
the Nordics
• Online self-serve system where large business admins manage employee subscriptions and plans
• Mission-critical system whose availability and responsiveness is critical to customer satisfaction
Oslo, Norway
söndag 28 april 13
Neo Technology, Inc
Industry: CommunicationsUse case: Resource Authorization & Access Control
Background• 10th largest Telco provider in the world, leading in
the Nordics
• Online self-serve system where large business admins manage employee subscriptions and plans
• Mission-critical system whose availability and responsiveness is critical to customer satisfaction
Business problem• Degrading relational performance. User login taking
minutes while system retrieved access rights
• Millions of plans, customers, admins, groups. Highly interconnected data set w/massive joins
• Nightly batch workaround solved the performance problem, but meant data was no longer current
• Primary system was Sybase. Batch pre-compute workaround projected to reach 9 hours by 2014: longer than the nightly batch window
Subscription
Account
Customer
Customer
SUBSCRIBED_BY
CONTROLLED_BY
PART_OF
User
USER_ACCESSOslo, Norway
söndag 28 april 13
Neo Technology, Inc
Industry: CommunicationsUse case: Resource Authorization & Access Control
Background• 10th largest Telco provider in the world, leading in
the Nordics
• Online self-serve system where large business admins manage employee subscriptions and plans
• Mission-critical system whose availability and responsiveness is critical to customer satisfaction
Business problem• Degrading relational performance. User login taking
minutes while system retrieved access rights
• Millions of plans, customers, admins, groups. Highly interconnected data set w/massive joins
• Nightly batch workaround solved the performance problem, but meant data was no longer current
• Primary system was Sybase. Batch pre-compute workaround projected to reach 9 hours by 2014: longer than the nightly batch window
Solution & Benefits• Moved authorization functionality from Sybase to Neo4j
• Modeling the resource graph in Neo4j was straightforward, as the domain is inherently a graph
• Able to retire the batch process, and move to real-time responses: measured in milliseconds
• Users able to see fresh data, not yesterday’s snapshot
• Customer retention risks fully mitigated
Subscription
Account
Customer
Customer
SUBSCRIBED_BY
CONTROLLED_BY
PART_OF
User
USER_ACCESSOslo, Norway
söndag 28 april 13
Neo Technology, Inc
Industry: Online Job SearchUse case: Social / Recommendations
• Online jobs and career community, providing anonymized inside information to job seekers
Background
Sausalito, CA
söndag 28 april 13
Neo Technology, Inc
Industry: Online Job SearchUse case: Social / Recommendations
• Online jobs and career community, providing anonymized inside information to job seekers
Business problem• Wanted to leverage known fact that most jobs are
found through personal & professional connections
• Needed to rely on an existing source of social network data. Facebook was the ideal choice.
• End users needed to get instant gratification
• Aiming to have the best job search service, in a very competitive market
Person
Company
KNO
WS
Person
Person
KNOWS
Company
KN
OW
S
WORKS_AT
WORKS_AT
Background
Sausalito, CA
söndag 28 april 13
Neo Technology, Inc
Industry: Online Job SearchUse case: Social / Recommendations
• Online jobs and career community, providing anonymized inside information to job seekers
Business problem• Wanted to leverage known fact that most jobs are
found through personal & professional connections
• Needed to rely on an existing source of social network data. Facebook was the ideal choice.
• End users needed to get instant gratification
• Aiming to have the best job search service, in a very competitive market
Solution & Benefits• First-to-market with a product that let users find jobs
through their network of Facebook friends
• Job recommendations served real-time from Neo4j
• Individual Facebook graphs imported real-time into Neo4j
• Glassdoor now stores > 50% of the entire Facebook social graph
• Neo4j cluster has grown seamlessly, with new instances being brought online as graph size and load have increased
Person
Company
KNO
WS
Person
Person
KNOWS
Company
KN
OW
S
WORKS_AT
WORKS_AT
Background
Sausalito, CA
söndag 28 april 13
Neo Technology, Inc
Top Reasons for Choosing Neo4j
1. Problems with Join performance.
2. Continuously evolving data set (often involves wide and sparse tables)
3. Domain Problem association with graph (e.g. social)
4. The Shape of the Data is naturally a graph
5. Open-ended business requirements necessitating fast, iterative development.
söndag 28 april 13
Neo Technology, Inc
About Neo Technology
söndag 28 april 13
Neo Technology, Inc
Our Mission:
Help the world to make sense of data
söndag 28 april 13
Neo Technology, Inc
Neo TechnologyFact Sheet
• Numbers:
• 50 people / Ten countries / Four continents
• $25M raised
• 30,000+ downloads per month
• Team:
• Selected Leadership: Emil Eifrem (CEO), Johan Svensson (CTO), Rod Johnson (Chairman - SpringSource/VMW), Lars Nordwall (COO - ex SugarCRM, Pentaho), Philip Rathle (Sr Dir of Products - ex Accenture, Embarcadero), Jim Webber (Chief Scientist - ex ThoughtWorks)
• Product:
• Development of Neo4j started in 2000 in Sweden
• Put in 24/7 production in 2003
• Open sourced in 2007
• Today the leading graph database
söndag 28 april 13
Neo Technology, Inc
Thank you!
söndag 28 april 13