+ All Categories
Home > Technology > Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Date post: 11-May-2015
Category:
Upload: nuxeo
View: 2,438 times
Download: 2 times
Share this document with a friend
Description:
Work on integrating semantic technologies developed in several R&D projects is now progressing at full speed. Expect to see creative new uses of semantic technologies in Nuxeo open source content management products in 2011!
Popular Tags:
47
Nov. 23 2010 - S. Fermigier & O. Grisel, Nuxeo Semantic ECM @ Nuxeo A progress report - Nov. 2010
Transcript
Page 1: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Nov. 23 2010 - S. Fermigier & O. Grisel, Nuxeo

Semantic ECM @ NuxeoA progress report - Nov. 2010

Page 2: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Agenda

From ECM to Semantic ECM

Scribo & IKS

Fise & Apache Stanbol

Nuxeo Integration

Roadmap for 2011

Page 3: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Nuxeo: from ECM...

Page 4: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Nuxeo: an open source ECM vendor

Our Focus is Enterprise Content Management

ECM as a Platform for Content Applications

Open Source as Efficient Development Model

Modern architecture for 21st Century business

“Lean, mobile, social, interoperable”

A Social Marketplace in action

Innovation driven by community of customers, partners, and our core developers

Page 5: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

5

Nuxeo ECM - From Platform to Products

PlatformContent

Infrastructure

Nuxeo Enterprise PlatformComplete set of components covering all aspects of ECM

Nuxeo CoreLightweight, scalable, embeddable content repository

HorizontalPackages

DocumentManagement

Digital AssetManagement

CaseManagement

Framework

StructuredDocument

Server

ContentAggregator

Business Solutions

Correspondence Management

Contracts Management

Invoice ProcessingRecords

Management

Construction Media Government Life Sciences

Page 6: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Major Customers

Page 7: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

... to Semantic ECM

Page 8: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Picture source: http://www.flickr.com/photos/pixelydixel/

Page 9: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”

Linked Online Data in 2007

Page 10: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”

2008

Page 11: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”

2009

Page 12: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

“Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/”

2010

Page 13: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Good for Enterprise apps too!

Diagram source: http://www.w3.org/2007/Talks/0130-sb-W3CTechSemWeb/

Page 14: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Key Enablers

Open Data and Linked Online Data

Advances in automatic content analysis (linguistics, image processing) and machine learning

Classical logic and classical AI

Computing power (Moore’s law + MapReduce)

Page 15: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Let’s put them to use!

The technologies and data are available,

Page 16: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Semantic ECM

Page 17: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Content

Text

Image

Sound

Video

Semantic ECM

Page 18: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Content

Text

Image

Sound

Video

Meaning

Metadata

Relations

EntitiesTags

Reasoning

Semantic ECM

Page 19: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Content

Text

Image

Sound

Video

Meaning

Metadata

Relations

EntitiesTags

Reasoning

Semantic ECM

Page 20: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Goals for Semantic ECM

Repurpose existing content

Improve search and collaboration

Make information contextual

Extract and use information from your content

Make your content smarter!

Page 21: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Challenges

Extract meaning from content

Enrich content with knowledge

Enhance interaction with content thanks to added meaning

Page 22: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Content Stack vs. Knowledge Cake

Architectural Challenge

Page 23: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Business valuefrom semantic ECM

Efficiency gains: 20% to 90% (ex: in search, collaboration)

Effectiveness gains: better returns from your assets (ex: news and images from AFP)

Strategic edge: growth, value capture, new services, gain unfair strategic advantage (ex: vertical ontologies for CEVAs / CCAs)

Page 24: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

SCRIBO and IKS

Page 25: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Project under the french FUI program, with 9 partners, and a budget of 4.7 M€

Goal: to develop algorithms and collaborative tools for extracting knowledge from unstructured documents and images

Started in 2008, finishing in Dec. 2010, with results already integrated as a Nuxeo plugin

Page 26: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

European project under the FP7, with 13 partners (6 SMEs) and a 8.5 M€ budget

Goal: create a semantic software “stack” that will be used by CMS vendors to add semantic features to their products

Started in Jan. 2009, will last until Dec. 2012

First tangible result: FISE, already integrated in a Nuxeo plugin

Page 27: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Linking Semantic EntitiesApache Stanbol - Nuxeo integration

Page 28: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Demo time!

25

Screencast online at http://blogs.nuxeo.com/dev

Page 29: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

How does this work?

26

Page 30: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

27

Page 31: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

28

• Open Source Semantic Engine

• HTTP Services

• For content driven applications

• OSGi: loosely coupled components

• Analysis Engines

• Knowledge RDF vocabularies

Page 32: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

What is a semantic engine?

29

• Unstructured content => Knowledge

• Language guessing

• Topic classification (Business, Sports, Media, ...)

• Named Entities extraction and linking

• Relationships and properties extraction

Page 33: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

30

Page 34: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

31

Page 35: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

32

curl -X POST \ -H "Accept: application/json" \ -H "Content-type: text/plain" \ --data "John Smith works at Smith Consulting in Paris." \ http://fise.demo.nuxeo.com/engines

{ "urn:enhancement-1564680b-861c-df6f-fdf9-d34a75d68dfe": { "http://fise.iks-project.eu/ontology/selected-text": [ { "datatype": "http://www.w3.org/2001/XMLSchema#string", "type": "literal", "value": "Paris" } ], "http://fise.iks-project.eu/ontology/selection-context": [ { "datatype": "http://www.w3.org/2001/XMLSchema#string", "type": "literal", "value": "John Smith works at Smith Consulting Paris." } ], "http://purl.org/dc/terms/type": [ { "type": "uri", "value": "http://dbpedia.org/ontology/Place" } ] }, …

Page 36: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

33

Page 37: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

34

Page 38: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

35

= fise +

fast Linked Data local index +

semantic rule engine+

more ?

Page 39: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Apache Stanbol / Nuxeo integration

36

Page 40: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Local IT infrastructure (LAN) 37

Nuxeo DM

addon

1

Apache Stanbol

2

Engine 1

Engine 2

Engine 3

3

DBpedia

Freebase

GeonamesLDAP

Page 41: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Roadmap 2010-2011

Page 42: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Nuxeo DM Improvement

Automated document categorization (language, subject, geo coverage based on fixed lists)

Semantic entities detection and linking

Available as add-ons on the Nuxeo Marketplace in December!

Page 43: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Nuxeo DM:Upcoming Work

Stanbol + Scribo integration

Multilingual support

Extraction of relations between entities

Topic classification and linking to external taxonomies

Page 44: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Nuxeo DAM

Clustering pictures by similarity

Faces detection

Faces recognition using contextual information

Speech to text integration for full-text search on audio and video files

Page 45: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Nuxeo CMF / Correspondence

Document OCR and structure extraction

Scanned document categorization (ex: invoice vs. contract vs. claim...) and routing

Structured field extraction with configurable document masks

Page 46: Nuxeo Semantic ECM: from Scribo and Stanbol to valuable applications

Questions?


Recommended