+ All Categories
Home > Technology > Building a semantic chemistry platform with the royal society of chemistry

Building a semantic chemistry platform with the royal society of chemistry

Date post: 10-May-2015
Category:
Upload: valery-tkachenko
View: 309 times
Download: 0 times
Share this document with a friend
Description:
We live in an exponentially expanding world of “big data”. Social networks, global portals and other distributed systems have been attempting to deal with the problem for a few years now. Scientific applications are commonly lagging behind the mainstream trends due to the complexity of the scientific domain. The Royal Society of Chemistry is building the Global Chemistry Network connecting a variety of resources both in-house and external, bridging gaps and advancing the chemical sciences. One of the main issues connected to the world of big data is the ease of navigation and comprehensiveness of the search capabilities. This is where the approach of the semantic web meets the world of big data. We will present our approaches in building a global federated chemistry platform connecting multiple domains of chemistry using semantic web technologies.
Popular Tags:
44
Building a semantic chemistry platform with the Royal Society of Chemistry Valery Tkachenko, Colin Batchelor, Pete Ken Karapetyan, Alexey Pshenichnov, Ant Williams ACS 247th National Meeting Dallas, TX
Transcript
Page 1: Building a semantic chemistry platform with the royal society of chemistry

Building a semantic chemistry platformwith the Royal Society of Chemistry

Valery Tkachenko, Colin Batchelor, Peter Corbett,

Ken Karapetyan, Alexey Pshenichnov, Antony Williams

ACS 247th National Meeting

Dallas, TX

March 16th 2014

Page 2: Building a semantic chemistry platform with the royal society of chemistry

Big Data World and Chemistry

ChemSpider

RSC Archive

RSC Chemistry Platform

Data quality

Global Chemistry Network

Page 3: Building a semantic chemistry platform with the royal society of chemistry
Page 4: Building a semantic chemistry platform with the royal society of chemistry
Page 6: Building a semantic chemistry platform with the royal society of chemistry

Chemical space - 1060

Page 8: Building a semantic chemistry platform with the royal society of chemistry

Automated learning

Page 9: Building a semantic chemistry platform with the royal society of chemistry

Managing Big Data

Page 10: Building a semantic chemistry platform with the royal society of chemistry

Big Data World and Chemistry

ChemSpider

RSC Archive

RSC Chemistry Platform

Data quality

Global Chemistry Network

Page 11: Building a semantic chemistry platform with the royal society of chemistry

• ~30 million chemicals and growing

• Data sourced from >500 different sources

• Crowdsourced curation and annotation

• Ongoing deposition of data from our journals and our collaborators

• A structure centric hub for web-searching

Page 12: Building a semantic chemistry platform with the royal society of chemistry

ChemSpider

Page 13: Building a semantic chemistry platform with the royal society of chemistry

ChemSpider - properties

Page 14: Building a semantic chemistry platform with the royal society of chemistry

ChemSpider - references

Page 15: Building a semantic chemistry platform with the royal society of chemistry

ChemSpider - classification

Page 16: Building a semantic chemistry platform with the royal society of chemistry

Share in a “proper way”

Page 17: Building a semantic chemistry platform with the royal society of chemistry

Big Data World and Chemistry

ChemSpider

RSC Archive

RSC Chemistry Platform

Data quality

Global Chemistry Network

Page 18: Building a semantic chemistry platform with the royal society of chemistry

RSC Archive – since 1841

Page 19: Building a semantic chemistry platform with the royal society of chemistry

It is so difficult to navigate…

What’s the structure?What’s the structure?

Are they in our file?

Are they in our file?

What’s similar?What’s

similar?

What’s the target?

What’s the target?Pharmacology

data?Pharmacology

data?

Known Pathways?

Known Pathways?

Working On Now?

Working On Now?Connections

to disease?Connections to disease?

Expressed in right cell type?Expressed in

right cell type?

Competitors?Competitors?

IP?IP?

Page 20: Building a semantic chemistry platform with the royal society of chemistry

Digitally Enabling RSC Archive

Page 21: Building a semantic chemistry platform with the royal society of chemistry

CSSP Article Example

Compounds

Reaction

Analytical Data

Text and References

Page 22: Building a semantic chemistry platform with the royal society of chemistry

Big Data World and Chemistry

ChemSpider

RSC Archive

RSC Chemistry Platform

Data quality

Global Chemistry Network

Page 23: Building a semantic chemistry platform with the royal society of chemistry

RSC Chemistry Platform

ChemSpider Compounds

ChemSpider Reactions

ChemSpider Spectra

ChemSpider Crystals

ChemSpider Materials

ChemSpider Assays

ChemSpider Algorithms

Page 24: Building a semantic chemistry platform with the royal society of chemistry

Data Pipeline

Deposition Gateway

Staging databases

Compounds Reactions Spectra Crystals

Materials

Compounds Module

Spectra Module

Reactions Module

Materials Module

TextminingModule

�͙Module

Web UI for unified depositions

DropBox, Google Drive, SkyDrive, etc

ELNs, templated data input

Documents

API, FTP, etc

Raw data

Val

idat

ed

data

Staging databases

All databases are sliced by data sources/data collections and have simple security model where each data slice/source is private, public or embargoed

Etc

Experiments

Research

Page 25: Building a semantic chemistry platform with the royal society of chemistry

Compounds Database

Page 26: Building a semantic chemistry platform with the royal society of chemistry

Reactions Database

• ChemSpider Synthetic Pages

• Methods in Organic Synthesis

• Catalysts and Catalyzed Reactions

• USPTO

Page 27: Building a semantic chemistry platform with the royal society of chemistry

Reactions Database

Page 28: Building a semantic chemistry platform with the royal society of chemistry

Analytical Data Database

Page 29: Building a semantic chemistry platform with the royal society of chemistry

Data Pipeline

Compounds Reactions Spectra Crystals Documents

CompoundsAPI

ReactionsAPI

SpectraAPI

CrystalsAPI

DocumentsAPI

CompoundsWidgets

ReactionsWidgets

SpectraWidgets

CrystalsWidgets

DocumentsWidgets

Data tier

Data access tier

User interface

components tier

Analytical Laboratory application

User interface tier

(examples) Electronic Laboratory Notebook

Paid 3rd party integrations (various platforms – SharePoint, Google, etc)

Chemical Inventory application

Page 30: Building a semantic chemistry platform with the royal society of chemistry

Big Data World and Chemistry

ChemSpider

RSC Archive

RSC Chemistry Platform

Data quality

Global Chemistry Network

Page 31: Building a semantic chemistry platform with the royal society of chemistry

Data quality

– Robochemistry

– Proliferation of errors in public and private databases

– Automated quality control system

– Crowdsourcing

Page 32: Building a semantic chemistry platform with the royal society of chemistry

Typical public databases errors

J. Brechner, IUPACGraphical Representation of stereochem. configurationsSection: ST-1.1.10

DB06287

Page 33: Building a semantic chemistry platform with the royal society of chemistry

Chemistry Validation and Standardization Platform

Page 34: Building a semantic chemistry platform with the royal society of chemistry

Crowdsourcing and AltMetrics

Page 35: Building a semantic chemistry platform with the royal society of chemistry

RSC/Rewards and Recognition

Congratulations! Your 1st CSSP article has been published. Philosopher Lao Tzu said “A journey of a thousand miles begins with a single step”. In the same way we hope that this will be the first of many submissions that you make to CSSP.

The First Step badge is awarded when a user submits (& has published) their 1st CSSP article.

Page 36: Building a semantic chemistry platform with the royal society of chemistry

Big Data World and Chemistry

ChemSpider

RSC Archive

RSC Chemistry Platform

Data quality

Global Chemistry Network

Page 37: Building a semantic chemistry platform with the royal society of chemistry

We are a part of a much larger world

Page 38: Building a semantic chemistry platform with the royal society of chemistry

Research data network

University 1

Data Hub

Workstations

University 2

Data Hub

Workstations

Company 3

Data Hub

Workstations

Data Repositoryindexed storage

Data Repository provideddata storage

Chemically intelligent services

Indexes

Data

External clients Publishers

Scientists Funding bodies

Page 39: Building a semantic chemistry platform with the royal society of chemistry

ChemSpider APIs

Page 40: Building a semantic chemistry platform with the royal society of chemistry

National Chemistry Database

Page 41: Building a semantic chemistry platform with the royal society of chemistry

http://www.openphacts.org

Open PHACTS is an Innovative Medicines Initiative (IMI) project, aiming to reduce the barriers to

drug discovery in industry, academia and for small

businesses.

Semantic web is one of the corner stones

Page 42: Building a semantic chemistry platform with the royal society of chemistry
Page 43: Building a semantic chemistry platform with the royal society of chemistry

OSDD

Page 44: Building a semantic chemistry platform with the royal society of chemistry

Thank you

Email: [email protected]

Slides: http://www.slideshare.net/valerytkachenko16


Recommended