+ All Categories
Home > Technology > The Great Promise of Online Data for Chemistry and the Life Sciences

The Great Promise of Online Data for Chemistry and the Life Sciences

Date post: 21-Nov-2014
Category:
Upload: antony-williams-chemconnector
View: 2,698 times
Download: 1 times
Share this document with a friend
Description:
This is the presentation I gave at the Silverchair Colloquium at Keswick Hall in Charlottesville. This presentation
47
The Great Promise of Online Data for Chemistry and the Life Sciences Antony J Williams Silverchair Colloquium 2012
Transcript
Page 1: The Great Promise of Online Data for Chemistry and the Life Sciences

The Great Promise of Online Data for Chemistry and the Life Sciences

Antony J WilliamsSilverchair Colloquium 2012

Page 2: The Great Promise of Online Data for Chemistry and the Life Sciences

READ FAST – IT’S HAPPENING NOW

20 minutes, >40 slides

Disruption Can be Cheap, Fast and Unexpectedly

Successful

Page 3: The Great Promise of Online Data for Chemistry and the Life Sciences

Online Chemistry Databases in 2007

Page 4: The Great Promise of Online Data for Chemistry and the Life Sciences

A search gave LOTS of “info”..What is Yohimbine?

Page 5: The Great Promise of Online Data for Chemistry and the Life Sciences

For chemists…try filtering!

Page 6: The Great Promise of Online Data for Chemistry and the Life Sciences

Why not Index the web of chemistry?

Build a search engine for chemistry

Index all public domain chemicals and link

Build a structure searchable web

Crowdsource new chemistry from the community

Crowdsource curation and annotation

Page 7: The Great Promise of Online Data for Chemistry and the Life Sciences

Create a structure-centric hub

Page 8: The Great Promise of Online Data for Chemistry and the Life Sciences
Page 9: The Great Promise of Online Data for Chemistry and the Life Sciences

Answering Real Questions

Questions a chemist might ask… What is the melting point of n-heptanol? What is the chemical structure of Xanax? Chemically, what is phenolphthalein? What are the stereocenters of cholesterol? Where can I find publications about xylene? What are the different trade names for Ketoconazole? What is the NMR spectrum of Aspirin? What are the safety handling issues for Thymol Blue?

Page 10: The Great Promise of Online Data for Chemistry and the Life Sciences

The World of Online Chemistry Safety data Toxicity data Blogs and Wikis Property databases Experimental results Scientific publications Compound aggregators Open Notebook Science Metabolic pathway databases Encyclopedic articles (Wikipedia)

Page 11: The Great Promise of Online Data for Chemistry and the Life Sciences

Linked Data for Life Sciences growing…

Page 12: The Great Promise of Online Data for Chemistry and the Life Sciences

Solve Real World Problems

Provide programmable interface against content Provide a chemistry database tuned to integrators

Page 13: The Great Promise of Online Data for Chemistry and the Life Sciences

RSC and ChemSpider – May 2009

Page 14: The Great Promise of Online Data for Chemistry and the Life Sciences

Why RSC acquired ChemSpider

Commitment to serve the community

Bring cheminformatics expertise in-house

Add additional data to publications

Potential freemium model – web services, data

Because data is critical to science

Page 15: The Great Promise of Online Data for Chemistry and the Life Sciences

Making sense of data is overwhelming

Page 16: The Great Promise of Online Data for Chemistry and the Life Sciences

Publications are Hosts to Data

Page 17: The Great Promise of Online Data for Chemistry and the Life Sciences

Data has value, is Free, is Open

Data cannot be copyrighted. A particular expression of data, such as a chart or table in a publication, can be.

Data licensing is being dealt with and openness encouraged

Research data mandates are starting…

Who will manage the integration and curation and keep the access FREE!

Page 18: The Great Promise of Online Data for Chemistry and the Life Sciences

Tell me about Yohimbine…

Page 19: The Great Promise of Online Data for Chemistry and the Life Sciences

Of course it is out there…

Page 21: The Great Promise of Online Data for Chemistry and the Life Sciences

Tell me more…but…

Where can I find the electronic structure? Papers/Patents about Yohimbine? What are the side effects of Yohimbine? Where can I order Yohimbine? What are the physicochemical properties? What are the associated metabolic pathways? Different synonyms of Yohimbine? Are there side effects with Yohimbine?

ChemSpider links all of this information and more

Page 23: The Great Promise of Online Data for Chemistry and the Life Sciences

RSC Databases are Integrated

Page 24: The Great Promise of Online Data for Chemistry and the Life Sciences

RSC Journals are Integrated

Page 25: The Great Promise of Online Data for Chemistry and the Life Sciences

Patents are Linked

Page 26: The Great Promise of Online Data for Chemistry and the Life Sciences

Google Books are Integrated

Page 27: The Great Promise of Online Data for Chemistry and the Life Sciences

And so are…

Chemical vendors Safety and Toxicity information Experimental and Predicted properties Analytical data Images and Movies

And all for free…

Page 28: The Great Promise of Online Data for Chemistry and the Life Sciences

And all “mobile”

Page 29: The Great Promise of Online Data for Chemistry and the Life Sciences

Not only compounds but syntheses

Page 30: The Great Promise of Online Data for Chemistry and the Life Sciences

And analytical data…

Page 31: The Great Promise of Online Data for Chemistry and the Life Sciences

The world can take and contribute

Scientists can deposit their data

They can annotate and curate

They can download data

They can embed data in the social network

They can integrate and connect

Page 32: The Great Promise of Online Data for Chemistry and the Life Sciences

Integrate to electronic lab notebooks

Page 33: The Great Promise of Online Data for Chemistry and the Life Sciences

Integrate to electronic lab notebooks

Page 34: The Great Promise of Online Data for Chemistry and the Life Sciences

Integrate to instruments and software

Primary analytical instrumentation vendors integrate

Agilent, Bruker, Thermo, Waters

Cheminformatics vendors link to ChemSpider

Accelrys, ACD/Labs, ChemAxon, iChemLabs

Page 35: The Great Promise of Online Data for Chemistry and the Life Sciences

Publications are a summary of work

Scientific publications are a summary of work Is all work reported? How much science is lost to pruning? What of value sits in notebooks and is lost?

How much data is lost? How many compounds never reported? How many syntheses fail or succeed? How many characterization measurements?

Page 36: The Great Promise of Online Data for Chemistry and the Life Sciences

What if we could capture it all?

Page 37: The Great Promise of Online Data for Chemistry and the Life Sciences

Start with data in publications

Page 38: The Great Promise of Online Data for Chemistry and the Life Sciences

But in the time of Big Data…it’s linked!

Page 39: The Great Promise of Online Data for Chemistry and the Life Sciences

ONE example – data for life sciences

What’s the structure?What’s the structure?

Are they in our file?

Are they in our file?

What’s similar?What’s

similar?

What’s the target?

What’s the target?Pharmacology

data?Pharmacology

data?

Known Pathways?

Known Pathways?

Working On Now?

Working On Now?Connections

to disease?Connections to disease?

Expressed in right cell type?Expressed in

right cell type?

Competitors?Competitors?

IP?IP?

Page 40: The Great Promise of Online Data for Chemistry and the Life Sciences

Crowdsourcing across drug discovery Open PHACTS : partnership between European

Community and European Pharma Companies 22 partners, 8 pharmaceutical companies, 3

biotechs working together for 3 years

Freely accessible for knowledge discovery and verification. Data on chemistry and biology Pharmacological profiles Proprietary and public data sources.

Page 41: The Great Promise of Online Data for Chemistry and the Life Sciences
Page 42: The Great Promise of Online Data for Chemistry and the Life Sciences

All that glisters is not gold…

Page 43: The Great Promise of Online Data for Chemistry and the Life Sciences

Crowdsourced Assertions The future of publishing will include generation and

consumption of “nanopublications”

http://www.nanopub.org/

Page 44: The Great Promise of Online Data for Chemistry and the Life Sciences

Nanopublications??

Page 45: The Great Promise of Online Data for Chemistry and the Life Sciences

So what’s the business model?

Decisions are based on data

Publications encapsulate, reference and link data

More data is free and open. More services and APIS allow access – free or for fee. Ask Google

The large-scale licensed content business model is at risk without interfaces to integrate and mine

Page 46: The Great Promise of Online Data for Chemistry and the Life Sciences

Acknowledgments

The RSC ChemSpider team

Our users, our depositors, our curators

GGA Software Services, OpenEye, ACD/Labs and a lot of Open Source code!

And Al Gore for supporting the internethttp://en.wikipedia.org/wiki/

Al_Gore_and_information_technology

Page 47: The Great Promise of Online Data for Chemistry and the Life Sciences

Thank you

Email: [email protected] Twitter: ChemConnectorPersonal Blog: www.chemconnector.com SLIDES: www.slideshare.net/AntonyWilliams


Recommended