Date post: | 07-May-2015 |
Category: |
Technology |
Upload: | royal-society-of-chemistry |
View: | 684 times |
Download: | 1 times |
Linking chemistry: wider lessons for how we publish research
Anomalocarisfloreslivroselua.files.wordpress.com
What have we gained?
Very interesting Very clever
We’re all innovating into niches, growing extra legs, fins, ears...
news.sky.com
Who has this been for?
US!
All good publicity, shows we’ll all good at our jobs, pats on the back all round
Have any of these widgets solved any more research problems? Saved time? Money?
And how much are they used? Really?
But helps become flexible to try out new things, test models, approaches, technologies
OK, what do we think? 2007 version
Publishing methods are changing Reading methods are changing We know what our readers want We know how the researchers work
How many did we get right?
RSC Prospect – semantic publishing
What were we trying to improve? Discoverability Use Understanding Linking
Steering how the chemical sciences develop on the web...
What do we learn with Prospect?
This is probably the way to go How do we cover all subjects? Scale-up in manual QA Scale-up during huge growth and scope
How to use all that real chemistry data Pump prime to change what
we ask from authors Is our vision the day-glo article?
“Free headache for every user”
blogs.guardian.co.uk
Do we know what’s going on?
Vision …
http://www.optimumengineering.com.au/cyclone_silos/cyclone_silo_4.jpg
Market Research
Students
“Starting point for finding information is Wikipedia” “I’d like to use images from articles” “Colleagues tell me about papers so I don’t really
need to go looking for them”
Not very sophisticated outlook: have access to expensive services and content but don’t use.
Academic Faculty
“Too busy to spend much time online” “My students tell me about everything that I need
to know” “I use email alerts to find content”
Industrial Researchers
“Need to find information any way we can” “Don’t have access to expensive systems”
Most ingenious and sophisticated users.
Librarians
“Don’t change any URLs” “Users are confused when they don’t have access
to all the content” “When things go wrong, we have to sort it out” “I need my customers to know that the library has
paid for this content”
Key Findings
Key Findings
So what have we built?
Simple interface to fulfill the main use – come in, find a paper
RSC Publishing platform
All our journal, books, databases XML, in Marklogic. Normalised. A single query interface.
The article of the future
PDFs
Dumbing down can be good
Lowest common denominator PDF Word & ChemDraw JPG
Enables participation, exchange & comparison familiarity for the user
Authors?
number
tech ability/interest of authors & readers
Rzepa,H.
• Phil Bourne
• Lynn Fink
Source code and binary:http://research.microsoft.com/ontology/
Relationships: Ontology browser
Relationships: Ontology browser
Intent: Term recognition & disambiguation based on OBO or OWL formats
Intent: Term recognition & disambiguation based on OBO or OWL formats
John WilbanksServices: Ontology download web service
Ontology Add-in for Word 2007
<?xml version="1.0" ?><cml version="3" convention="org-synth-report" xmlns="http://www.xml-cml.org/schema"> <molecule id="m1"> <atomArray> <atom id="a1" elementType="C" x2="-2.9149999618530273" y2="0.7699999809265137" /> <atom id="a2" elementType="C" x2="-1.5813208400249916" y2="1.5399999809265137" /> <atom id="a3" elementType="O" x2="-0.24764171819695613" y2="0.7699999809265134" /> <atom id="a4" elementType="O" x2="-1.5813208400249912" y2="3.0799999809265137" /> <atom id="a5" elementType="H" x2="-4.248679083681063" y2="1.5399999809265137" /> <atom id="a6" elementType="H" x2="-2.914999961853028" y2="-0.7700000190734864" /> <atom id="a7" elementType="H" x2="-4.248679083681063" y2="-1.907348645691087E-8" /> <atom id="a8" elementType="H" x2="1.0860374036310796" y2="1.5399999809265132" /> </atomArray> <bondArray> <bond atomRefs2="a1 a2" order="1" /> <bond atomRefs2="a2 a3" order="1" /> <bond atomRefs2="a2 a4" order="2" /> <bond atomRefs2="a1 a5" order="1" /> <bond atomRefs2="a1 a6" order="1" /> <bond atomRefs2="a1 a7" order="1" /> <bond atomRefs2="a3 a8" order="1" /> </bondArray> </molecule></cml>
<?xml version="1.0" ?><cml version="3" convention="org-synth-report" xmlns="http://www.xml-cml.org/schema"> <molecule id="m1"> <atomArray> <atom id="a1" elementType="C" x2="-2.9149999618530273" y2="0.7699999809265137" /> <atom id="a2" elementType="C" x2="-1.5813208400249916" y2="1.5399999809265137" /> <atom id="a3" elementType="O" x2="-0.24764171819695613" y2="0.7699999809265134" /> <atom id="a4" elementType="O" x2="-1.5813208400249912" y2="3.0799999809265137" /> <atom id="a5" elementType="H" x2="-4.248679083681063" y2="1.5399999809265137" /> <atom id="a6" elementType="H" x2="-2.914999961853028" y2="-0.7700000190734864" /> <atom id="a7" elementType="H" x2="-4.248679083681063" y2="-1.907348645691087E-8" /> <atom id="a8" elementType="H" x2="1.0860374036310796" y2="1.5399999809265132" /> </atomArray> <bondArray> <bond atomRefs2="a1 a2" order="1" /> <bond atomRefs2="a2 a3" order="1" /> <bond atomRefs2="a2 a4" order="2" /> <bond atomRefs2="a1 a5" order="1" /> <bond atomRefs2="a1 a6" order="1" /> <bond atomRefs2="a1 a7" order="1" /> <bond atomRefs2="a3 a8" order="1" /> </bondArray> </molecule></cml>
Relationships: Navigate and link referenced chemistry
Relationships: Navigate and link referenced chemistry
Available soon:http://research.microsoft.com/chem4word/
Data: Semantics stored in Chemistry Markup Language
Data: Semantics stored in Chemistry Markup Language
Intent: Recognizes chemical dictionary and ontology terms
Intent: Recognizes chemical dictionary and ontology terms
Author and edit 1D and 2D chemistry. Author and edit 1D and 2D chemistry.
Intelligence: Verifies validity of authored chemistry
Intelligence: Verifies validity of authored chemistry
Authoring: Chem4Word – Chemistry Drawing in Word
We forget our place
We assume our readers spend their lives in our platforms
www.visitingdc.com
media.obsessable.com
As few interfaces as possible
What do humans want?
What do computers want?
Web services
flickr.com/photos/microcosmos
What is ChemSpider?
+ +
Free to use
Quick history Launched 2007 with a vision Bedroom project for 2 years Acquired by RSC in 2009
Since acquisition Infrastructure and development resource Community and user support added Integration with RSC Publishing resources Engagement with RSC’s community of influence
ChemSpider as aggregator Public and non-public compound sets Publications Domain resources – DailyMed, Wikipedia Personal data sets SureChem patents Multimedia additions, blogs, Open Notebook Sci
Data normalised, errors rejected
ChemSpider as chemical search
Single search across 400 data sources Text, structure and substructure search tools
Simple and very advanced Validated names for external search expansion
Google Scholar, PubMed, RSC, Google patents Links to original sources Mobile version
ChemSpider as wikiAdds quality and quantity
Users can comment, others action Registered users can curate
names, add links, spectra, data and multimedia resources
Depositors can load data sets, curated on load
Curators and Master Curators approve additions and decide what is the correct structure
ChemSpider as resource Download and reuse structures Download and reuse search structure set 2D and 3D images Embed and link - never draw a structure again Predicted properties
Physical, Environmental, Biological Spectra, crystal structures ChemSpider SyntheticPages Spectral game
ChemSpider as compound hub Extensive web services
Utility conversion services MassSpec services Name + structure lookups Prediction services
Validated name:structure pairs provides public disambiguation service
InChI Resolver – links InChIKeys to compounds: MYPYJXKWCTUITO-LYRMYLQWSA-N
For the RSC (more selfishly) Critical mass for a chemistry portal
Support for semantic developments Proven user need to search for compounds Shop window for our other publications
Ability to test other business models (some of which are a little scary)
Engagement with new publishing methods
ChemSpider as chemistry portal
What will the RSC be doing?
Facilitating scientific communication
Publish articles – with the semantics
But what else can be part of this?
Partial publication syntheses, blogs, unpublished data
How does all this sit alongside traditional articles?
Micropublication?
Nanopublication?
Slide from Jan Velterop
Exploring standards and extraction of semantic assertions
Pistoia Alliance“An initiative to provide an open foundation of data
standards, ontologies and web-services to streamline the Pharmaceutical Drug Discovery workflow”
Semantic Enrichment of the Scientific Literature (SESL) Oct09-Oct10
Pistoia Alliance-funded EBI Elsevier, NPG, OUP, RSC
Do we know what the answer is?
Probably not, but RSC Publishing & ChemSpider allows us to cover pretty much everything. Serve existing behaviours Allow experimentation
Science – keying into the readers as well as the researchers is more important than the technology.