RSC|ChemSpider The Online Chemistry Database Where Community Contributions Count

Post on 01-Aug-2020

Page 1: RSC|ChemSpider The Online Chemistry Database …...ChemSpider The RSC’s Online Chemical Database A central hub for chemists to source information >28 million unique chemical records

RSC|ChemSpider – The Online Chemistry Database Where Community Contributions Count

ChemSpider

The RSC’s Online Chemical Database

A central hub for chemists to source information

>28 million unique chemical records

Aggregated from >400 data sources

Chemicals, spectra, CIF files, movies, images, podcasts, links to patents, publications, predictions

A central hub for chemists to deposit & curate data

Answer Questions with ChemSpider

Questions a chemist might ask…

What is the melting point of n-heptanol?

What is the chemical structure of Xanax?

Chemically, what is phenolphthalein?

What are the stereocenters of cholesterol?

Where can I find publications about xylene?

What are the different trade names for Ketoconazole?

What is the NMR spectrum of Aspirin?

What are the safety handling issues for Thymol Blue?

I want to know about “Vincristine”

If all algorithms work then everything on the page is correct by default except the name!

Vincristine: Identifiers and Properties

Vincristine: Vendors and Sources

Vincristine: Patents

Vincristine: Articles

Spectra Linked

Multiple Spectra for One Structure

ChemSpider ID 24528095 H1 NMR

ChemSpider ID 24528095 C13 NMR

ChemSpider ID 24528095 HHCOSY

ChemSpider ID 24528095 HSQC

ChemSpider ID 24528095 HMBC

About Structures

The InChI Standard

InChIKeys

Search the Web by Structure
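
As an illustration of searching the web by structure, the short Python sketch below checks that a string has the standard InChIKey layout (27 characters: 14 letters, a hyphen, 10 letters, a hyphen, 1 letter) and wraps it in quotes for an exact-match web search. The function names are hypothetical and not part of ChemSpider or any InChI software; the check covers only the layout, not whether the key belongs to a real structure. The example key is the InChIKey for aspirin.

```python
import re

# Standard InChIKey layout: 14 letters, hyphen, 10 letters, hyphen, 1 letter.
INCHIKEY_PATTERN = re.compile(r"[A-Z]{14}-[A-Z]{10}-[A-Z]")


def is_inchikey(text: str) -> bool:
    """Return True if text has the 27-character InChIKey layout (hypothetical helper)."""
    return INCHIKEY_PATTERN.fullmatch(text) is not None


def exact_match_query(inchikey: str) -> str:
    """Wrap a key in double quotes so a search engine treats it as one exact term."""
    if not is_inchikey(inchikey):
        raise ValueError(f"not an InChIKey: {inchikey!r}")
    return f'"{inchikey}"'


aspirin_key = "BSYNRYMUTXBXSQ-UHFFFAOYSA-N"  # InChIKey of aspirin
print(is_inchikey(aspirin_key))
print(exact_match_query(aspirin_key))
```

Because the key is a fixed-length text string, a general search engine can index it like any other word, which is what makes a structure-based web search possible.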

InChIs

Searches: The INTERNET

All ChemSpider and Internet searches are “simply algorithms”, but synonym searching is based on an assertion

Validated Names for Searching…

Scientists are measured by…

Impact

Citations

Papers

Patents

Funding

and increasingly by “Alt-Metrics” – what you say, what you contribute, your data depositions, your code in repositories, your voice in the network, your activities on Facebook (be careful!)

If it was not just about me…

We might have a community built encyclopedia

I might know where the best restaurants are

I might get good advice on books to read

I might know which movies to watch

I might know which plumber to call

Data might just be Open

The Social Network

Career-wise NOT having a personal presence online will be a detriment

Self-marketing

Establishing a profile

Getting on the record

Collaborative Science

Demonstrating a skill set

Measured using alternative metrics

Contributing to the public peer review process

Social Networking Tools

A growing number of social networking tools:

Facebook

Twitter

LinkedIn

Flickr

YouTube

Blogs

Communities

Collaborative environments

Chemistry Social Networking

Methods of sharing MY chemistry online include:

Wikis or blogs

Slideshare for presentations

YouTube for videos

Flickr, Wikimedia etc. for images

PubChem for assay data

NMRShiftDB for NMR assignments

Google Docs for data

Your profile online…

Establish a Mendeley Account: http://www.mendeley.com/profiles/antony-williams/

ResearchGate: http://www.researchgate.net/profile/Antony_Williams/

Microsoft Academic Search: http://academic.research.microsoft.com/Author/12789419

The Alt-Metrics Manifesto

http://altmetrics.org/manifesto/

What is my ImpactStory?

ImpactStory

Enabled by ORCID…

The Linked Network

The World of Contribution

Times have changed

Immediacy of social networks

Commenting on articles/data is here

The “participating scientist” has high profile

And who can be a scientist now???

A Ten Year Old Scientist

Share Science!!! Not Just Yourself

If you have time, and the inclination, become a community contributor

Share your expertise in the new world of openness

Share your Open Source code

Share your data and your model

Share your Figures

Contribute to Wikis – Wikipedia and others

Become an Open Notebook Scientist

Expose Data and Figures on FigShare

ChemSpider SyntheticPages

Many syntheses are not published but are of value

A database of synthesis procedures built for the community, by the community.

Peer-reviewed by the community

Each contribution DOI’ed. Develop online scientific reputation at a time of “micro-publications”

Integrates semantic mark-up and visualization tools

http://cssp.chemspider.com

Submission process

Register as a user

Use the Submit button and fill in the fields…

Submission Process

Submissions reviewed by editorial board

Published as is or comments sent to author

Online Peer Review process – engage chemists in ongoing discussions and feedback loop

Supported data include web movies, images, live spectra, etc.

Recent Submissions

Interactive Data

Most Accessed

Is it working? Show of hands…

How many of you know ChemSpider?

How many of you know CSSP?

Have any of you submitted to CSSP?

Low submissions but some dedicated authors

Popular Authors

Is it working? Show of hands…

What reasons are there you would not publish?

Time

Approval from supervisor

Need to keep the science quiet

Publishing on CSSP prevents future publishing?

Contributing to The Quality of Data

What is the Structure of Vitamin K?

A lipid cofactor that is required for normal blood clotting. Several forms of vitamin K have been identified: VITAMIN K1 (phytomenadione) derived from plants, VITAMIN K2 (menaquinone) from bacteria, and synthetic naphthoquinone provitamins, VITAMIN K3 (menadione).

What is the Structure of Vitamin K1?

CAS’s Common Chemistry

Wikipedia

Wolfram Alpha

DailyMed

People Use Trusted Resources…

Quality police…

How will it improve?

Participation and contribution

ALL Different, ALL “Domoic Acids”

The EXPERTS must get it right?!

Question Everything Online

Deposition, Annotation and Validation

ANYBODY can annotate a record on ChemSpider

Registered users can deposit new data

Registered users can validate existing data

CURATION

Search “Vitamin H”

“Curate” Identifiers

Spectra Linked

Spectral Uploading

Various types of NMR spectra supported

www.SpectralGame.com

http://www.jcheminf.com/content/1/1/9

Spectral Game

Increasing Complexity

SpectralGame in the hand

Work in Progress – 300k Reactions

Data Enabling the RSC Archive

An archive going back to 1841. Project underway to “data enable” the archive:

Extract chemistry – chemicals, reactions, experimental data points, complex data

Semantic enriching of the articles for interactive viewing and crowdsourced annotation/curation

Dramatically expands the types of queries possible across the archive

A model for data segregation

Integrate with institutional repositories

Access to Theses and Dissertations

Model Building with Community Data

Community data can be the basis of model building

Consume data from available databases, RSC archive, new publications and build predictive algorithms for the community

Accept research data from the community and incorporate it into predictions

An Open Data-Centric Chemistry Hub

Diagram labels (contributors and sources): Internet data, commercial software, pre-competitive data, open science, open data, publishers, educators, open databases, chemical vendors

Diagram labels (material types): small organic molecules, undefined materials, organometallics, nanomaterials, polymers, minerals, particle bound, links to biologicals

Wikipedia: http://en.wikipedia.org/wiki/Antony_John_Williams

An Interesting Read: http://tinyurl.com/7e3l6rz

ScientistsDB: http://tinyurl.com/7cqylsp

ScientistsDB

Write your OWN article about yourself on ScientistsDB

It is a community-policed site so any comments you write might be challenged/edited. It is “your” page but edited by all

An article, once approved by the community, can, in theory, be moved to Wikipedia

All content is licensed under the standard CC-BY-SA 3.0 license used by Wikipedia

Acknowledgments

RSC|ChemSpider team

CSSP Editorial Team

All data source providers

Curators and annotators

Service providers:

ACD/Labs

OpenEye

GGA Software Services

Many others….

Communicating Science

As scientists one of our primary roles is contribution

The internet enables contribution in different ways, benefitting the scientist and the community

Share your data and experience – it can enhance your public profile as a scientist, make you more discoverable and contribute data to the community

AltMetrics will be a measure of scientists…

Thank you

Email: [email protected]

Twitter: ChemConnector

Personal Blog: www.chemconnector.com

SLIDES: www.slideshare.net/AntonyWilliams