1 / 53

Antony Williams, Valery Tkachenko and Richard Kidd ACS Dallas March 2014

Antony Williams, Valery Tkachenko and Richard Kidd ACS Dallas March 2014. UK National Chemical Database Service: An integration of commercial and public chemistry services to support chemists in the United Kingdom. UK Chemical Database Service.

phyre
Download Presentation

Antony Williams, Valery Tkachenko and Richard Kidd ACS Dallas March 2014

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Antony Williams, Valery Tkachenko and Richard Kidd ACS Dallas March 2014 UK National Chemical Database Service: An integration of commercial and public chemistry services to support chemists in the United Kingdom

  2. UK Chemical Database Service The National Chemical Database Service is for UK academics – see later for Rest of World

  3. Vision for the Service PART 1 Provide access to databases and services of interest to the academic community to serve their needs. Access to services to include: Crystallography data – Organic and inorganic materials Thermophysical data Reactions Data including retrosynthetic analysis Prediction technologies – name generation, physicochemical parameters, NMR prediction

  4. Service Rollout Many services are hosted in the cloud Access through login/password, IP authentication or Shibboleth authentication Lots of hard work in a very short time – so much thanks to all of the service providers More providers stepped up to help –ChemAxon Crystallography concern (understatement!)

  5. Feedback from Community Converted initial public negativity spike on Twitter pre-release to very positive feedback post-release Training required – onsite training sessions organized Available Chemicals Directory is big plus! Concerns with Retrosynthetic Analysis tool

  6. Usage Majority of usage is for crystallography data – previous provider had same bias Usage is increasing month-by-month Still way-under used and in many cases low awareness

  7. Vision for the Service PART 2 Response to the call for proposals included our vision for a 21st Century data repository At a time of Open Access, Open Data and funding agency requirement to make data public – build a data repository Funding is split for licensing content and services (VAST MAJORITY) and some funding for research and development

  8. An Initial “Vague” Vision Set Manage “all” of the chemistry data associated with chemical substances Data to be downloadable, reusable, interactive Build a platform that enables the scientist Data storage, validation, standardization and curation Collaborative data sharing Provide data platform that can enable and enhance publishing of scientific papers

  9. Data Repository Registration of chemical compounds Deposition of chemical syntheses Addition of analytical data Integration to electronic notebooks Rewards and recognition for data sharing Document processing Hosting of data as private, embargoed or public

  10. What we will deliver for all data Simple interfaces for uploading of data Embeddable widgets and programming interfaces to utilize in in-house systems, ELNs Automated harvesting approaches – sweeping directories for data Data validation where possible

  11. Input data pipeline

  12. Compounds upload Draw chemicals in the interface (Javascript editors – PC, Mac, Tablets, Phones) Drag and drop of compounds Automated generate of properties – Formulae, Mw, Mi, physchem properties Metadata input forms Bulk upload

  13. Depositions Gateway User Interface

  14. Depositions Gateway User Interface

  15. Chemical Validation and Standardization

  16. Reactions Hosting of reaction data – standard “document formats” – full flexibility but limiting – extraction of data from embedded objects Encourage template formats – using ELNs for example, community agreed templates

  17. Electronic Notebook Data Development work integrating chemistry into the Southampton Labtrove notebook Stoichiometry table development Analytical data integration “ChemTrove” rolled out to a small test group in January

  18. Micropublishing Syntheses

  19. ChemSpider SyntheticPages

  20. Requirements Community agreement on acceptable templates for CSSP/Reactions deposition Data Model deposition based on mappings between template and CSSP model Adoption of Labtrove interface for deposition

  21. What we will deliver Micropublishing platform for submission of Protocols and Procedures Reactions Safety and Hazard data (LATER) Template-based submissions of procedures Matched to ELN submissions Full details for user submission versus mapped submission into database

  22. Reaction Deposition/Validation

  23. Reaction Deposition/Validation

  24. Spectral Data Support for “structure identification” is a must – “greatest value” for reference and lookup Support for data standards primarily – JCAMP, mzML, SPC Want to support ASSIGNED data formats Hold binary files but prefer standards – WHY?

  25. Raw Spectral Data

  26. 10 years from now… Binary file formats generally need original data processing software to deal with them – from Bruker, Agilent, Jeol, Thermo, Waters, blah, blah, blah, blah,… While we can store the original raw data files for posterity should we? This has been one focus for data repositories

  27. This is way more useful

  28. Processed data… Spectral searching is made possible Spectral matching is possible

  29. This is what we really want…

  30. Addition of Analytical Data Spectral Container is in development using componentized widgets for display NIST spectra converted into standardized JCAMP format for deposition - 296,103 spectra deposited 10% of remaining NIST spectra need to be curated as there are obvious structure issues

  31. Javascript viewer NMR, MS, IR

  32. Depositions Gateway User Interface

  33. Document processing

  34. Depositions Gateway User Interface

  35. User Interface Approach

  36. User Interface Approach

  37. Present activities for ACS Fall Deposition process development of compounds, reactions and spectral data by end of Spring FTP, DropBox, Web-upload, ELN integration Compounds, Reactions, Spectral data search, display, download Data sharing – private, public, collaborative Metadata, metadata, metadata standards! Open Sourcing Chemical Registry System including CVSP

More Related