1 / 21

Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013

Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013. ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox (RPI) and Andrew Maffei (WHOI) NEFSC Collaborators: Jon Hare and Mike Fogarty Software programmer: Massimo Di Stefano

adina
Download Presentation

Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox (RPI) and Andrew Maffei (WHOI) NEFSC Collaborators: Jon Hare and Mike Fogarty Software programmer: Massimo Di Stefano Informatics and metadata: Stace Beaulieu stace@whoi.edu

  2. Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments:Adopting a provenance model for a collaborative report July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox (RPI) and Andrew Maffei (WHOI) NEFSC Collaborators: Jon Hare and Mike Fogarty Software programmer: Massimo Di Stefano Informatics and metadata: Stace Beaulieu stace@whoi.edu

  3. Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments:Adopting a provenance model for a collaborative report July 2013

  4. Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments:Adopting a provenance model for a collaborative report July 2013 Metadata for data and workflow provenance (i.e., the marine ecosystem indicators and the collaborative report)

  5. Use Case: Northeast Shelf Large Marine Ecosystem Ecosystem Status Report Goal: “traceability, repeatability, explanation, verification, and validation” for ecosystem data and information products in the NEFSC Ecosystem Status Report (ESR)

  6. Page from 2009 ESR Section on Climate Forcing Figures available for download as PDF or image files – but without access to data or metadata

  7. Page from 2009 ESR Section on Climate Forcing Figures available for download as PDF or image files – but without access to data or metadata Note: NOAA directive for ISO 19115 metadata, but these are not sufficient to describe time-series indicators

  8. Software design to track provenance M. Di Stefano

  9. Software design to track provenance M. Di Stefano

  10. PROV Data Model http://www.w3.org/TR/prov-dm/ W3C Recommendation 30 April 2013 Core Structures (types and relations)

  11. PROV Data Model http://www.w3.org/TR/prov-dm/ W3C Recommendation 30 April 2013 Core Structures (types and relations) Entity may be a single data product, or a chapter containing several data products

  12. PROV Data Model http://www.w3.org/TR/prov-dm/ W3C Recommendation 30 April 2013 Core Structures (types and relations) Entity may be a single data product, or a chapter containing several data products PROV-O: The PROV Ontology (expresses PROV-DM using OWL2) http://www.w3.org/TR/prov-o/

  13. http://ipython.org/ Screenshot of IPython Notebook used to track both data and workflow provenance

  14. http://ipython.org/ Screenshot of IPython Notebook used to track both data and workflow provenance Code in Python, Matlab, R, other

  15. http://ipython.org/ Screenshot of IPython Notebook used to track both data and workflow provenance Code in Python, Matlab, R, other

  16. http://ipython.org/ Screenshot of IPython Notebook used to track both data and workflow provenance Notebook can be shared, or output as script, HTML, PDF, other

  17. PDF output of IPython Notebook with clickable links to data and code

  18. PDF output of IPython Notebook with clickable links to data and code

  19. Screenshot of csv file at GitHub

  20. Screenshot of csv file at GitHub Having access not only to the data that are plotted, but also to provenance metadata increases the (re-) usability of the data

More Related