RO-Crate for workflow run provenance

Presenter: Simone Leo (CRS4)

Schedule:

  • Session 1: :earth_americas: :earth_africa: (Americas-EMEA) Monday, February 27th 09:00 - 13:00 US EST / 14:00 - 18:00 UTC

Provenance is information on the production process of a physical or digital object. In the case of workflow executions, we are interested in what outputs were produced, what where the values of the input parameters, which tools were run, how long the execution took, etc. The CWLProv standard provides guidelines to represent such information, but it suffers from interoperability, usability and flexibility problems, and it’s only implemented in cwltool. This talk introduces Workflow Run RO-Crate, a lightweight approach to workflow run provenance that’s machine actionable, flexible and interoperable across WMSs. A software tool is available to convert CWLProv RO bundles to RO-Crates conforming to the Workflow Run RO-Crate specifications. Support for the format is available in StreamFlow (CWL), Galaxy, COMPSs, WfExS and Sapporo, and is planned to be included in a future release of cwltool.

Slides: RO-Crate for workflow run provenance - Google Slides

Please leave your questions for the presenter below!

As an alternative to YouTube, this presentations is also available on ConfTube

2 Likes

Thanks for mentioning the StreamFlow implementation and runcreate. I will have more time this week to work on the Autosubmit crate & validation. I will look at the StreamFlow code and also try runcreate. Thanks!!