Arvados: Reproducible CWL Workflows and Data Management at Scale

Presenter: Peter Amstutz

Session 1 (Americas-EMEA): Monday, February 28th, 15:30 UTC

Summary: Arvados is an open-source platform for managing, processing, and sharing genomic and other large scientific and biomedical data. This talk will describe how Arvados manages petabytes of data, runs scalable workflows, identifies the origin and verifies the content of every dataset, and tracks every CWL workflow you run so that any output can be reliably reproduced. It will also cover new Arvados features, including LSF and Singularity support, cost reporting, and support for CWL 1.2.

