Published August 30, 2017 | Version v2
Working paper Open

Introducing Parsl: A Python Parallel Scripting Library

  • 1. University of Chicago
  • 2. Argonne National Laboratory
  • 3. University of Illinois Urbana-Champaign

Description

Researchers frequently rely on large-scale and domain-specific workflows to conduct their science. These workflows may integrate a variety of independent software functions and external applications. However, developing and executing such workflows can be difficult, requiring complex orchestration and management of applications and data as well as customization for specific execution environments. Parsl (Parallel Scripting Library), a Python library for programming and executing data-oriented workflows in parallel, addresses these problems. Developers simply annotate a Python script with Parsl directives; Parsl manages the execution of the script on clusters, clouds, grids, and other resources. Parsl orchestrates required data movement and manages the execution of Python functions and external applications in parallel. In this abstract we describe Parsl’s architecture and highlight two domains in which it has been used.
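The programming model described above (annotate ordinary Python functions, then let a runtime execute them in parallel and track the data passed between them) can be illustrated with a minimal sketch built only on the Python standard library. This is not Parsl's API: the decorator name `parallel_app`, the thread pool, and the example functions are assumptions made for illustration, and Parsl's actual directives and executor configuration are described in its own documentation.

```python
from concurrent.futures import Future, ThreadPoolExecutor
from functools import wraps

# Assumption: a small local thread pool stands in for clusters, clouds, or grids.
_executor = ThreadPoolExecutor(max_workers=4)


def parallel_app(func):
    """Hypothetical annotation: calling the function returns a Future immediately."""

    @wraps(func)
    def wrapper(*args):
        def task():
            # Wait for upstream results inside the worker, so that data
            # dependencies between annotated functions are respected.
            resolved = [a.result() if isinstance(a, Future) else a for a in args]
            return func(*resolved)

        return _executor.submit(task)

    return wrapper


@parallel_app
def square(x):
    return x * x


@parallel_app
def total(*values):
    return sum(values)


# The five square() calls are independent and may run concurrently;
# total() starts its work only once all of its inputs are available.
squares = [square(i) for i in range(5)]
result = total(*squares)
print(result.result())  # 0 + 1 + 4 + 9 + 16 = 30
```

The script itself reads like sequential Python; the parallelism and the ordering of dependent steps come from the annotation and the futures it returns, which is the general idea the description attributes to Parsl.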

Files

Parsl-Abstract.pdf (300.8 kB)
md5:f41d97df0cbe365a4d3a4e4df9c618bf