DataBiosphere / toil
A scalable, efficient, cross-platform (Linux/macOS) and easy-to-use workflow engine in pure Python.
β894Updated this week
Related projects: β
- Repository for the CWL standards. Use https://cwl.discourse.group/ for support πβ1,447Updated last month
- Common Workflow Language reference implementationβ332Updated last week
- Workflow Description Language - Specification and Implementationsβ763Updated last month
- An open source platform for managing and analyzing biomedical big dataβ372Updated last week
- Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale productβ¦β988Updated this week
- A lightweight parallel task engineβ145Updated 4 years ago
- Create clusters of VMs on the cloud and configure them with Ansible.β335Updated last year
- CGAT-ruffus is a lightweight python module for running computational pipelinesβ173Updated 3 years ago
- A light-weight wrapper library around Spotify's Luigi workflow library to make writing scientific workflows more fluent, flexible and modβ¦β332Updated last year
- Visual and code editor for Common Workflow Languageβ301Updated last year
- Workflow Description Language developer tools & local runnerβ172Updated last month
- Python package to extend Airflow functionality with CWL1.1 supportβ184Updated 10 months ago
- A language and runtime for distributed, incremental data processing in the cloudβ964Updated 11 months ago
- A DSL for data-driven computational pipelinesβ2,690Updated this week
- Python package for building, comparing, annotating, manipulating and visualising trees. It provides a comprehensive API and a collection β¦β779Updated this week
- Data Migration for the Blaze Projectβ1,000Updated 2 years ago
- Simple DAG-based job scheduler in Pythonβ755Updated 5 years ago
- A Variant Call Format reader for Python.β401Updated 11 months ago
- Bpipe - a tool for running and managing bioinformatics pipelinesβ226Updated 2 weeks ago
- Validated, scalable, community developed variant calling, RNA-seq and small RNA analysisβ985Updated 3 weeks ago
- Parallel programming with Pythonβ410Updated 2 months ago
- Pysam is a Python package for reading, manipulating, and writing genomics data such as SAM/BAM/CRAM and VCF/BCF files. It's a lightweightβ¦β774Updated 2 months ago
- Efficient pythonic random access to fasta subsequencesβ450Updated last month
- The Nested Containment List for Python. Basically a static interval-tree that is silly fast for both construction and lookups.β209Updated 2 weeks ago
- Python Scientific Pipeline Management Systemβ71Updated last year
- A cross-compatible CLI and Python API for accessing block and object storageβ35Updated 4 months ago
- StarCluster is an open source cluster-computing toolkit for Amazon's Elastic Compute Cloud (EC2).β582Updated 2 years ago
- ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 liβ¦β996Updated 3 weeks ago
- A curated list of nextflow based pipelinesβ568Updated last year
- Docker Images tracking the stable Galaxy releases.β225Updated last year