googlegenomics / dockerflow
Dockerflow is a workflow runner that uses Dataflow to run a series of tasks in Docker with the Pipelines API
☆97Updated 6 years ago
Related projects: ⓘ
- Google Cloud Dataflow pipelines such as Identity-By-State as well as useful utility classes.☆36Updated last year
- A scalable genome browser. Apache 2 licensed.☆124Updated last year
- Advanced BigQuery examples on genomic data.☆89Updated 7 years ago
- Deprecated☆100Updated 5 years ago
- workflow and resource management system for bioinformatics data analysis☆69Updated 3 years ago
- ☆18Updated this week
- This repository implements converters and tools for working with NGS data in HPC or Hadoop cluster☆17Updated 6 years ago
- MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.☆14Updated 2 years ago
- BigDataScript: Scirpting language for big data☆92Updated 3 years ago
- Example Cloud Datalab iPython Notebooks for genomics use cases.☆25Updated 7 years ago
- ☆24Updated this week
- Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.☆38Updated last month
- A Variant Caller, Distributed. Apache 2 licensed.☆71Updated 5 years ago
- Heterogeneity-incorporating Workflow ApplicationMaster for YARN☆26Updated 6 years ago
- Quality control methods for human genomic variants.☆62Updated 2 years ago
- Google Container Engine, JupyterHub, and Jupyter for classroom scenarios☆59Updated 6 years ago
- Documentation for the Google Genomics cookbook.☆142Updated 4 years ago
- Easily launch cloud applications.☆42Updated last year
- Examples of using CloudML with genomic data.☆18Updated 5 years ago
- TileDB☆80Updated last year
- Secure Cloud Object REsource: file transfer microservice☆18Updated last week
- Variant calling from sequence reads using cloud computing☆38Updated 10 years ago
- [Historical] Reproducible Analyses for Bioinformatics☆106Updated 5 years ago
- Examples of how to get started with genomics data in BigQuery in many languages.☆53Updated 7 years ago
- Package to extend Airflow functionality with CWL v1.0 support☆12Updated 5 years ago
- Ready-to-go Parquet-formatted public 'omics datasets☆30Updated 8 years ago
- An example of bioinformatics and bigdata tools can playing nicely together☆14Updated 8 years ago
- TheSparkBox is an all-in-one Spark deployment that you can use to fire up a local cluster.☆12Updated 6 years ago
- Integrate the GA4GH schemas and probably a scala impl of the service.☆14Updated 8 years ago
- Butler is a framework for running scientific workflows on public and academic clouds.☆69Updated 4 years ago