databricks / genomics-pipelines
secondary analysis pipelines parallelized with apache spark
☆15Updated 2 years ago
Related projects: ⓘ
- Genome-wide association studies identify genetic variations associated with a target disease or trait. Researchers and clinicians can use…☆11Updated 5 months ago
- DuckDB Extension for working with bioinformatic data.☆10Updated 11 months ago
- Parallel Genomic Analysis Toolkit☆14Updated 5 years ago
- Very large scale k-mer counting and analysis on Apache Spark.☆17Updated 7 months ago
- Open Targets evidence normalization and scoring pipeline☆12Updated last year
- ☆14Updated last year
- Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.☆38Updated 4 months ago
- An opinionated Cromwell orchestration manager.☆40Updated last year
- Issue tracker for Open Targets Platform and Open Targets Genetics Portal☆12Updated last week
- Use cloud technology to annotate human sequence variants in parallel.☆11Updated 3 years ago
- This guidance creates a scalable environment in AWS to prepare genomic, clinical, mutation, expression and imaging data for large-scale a…☆24Updated 5 months ago
- The Genomics Tertiary Analysis and Machine Learning Using Amazon SageMaker solution creates a scalable environment in AWS to develop mach…☆11Updated last year
- jinja2-enabled jupyter notebooks☆35Updated last month
- Genomic BigData Warehousing with Apache Spark and LakeHouse Architecture☆11Updated last year
- Spark VCF data source implementation for Dataframes☆14Updated 2 years ago
- Repository for development of the genomic module of the CDM.☆19Updated 5 years ago
- A library for manipulating bioinformatics sequencing formats in Apache Spark☆31Updated 5 months ago
- WebApp for DNA variants interpretation☆13Updated 2 weeks ago
- [in development] Proof-of-Concept variation translation, validation, and registration service☆12Updated last week
- A scala based DSL and framework for writing and executing bioinformatics pipelines as Directed Acyclic GRaphs☆69Updated 2 years ago
- Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.☆38Updated last month
- Pheno4j: a graph based HPO to NGS database☆32Updated last year
- ☆10Updated this week
- SQL support plugin for Nextflow☆25Updated 9 months ago
- Linter rules for Nextflow DSL scripts☆28Updated last week
- Map your disease and phenotype terms to the Open Targets platform ontology☆20Updated last month
- Exon is an OLAP query engine specifically for biology and life science applications.☆47Updated last week
- WDL tools for parsing, type-checking, and more☆23Updated last month
- SparkBLAST is a parallelization of a sequence alignment application (BLAST) that employs cloud computing for the provisioning of computat…☆9Updated 7 years ago
- WDL plugin for pytest☆48Updated last year