aehrc / VariantSpark
machine learning for genomic variants
☆143Updated 3 months ago
Alternatives and similar repositories for VariantSpark:
Users that are interested in VariantSpark are comparing it to the libraries listed below
- Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.☆39Updated 2 months ago
- A scalable genome browser. Apache 2 licensed.☆125Updated 2 years ago
- Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework☆70Updated 2 years ago
- High performance data storage for importing, querying and transforming variants.☆97Updated 3 weeks ago
- Reference implementation of the APIs defined in ga4gh-schemas. RETIRED 2018-01-24☆96Updated 6 years ago
- Extensible specification for representing and uniquely identifying biological sequence variation☆82Updated last week
- MuTect -- Accurate and sensitive cancer mutation detection☆94Updated last year
- GenomicsDB☆111Updated 2 years ago
- GA4GH Variation Representation Python Implementation☆52Updated 2 weeks ago
- An opinionated Cromwell orchestration manager.☆40Updated last year
- Tibanna helps you run your genomic pipelines on Amazon cloud (AWS). It is used by the 4DN DCIC (4D Nucleome Data Coordination and Integr…☆70Updated last month
- This project is deprecated, please see strelka2 at https://github.com/Illumina/strelka☆38Updated 7 years ago
- This repo provides tools to convert ClinVar data into a tab-delimited flat file, and also provides that resulting tab-delimited flat file…☆122Updated 4 years ago
- The Pharmacogenomic Clinical Annotation Tool☆124Updated last month
- Workflow Description Language compiler for the DNAnexus platform☆40Updated last year
- Annotation of VCF variants with functional impact and from databases (executable+library)☆57Updated this week
- Efficient variant-call data storage and retrieval library using the TileDB storage library.☆91Updated last week
- An Open Computational Genomics Analysis platform for big data genomics analysis. OpenCGA is maintained and develop by its parent company …☆167Updated this week
- Associations of genomic features, drugs and diseases☆48Updated 2 years ago
- Scalable gVCF merging and joint variant calling for population sequencing projects☆154Updated 9 months ago
- The Platinum Genomes Truthset☆85Updated 7 years ago
- Do not use - please refer to our newest code: https://github.com/cgat-developers/cgat-apps☆124Updated 6 years ago
- Workflows used for WGS data processing -- replaced by https://github.com/gatk-workflows/gatk4-genome-processing-pipeline☆57Updated 4 years ago
- Obsolete/Legacy GATK repository -- go to https://github.com/broadinstitute/gatk instead☆33Updated 7 years ago
- An open-source toolkit for large-scale genomic analysis☆274Updated 3 weeks ago
- ☆82Updated 6 years ago
- An option to spin cost effective EMR clusters in AWS with Hail and JupyterNotebook installed☆16Updated 4 years ago
- A library for manipulating bioinformatics sequencing formats in Apache Spark☆31Updated 9 months ago
- Educational materials for learning WDL☆126Updated 10 months ago
- Various algorithms for analysing genomics data☆203Updated this week