aehrc / VariantSpark
machine learning for genomic variants
☆142 Updated 4 months ago
Alternatives and similar repositories for VariantSpark:
Users who are interested in VariantSpark are comparing it to the libraries listed below.
- A scalable genome browser. Apache 2 licensed. ☆125 Updated 2 years ago
- Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed. ☆40 Updated last week
- GenomicsDB ☆111 Updated 2 years ago
- High performance data storage for importing, querying and transforming variants. ☆98 Updated last week
- Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework ☆70 Updated 2 years ago
- Efficient variant-call data storage and retrieval library using the TileDB storage library. ☆92 Updated this week
- CLI for interacting with Cromwell servers ☆53 Updated 10 months ago
- Reference implementation of the APIs defined in ga4gh-schemas. RETIRED 2018-01-24 ☆98 Updated 7 years ago
- MuTect -- Accurate and sensitive cancer mutation detection ☆95 Updated 2 years ago
- Extensible specification for representing and uniquely identifying biological sequence variation ☆86 Updated this week
- GA4GH Variation Representation Python Implementation ☆53 Updated this week
- This repo provides tools to convert ClinVar data into a tab-delimited flat file, and also provides that resulting tab-delimited flat file… ☆122 Updated 5 years ago
- Repository for the GA4GH Benchmarking Team work developing standardized benchmarking methods for germline small variant calls ☆198 Updated 3 years ago
- Workflows for germline short variant discovery with GATK4 ☆134 Updated 3 years ago
- Scalable gVCF merging and joint variant calling for population sequencing projects ☆154 Updated 10 months ago
- A library for manipulating bioinformatics sequencing formats in Apache Spark ☆32 Updated last week
- ☆82 Updated 6 years ago
- Workflows used for WGS data processing -- replaced by https://github.com/gatk-workflows/gatk4-genome-processing-pipeline ☆57 Updated 5 years ago
- Browser for ExAC consortium data ☆106 Updated 3 years ago
- ☆174 Updated last year
- Workflows for processing high-throughput sequencing data for variant discovery with GATK4 and related tools ☆149 Updated 2 years ago
- Tibanna helps you run your genomic pipelines on Amazon cloud (AWS). It is used by the 4DN DCIC (4D Nucleome Data Coordination and Integr… ☆70 Updated 3 months ago
- Various algorithms for analysing genomics data ☆210 Updated this week
- Scripts for working with Google Cloud Dataproc service ☆37 Updated 5 years ago
- An Open Computational Genomics Analysis platform for big data genomics analysis. OpenCGA is maintained and developed by its parent company … ☆167 Updated this week
- A tool set for short variant discovery in genetic sequence data. ☆195 Updated 3 years ago
- Tools for working with genomic and high throughput sequencing data. ☆321 Updated this week
- A collection of Python clients and accessory scripts for interacting with the Cromwell ☆22 Updated 2 years ago
- The Pharmacogenomic Clinical Annotation Tool ☆128 Updated this week
- TransVar - multiway annotator for precision medicine ☆122 Updated last year