aehrc / VariantSpark
machine learning for genomic variants
☆145 · Updated 7 months ago
Alternatives and similar repositories for VariantSpark
Users who are interested in VariantSpark are comparing it to the libraries listed below.
- High performance data storage for importing, querying and transforming variants. ☆98 · Updated last month
- A scalable genome browser. Apache 2 licensed. ☆125 · Updated 2 years ago
- Reference implementation of the APIs defined in ga4gh-schemas. RETIRED 2018-01-24 ☆98 · Updated 7 years ago
- GA4GH Variation Representation Python Implementation ☆56 · Updated 2 weeks ago
- Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed. ☆40 · Updated 2 months ago
- Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework ☆70 · Updated 2 years ago
- Extensible specification for representing and uniquely identifying biological sequence variation ☆88 · Updated last month
- Workflow Description Language compiler for the DNAnexus platform ☆40 · Updated last year
- GenomicsDB ☆111 · Updated 2 years ago
- This repo provides tools to convert ClinVar data into a tab-delimited flat file, and also provides that resulting tab-delimited flat file… ☆125 · Updated 5 years ago
- Workflows used for WGS data processing -- replaced by https://github.com/gatk-workflows/gatk4-genome-processing-pipeline ☆57 · Updated 5 years ago
- Browser for ExAC consortium data ☆106 · Updated 3 years ago
- GCP Variant Transforms ☆139 · Updated 3 years ago
- High-Performance NoSQL database and RESTful web services to access to most relevant biological data. Found a bug or have an idea for a ne… ☆92 · Updated this week
- Source code and related materials for the O'Reilly book ☆95 · Updated 2 years ago
- Tibanna helps you run your genomic pipelines on Amazon cloud (AWS). It is used by the 4DN DCIC (4D Nucleome Data Coordination and Integr… ☆69 · Updated last month
- Annotation of VCF variants with functional impact and from databases (executable+library) ☆59 · Updated 2 weeks ago
- Scripts for working with Google Cloud Dataproc service ☆37 · Updated 5 years ago
- Efficient variant-call data storage and retrieval library using the TileDB storage library. ☆93 · Updated last week
- Obsolete/Legacy GATK repository -- go to https://github.com/broadinstitute/gatk instead ☆33 · Updated 7 years ago
- Fast and memory-efficient sequencing error corrector ☆93 · Updated last year
- Analysis examples based on the ISB-CGC hosted TCGA data, using Python and IPython Notebooks. ☆54 · Updated 5 years ago
- TransVar - multiway annotator for precision medicine ☆125 · Updated 2 years ago
- This project is deprecated, please see strelka2 at https://github.com/Illumina/strelka ☆37 · Updated 8 years ago
- The Pharmacogenomic Clinical Annotation Tool ☆134 · Updated 3 weeks ago
- ☆82 · Updated 6 years ago
- A library for manipulating bioinformatics sequencing formats in Apache Spark ☆32 · Updated 2 months ago
- CLI for interacting with Cromwell servers ☆53 · Updated last year
- De novo assembly based variant calling pipeline for Illumina short reads ☆108 · Updated 4 years ago
- An open-source toolkit for large-scale genomic analysis ☆278 · Updated 2 weeks ago