aehrc / VariantSpark
machine learning for genomic variants
☆145Updated 6 months ago
Alternatives and similar repositories for VariantSpark:
Users that are interested in VariantSpark are comparing it to the libraries listed below
- Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.☆40Updated last month
- A scalable genome browser. Apache 2 licensed.☆125Updated 2 years ago
- High performance data storage for importing, querying and transforming variants.☆98Updated last month
- An Open Computational Genomics Analysis platform for big data genomics analysis. OpenCGA is maintained and develop by its parent company …☆170Updated last week
- Scripts for working with Google Cloud Dataproc service☆37Updated 5 years ago
- Tibanna helps you run your genomic pipelines on Amazon cloud (AWS). It is used by the 4DN DCIC (4D Nucleome Data Coordination and Integr…☆69Updated 2 weeks ago
- GenomicsDB☆111Updated 2 years ago
- Do not use - please refer to our newest code: https://github.com/cgat-developers/cgat-apps☆124Updated 6 years ago
- Efficient variant-call data storage and retrieval library using the TileDB storage library.☆93Updated 2 weeks ago
- Scalable gVCF merging and joint variant calling for population sequencing projects☆157Updated last year
- Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework☆70Updated 2 years ago
- Browser for ExAC consortium data☆106Updated 3 years ago
- ☆82Updated 6 years ago
- High-Performance NoSQL database and RESTful web services to access to most relevant biological data. Found a bug or have an idea for a ne…☆92Updated last week
- This repo provides tools to convert ClinVar data into a tab-delimited flat file, and also provides that resulting tab-delimited flat file…☆124Updated 5 years ago
- A library for manipulating bioinformatics sequencing formats in Apache Spark☆32Updated last month
- Workflows for germline short variant discovery with GATK4☆135Updated 3 years ago
- GA4GH Variation Representation Python Implementation☆55Updated last week
- Annotation of VCF variants with functional impact and from databases (executable+library)☆59Updated last week
- De novo assembly based variant calling pipeline for Illumina short reads☆108Updated 4 years ago
- Workflow Description Language compiler for the DNAnexus platform☆40Updated last year
- Workflows used for WGS data processing -- replaced by https://github.com/gatk-workflows/gatk4-genome-processing-pipeline☆57Updated 5 years ago
- In-progress projects at Harvard School of Public Health Bioinformatics Core☆41Updated 6 years ago
- Reference implementation of the APIs defined in ga4gh-schemas. RETIRED 2018-01-24☆98Updated 7 years ago
- CLI for interacting with Cromwell servers☆53Updated last year
- A tool set for short variant discovery in genetic sequence data.☆196Updated 3 years ago
- MuTect -- Accurate and sensitive cancer mutation detection☆96Updated 2 years ago
- This project is deprecated, please see strelka2 at https://github.com/Illumina/strelka☆37Updated 8 years ago
- The Pharmacogenomic Clinical Annotation Tool☆131Updated last week
- Source code and related materials for the O'Reilly book☆94Updated 2 years ago