lifeomic / spark-vcf
Spark VCF data source implementation for Dataframes
☆14Updated 2 years ago
Alternatives and similar repositories for spark-vcf:
Users that are interested in spark-vcf are comparing it to the libraries listed below
- A genomics pipeline build on top of the GATK Queue framework. Main repository: https://github.com/NationalGenomicsInfrastructure/piper (m…☆21Updated 8 years ago
- Easily run WDL workflows on GCP☆13Updated 3 years ago
- Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.☆40Updated 3 weeks ago
- Online material and code base for the article Coordinates and Intervals in Graph Based Reference Genomes☆11Updated 7 years ago
- A library for manipulating bioinformatics sequencing formats in Apache Spark☆32Updated 3 weeks ago
- NGS duplicate marking☆19Updated 3 years ago
- ☆14Updated 2 years ago
- Library for indexing VCF files for random access searches by rsID☆17Updated last year
- Graphical assessment of structrial variants using 10x genomics data☆10Updated 8 years ago
- qtools has helper functions to submit jobs to compute clusters (PBS on TSCC, SGE on oolite) from within Python☆21Updated last year
- Streaming relation (overlap, distance, KNN) of (any number of) sorted genomic interval sets. #golang☆47Updated 4 years ago
- Utilities for analyzing mutations and neoepitopes in patient cohorts☆20Updated 6 years ago
- robust matching of small variant datasets using flexible scoring schemes☆10Updated 5 years ago
- ALPACA is a caller for genomic variants (single nucleotide and small indels) from next-generation sequencing data that uses a novel algeb…☆23Updated 4 months ago
- Exercises for training scientists to perform some RNA-seq analyses.☆11Updated 5 years ago
- Benchmarking toolkit for variant calling☆47Updated 4 years ago
- GenoTypes Compressor☆15Updated 2 years ago
- Library for visualising genomic features in Python.☆15Updated 7 years ago
- stageR package☆11Updated 2 years ago
- python (cython) wrapper for https://github.com/ryanlayer/giggle for fast interval searching of huge datasets.☆16Updated 7 years ago
- Simple matching of HTS samples based on HLA typing☆13Updated 8 years ago
- ☆15Updated 7 years ago
- Import a CWL workflow specification to Nextflow script (experimental)☆27Updated 6 years ago
- Meta-Storms 2 is the standalone implementation of the Microbiome Search Engine (MSE; http://mse.ac.cn). This is the official software re…☆10Updated 5 years ago
- Import and run CWL workflows on DNAnexus (alpha)☆13Updated 6 years ago
- Building the constrained coding regions (CCR) model☆16Updated 6 years ago
- python script to programmatically enrich your data using Enrichr API☆12Updated 7 years ago
- Analysis Framework for Biological Data from High Throughput Experiments☆34Updated 8 years ago
- The OpEx (Optimised Exome) pipeline☆9Updated 6 years ago
- Boiler: a software tool for highly efficient, lossy compression of RNA-seq alignments☆13Updated 8 years ago