lifeomic / spark-vcf
Spark VCF data source implementation for Dataframes
☆14Updated 2 years ago
Alternatives and similar repositories for spark-vcf:
Users that are interested in spark-vcf are comparing it to the libraries listed below
- A genomics pipeline build on top of the GATK Queue framework. Main repository: https://github.com/NationalGenomicsInfrastructure/piper (m…☆21Updated 8 years ago
- Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.☆39Updated 2 months ago
- Easily run WDL workflows on GCP☆13Updated 3 years ago
- ALPACA is a caller for genomic variants (single nucleotide and small indels) from next-generation sequencing data that uses a novel algeb…☆23Updated 2 months ago
- Exercises for training scientists to perform some RNA-seq analyses.☆11Updated 5 years ago
- python script to programmatically enrich your data using Enrichr API☆12Updated 7 years ago
- A library for manipulating bioinformatics sequencing formats in Apache Spark☆31Updated 9 months ago
- Python client for GA4GH htsget protocol☆15Updated 2 years ago
- python (cython) wrapper for https://github.com/ryanlayer/giggle for fast interval searching of huge datasets.☆16Updated 6 years ago
- qtools has helper functions to submit jobs to compute clusters (PBS on TSCC, SGE on oolite) from within Python☆21Updated last year
- Library for visualising genomic features in Python.☆15Updated 7 years ago
- Meta-Storms 2 is the standalone implementation of the Microbiome Search Engine (MSE; http://mse.ac.cn). This is the official software re…☆10Updated 4 years ago
- Boiler: a software tool for highly efficient, lossy compression of RNA-seq alignments☆13Updated 8 years ago
- Online material and code base for the article Coordinates and Intervals in Graph Based Reference Genomes☆11Updated 7 years ago
- Integrative visualization of multiple omic datasets onto KEGG pathways.☆11Updated 3 years ago
- stageR package☆11Updated last year
- provides common tools and lookup tables used primarily by the hgvs and uta packages☆22Updated 2 months ago
- ☆14Updated 2 years ago
- Workflow Description Language compiler for the DNAnexus platform☆40Updated last year
- Streaming relation (overlap, distance, KNN) of (any number of) sorted genomic interval sets. #golang☆47Updated 4 years ago
- Graphical assessment of structrial variants using 10x genomics data☆10Updated 7 years ago
- Homebrew formulae for bioinformatics software only available for Linux☆27Updated 5 years ago
- Library of snakemake rules.☆12Updated 6 years ago
- robust matching of small variant datasets using flexible scoring schemes☆10Updated 4 years ago
- Benchmarking toolkit for variant calling☆47Updated 4 years ago
- Ultrafast consensus module for raw de novo genome assembly of long uncorrected reads☆16Updated 4 years ago
- Collection of scripts for working with Wiggle files and analyzing sequencing data☆17Updated 5 years ago
- Distinguishing between generic and experiment-specific gene expression signals.☆12Updated last year