lifeomic / spark-vcf
Spark VCF data source implementation for DataFrames
☆14Updated 2 years ago
Alternatives and similar repositories for spark-vcf
Users that are interested in spark-vcf are comparing it to the libraries listed below
- A genomics pipeline built on top of the GATK Queue framework. Main repository: https://github.com/NationalGenomicsInfrastructure/piper (m…☆21Updated 8 years ago
- Easily run WDL workflows on GCP☆13Updated 3 years ago
- Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.☆41Updated 3 months ago
- robust matching of small variant datasets using flexible scoring schemes☆10Updated 5 years ago
- Graphical assessment of structural variants using 10x Genomics data☆10Updated 8 years ago
- Online material and code base for the article Coordinates and Intervals in Graph Based Reference Genomes☆11Updated 8 years ago
- A library for manipulating bioinformatics sequencing formats in Apache Spark☆32Updated 3 months ago
- NExt generation Analysis Toolbox☆14Updated 9 years ago
- qtools has helper functions to submit jobs to compute clusters (PBS on TSCC, SGE on oolite) from within Python☆21Updated last year
- GenoTypes Compressor☆15Updated 3 years ago
- Exercises for training scientists to perform some RNA-seq analyses.☆11Updated 5 years ago
- Basic, no assumptions, multi-pileup☆24Updated 11 years ago
- ☆13Updated 8 years ago
- A catalogue of docker images for NGS data analysis tools☆9Updated 5 years ago
- Streaming relation (overlap, distance, KNN) of (any number of) sorted genomic interval sets. #golang☆47Updated 4 years ago
- Analysis Framework for Biological Data from High Throughput Experiments☆34Updated 8 years ago
- Python client for GA4GH htsget protocol☆15Updated 2 years ago
- Import and run CWL workflows on DNAnexus (alpha)☆13Updated 6 years ago
- Predict the functional consequences of both coding and non-coding single nucleotide variants (SNVs)☆20Updated 4 years ago
- python script to programmatically enrich your data using Enrichr API☆12Updated 7 years ago
- Finding a scalable alternative to the VCF File for genomics analysis☆14Updated 8 years ago
- python (cython) wrapper for https://github.com/ryanlayer/giggle for fast interval searching of huge datasets.☆16Updated 7 years ago
- Library of snakemake rules.☆12Updated 6 years ago
- stageR package☆13Updated 2 years ago
- Distinguishing between generic and experiment-specific gene expression signals.☆12Updated 2 years ago
- Survey of bioinformatics field☆26Updated 13 years ago
- PySeqArray: data manipulation of whole-genome sequencing variants with SeqArray files in Python (pre-release version)☆14Updated 7 years ago
- Benchmarking toolkit for variant calling☆47Updated 4 years ago
- variant integration methods for the 1000 Genomes Project☆21Updated 7 years ago
- ☆15Updated 2 months ago