lifeomic / spark-vcf
Spark VCF data source implementation for Dataframes
☆14Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for spark-vcf
- A genomics pipeline build on top of the GATK Queue framework. Main repository: https://github.com/NationalGenomicsInfrastructure/piper (m…☆21Updated 8 years ago
- Easily run WDL workflows on GCP☆13Updated 3 years ago
- A library for manipulating bioinformatics sequencing formats in Apache Spark☆31Updated 7 months ago
- Online material and code base for the article Coordinates and Intervals in Graph Based Reference Genomes☆11Updated 7 years ago
- Python client for GA4GH htsget protocol☆15Updated 2 years ago
- ALPACA is a caller for genomic variants (single nucleotide and small indels) from next-generation sequencing data that uses a novel algeb…☆23Updated 4 years ago
- PySeqArray: data manipulation of whole-genome sequencing variants with SeqArray files in Python (pre-release version)☆13Updated 7 years ago
- A better, faster way to count guides in CRISPR screens.☆27Updated last month
- ☆14Updated 2 years ago
- qtools has helper functions to submit jobs to compute clusters (PBS on TSCC, SGE on oolite) from within Python☆21Updated last year
- robust matching of small variant datasets using flexible scoring schemes☆10Updated 4 years ago
- Library for visualising genomic features in Python.☆15Updated 7 years ago
- Workflow Description Language compiler for the DNAnexus platform☆40Updated last year
- GenoTypes Compressor☆15Updated 2 years ago
- An opinionated Cromwell orchestration manager.☆40Updated last year
- GWAS and rare variants tests at high speed using regenie☆11Updated 5 months ago
- Open Targets Genetics UI☆14Updated last year
- python (cython) wrapper for https://github.com/ryanlayer/giggle for fast interval searching of huge datasets.☆16Updated 6 years ago
- Boiler: a software tool for highly efficient, lossy compression of RNA-seq alignments☆13Updated 8 years ago
- Library of snakemake rules.☆12Updated 6 years ago
- Aligner for sequencing data☆18Updated 6 years ago
- Docker images of bioinformatics software☆21Updated 7 years ago
- Benchmarking toolkit for variant calling☆47Updated 4 years ago
- Graphical assessment of structrial variants using 10x genomics data☆10Updated 7 years ago
- Exercises for training scientists to perform some RNA-seq analyses.☆11Updated 5 years ago
- Streaming relation (overlap, distance, KNN) of (any number of) sorted genomic interval sets. #golang☆47Updated 4 years ago
- ☆11Updated last year
- Library for indexing VCF files for random access searches by rsID☆17Updated last year