citiususc / SparkBWA
SparkBWA is a new tool that exploits the capabilities of a Big Data technology as Apache Spark to boost the performance of one of the most widely adopted sequence aligner, the Burrows-Wheeler Aligner (BWA).
☆69Updated 5 years ago
Alternatives and similar repositories for SparkBWA:
Users that are interested in SparkBWA are comparing it to the libraries listed below
- BigBWA is a new tool that uses the Big Data technology Hadoop to boost the performance of the Burrows–Wheeler aligner (BWA).☆31Updated 2 years ago
- ☆15Updated 7 years ago
- This project is deprecated, please see strelka2 at https://github.com/Illumina/strelka☆38Updated 7 years ago
- Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework☆70Updated 2 years ago
- Obsolete/Legacy GATK repository -- go to https://github.com/broadinstitute/gatk instead☆33Updated 7 years ago
- Fast spliced aligner with low memory requirements☆41Updated 9 years ago
- GenomicsDB☆111Updated 2 years ago
- Java Bindings (JNI) for bwa☆19Updated 8 years ago
- Tools for early stage alignment file processing☆93Updated 5 years ago
- Efficient base quality score recalibrator for NGS data☆24Updated 9 years ago
- CRAM format specification and java API for read data.☆58Updated 6 years ago
- Official code repository for GATK versions 1.0 through 3.7 (full licensed package). For GATK 4 code, see the https://github.com/broadinst…☆143Updated 6 years ago
- Scalable RNA-seq analysis☆73Updated 4 years ago
- Fast and memory-efficient sequencing error corrector☆92Updated 9 months ago
- utilities for indexing and sequence extraction from FASTA files☆59Updated 3 years ago
- De novo assembly based variant calling pipeline for Illumina short reads☆108Updated 4 years ago
- Toil workflows for common genomic pipelines☆33Updated 5 years ago
- New url: https://github.com/biointec/halvade☆19Updated 7 years ago
- Workflows used for WGS data processing -- replaced by https://github.com/gatk-workflows/gatk4-genome-processing-pipeline☆57Updated 5 years ago
- To tackle the exponentially increasing throughput of Next-Generation Sequencing (NGS), most of the existing short-read aligners can be co…☆31Updated last year
- High performance data storage for importing, querying and transforming variants.☆98Updated this week
- Scalable gVCF merging and joint variant calling for population sequencing projects☆154Updated 10 months ago
- C++ Library to parse Illumina InterOp files☆75Updated 7 months ago
- A scalable genome browser. Apache 2 licensed.☆125Updated 2 years ago
- This repository contains information about latest release from Genome in a Bottle project☆73Updated 5 years ago
- Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.☆40Updated this week
- Assembly Based ReAligner☆72Updated 6 years ago
- VarSim: A high-fidelity simulation validation framework for high-throughput genome sequencing with cancer applications☆81Updated 4 months ago
- C++ htslib/bwa-mem/fermi interface for interrogating sequence data☆138Updated 6 months ago
- Rapid Mapping-based Isoform Quantification from RNA-Seq Reads☆126Updated 2 years ago