bigdatagenomics / adamLinks
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
☆1,043Updated 6 months ago
Alternatives and similar repositories for adam
Users that are interested in adam are comparing it to the libraries listed below
Sorting:
- Cloud-native genomic dataframes and batch computing☆1,043Updated this week
- An open-source toolkit for large-scale genomic analysis☆293Updated this week
- Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale product…☆1,045Updated this week
- Spark-based variant calling, with experimental support for multi-sample somatic calling (including RNA) and local assembly☆85Updated 7 years ago
- Official code repository for GATK versions 1.0 through 3.7 (core engine). For GATK 4 code, see the https://github.com/broadinstitute/gatk…☆299Updated 7 years ago
- A scalable genome browser. Apache 2 licensed.☆126Updated 3 years ago
- Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework☆72Updated 3 years ago
- A Java API for high-throughput sequencing data (HTS) formats.☆293Updated 3 weeks ago
- Bioinformatics for the Scala programming language☆114Updated 4 months ago
- Official code repository for GATK versions 4 and up☆1,899Updated this week
- A Variant Caller, Distributed. Apache 2 licensed.☆71Updated 6 years ago
- Documentation for the Google Genomics cookbook.☆141Updated 5 years ago
- An Open Computational Genomics Analysis platform for big data genomics analysis. OpenCGA is maintained and develop by its parent company …☆176Updated this week
- Simplifying robust end-to-end machine learning on Apache Spark.☆474Updated 8 years ago
- GenomicsDB☆110Updated 3 years ago
- Specifications of SAM/BAM and related high-throughput sequencing file formats☆695Updated 2 months ago
- Mirror of Apache Toree (Incubating)☆749Updated last month
- Variant calling from sequence reads using cloud computing☆40Updated 11 years ago
- Scalable Nucleotide Alignment Program -- a fast and accurate read aligner for high-throughput sequencing data☆297Updated 4 months ago
- Ready-to-go Parquet-formatted public 'omics datasets☆30Updated 10 years ago
- Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis☆1,023Updated last year
- Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.☆41Updated 10 months ago
- SparkBWA is a new tool that exploits the capabilities of a Big Data technology as Apache Spark to boost the performance of one of the mos…☆69Updated 6 years ago
- An open source platform for managing and analyzing biomedical big data☆415Updated this week
- Scripts used to setup a Spark cluster on EC2☆389Updated 8 years ago
- A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.☆1,043Updated this week
- New url: https://github.com/biointec/halvade☆19Updated 8 years ago
- An efficient updatable key-value store for Apache Spark☆254Updated 8 years ago
- Interactive Scala REPL in a browser☆741Updated 3 years ago
- machine learning for genomic variants☆147Updated last month