bigdatagenomics / adam
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
☆1,011 · Updated last month
Alternatives and similar repositories for adam:
Users who are interested in adam are comparing it to the libraries listed below:
- An open-source toolkit for large-scale genomic analysis ☆274 · Updated this week
- Cloud-native genomic dataframes and batch computing ☆992 · Updated this week
- Official code repository for GATK versions 1.0 through 3.7 (core engine). For GATK 4 code, see the https://github.com/broadinstitute/gatk… ☆295 · Updated 6 years ago
- A scalable genome browser. Apache 2 licensed. ☆125 · Updated 2 years ago
- Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale product… ☆1,009 · Updated this week
- Spark-based variant calling, with experimental support for multi-sample somatic calling (including RNA) and local assembly ☆84 · Updated 7 years ago
- Scalable Nucleotide Alignment Program -- a fast and accurate read aligner for high-throughput sequencing data ☆288 · Updated 3 weeks ago
- An Open Computational Genomics Analysis platform for big data genomics analysis. OpenCGA is maintained and developed by its parent company … ☆167 · Updated this week
- Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework ☆70 · Updated 2 years ago
- machine learning for genomic variants ☆142 · Updated 3 months ago
- Documentation for the Google Genomics cookbook. ☆143 · Updated 4 years ago
- a lightweight db framework for exploring genetic variation. ☆317 · Updated 4 years ago
- Tools (written in C using htslib) for manipulating next-generation sequencing data ☆1,670 · Updated this week
- Specifications of SAM/BAM and related high-throughput sequencing file formats ☆667 · Updated this week
- C library for high-throughput sequencing data formats ☆825 · Updated this week
- A Variant Caller, Distributed. Apache 2 licensed. ☆71 · Updated 5 years ago
- Reference implementation of the APIs defined in ga4gh-schemas. RETIRED 2018-01-24 ☆96 · Updated 7 years ago
- Bioinformatics for the Scala programming language ☆110 · Updated 11 months ago
- Integrative Genomics Viewer. Fast, efficient, scalable visualization tool for genomics data and annotations ☆654 · Updated this week
- An open source platform for managing and analyzing biomedical big data ☆399 · Updated this week
- GenomicsDB ☆111 · Updated 2 years ago
- Bioinformatics containers ☆700 · Updated 2 weeks ago
- A Java API for high-throughput sequencing data (HTS) formats. ☆285 · Updated this week
- Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis ☆995 · Updated 5 months ago
- A toolkit to learn how to model and interpret regulatory sequence data using deep learning. ☆259 · Updated last year
- bedtools - the swiss army knife for genome arithmetic ☆952 · Updated 2 weeks ago
- SparkBWA is a new tool that exploits the capabilities of a Big Data technology as Apache Spark to boost the performance of one of the mos… ☆69 · Updated 5 years ago
- Simplifying robust end-to-end machine learning on Apache Spark. ☆470 · Updated 7 years ago
- Python for Bioinformatics ☆249 · Updated 4 years ago
- Incubator for useful bioinformatics code, primarily in Python and R ☆613 · Updated last year