Load genomic BAM files using Apache Spark
☆21Jun 17, 2018Updated 7 years ago
Alternatives and similar repositories for spark-bam
Users that are interested in spark-bam are comparing it to the libraries listed below
Sorting:
- Efficient, distributed downloads of large files from S3 to HDFS using Spark.☆17Apr 26, 2017Updated 8 years ago
- ☆10Feb 28, 2018Updated 8 years ago
- Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework☆72Dec 2, 2022Updated 3 years ago
- Haplotype-based somatic genome simulator☆10May 19, 2017Updated 8 years ago
- Online material and code base for the article Coordinates and Intervals in Graph Based Reference Genomes☆11May 2, 2017Updated 8 years ago
- Workshop content for http://dib-training.readthedocs.org/en/pub/2016-01-13-adv-beg-shell.html☆13Feb 23, 2018Updated 8 years ago
- Fast, Accurate, and Complete SSR Detection in Genomic Sequences☆11Jun 29, 2020Updated 5 years ago
- Gene lists related to cancer immunotherapy☆14Sep 11, 2024Updated last year
- Collection of simple C scripts for parsing vcf or bam files using the htslib C library. These scripts can be used as the starting point f…☆11Dec 11, 2020Updated 5 years ago
- Encore Analysis Server☆13Nov 18, 2025Updated 3 months ago
- List of conferences with talk videos posted online☆12Sep 23, 2023Updated 2 years ago
- Multi-sample genome coverage viewer to observe large, coverage-based anomalies alongside annotations and sample metadata☆58Feb 17, 2022Updated 4 years ago
- Spark VCF data source implementation for Dataframes☆15Jul 15, 2022Updated 3 years ago
- 180+ Java applications for analyzing next generation sequencing data from ChIPSeq, RNASeq, BisSeq, DNASeq, variant annotation/ filtering,…☆18Feb 19, 2026Updated 2 weeks ago
- robust matching of small variant datasets using flexible scoring schemes☆11Mar 26, 2020Updated 5 years ago
- Ready-to-go Parquet-formatted public 'omics datasets☆30Nov 2, 2015Updated 10 years ago
- Finding a scalable alternative to the VCF File for genomics analysis☆14Jan 5, 2017Updated 9 years ago
- Dependency and data pipeline management framework for Spark and Scala☆15Apr 8, 2017Updated 8 years ago
- Allele frequency filter app☆14May 4, 2022Updated 3 years ago
- Prototype of the Libling concept. Libling is a way to add source dependencies to your sbt project.☆13Aug 18, 2017Updated 8 years ago
- Convert sequence IDs between ucsc/refseq/genbank☆16Aug 28, 2018Updated 7 years ago
- Allele frequency filtering for Mendelian variant discovery☆18Sep 27, 2016Updated 9 years ago
- Benchmark pipeline for Structural Variation analyses, funded by the ALLBio.☆24Aug 1, 2014Updated 11 years ago
- Integrative pipeline for profiling DNA copy number and inferring tumor phylogeny☆20Jan 15, 2020Updated 6 years ago
- A mirror of https://bitbucket.org/weischen/pcawg-delly-workflow☆18Jan 22, 2020Updated 6 years ago
- Read CRAM v3 and v2 in node or in the browser☆18Feb 14, 2026Updated 2 weeks ago
- BAM CIGAR / MD transcoder for compact on-memory representation and quick drawing☆20Jul 19, 2023Updated 2 years ago
- A data processing platform for ChIP-seq, RNA-seq, MNase-seq, DNase-seq, ATAC-seq and GRO-seq datasets. Please ignore information on ciphe…☆19Dec 22, 2017Updated 8 years ago
- Library for indexing VCF files for random access searches by rsID☆17Updated this week
- Teaching modules for Human Genome Variation Lab.☆20Jun 6, 2025Updated 8 months ago
- H3ABioNet 16S rDNA diverstity analysis package☆19May 20, 2019Updated 6 years ago
- A simple macro-less logging typeclass with some common backends☆22Feb 25, 2026Updated last week
- GenomicsDB☆109Jan 3, 2023Updated 3 years ago
- Rapid and accurate ancestry inference using SNVs.☆28Aug 15, 2025Updated 6 months ago
- commandline manipulation of genomic variants and NGS reads☆19Sep 6, 2024Updated last year
- Miscellaneous functionality for manipulating Apache Spark RDDs.☆22Dec 29, 2018Updated 7 years ago
- Reference-free variant discovery in large eukaryotic genomes☆42Jul 13, 2021Updated 4 years ago
- A small repo for storing the code for making the files and html for CCRs.☆22Oct 22, 2019Updated 6 years ago
- Single sample network reconstruction in R☆24Apr 6, 2020Updated 5 years ago