BD2KGenomics/conductor

Efficient, distributed downloads of large files from S3 to HDFS using Spark.

☆17

Alternatives and similar repositories for conductor

Users who are interested in conductor are comparing it to the repositories listed below.

BD2KGenomics / s3am
A fast, parallel, streaming multipart uploader for S3
☆14 · Mar 23, 2017 · Updated 9 years ago
Steven-N-Hart / VariantDB_Challenge
Finding a scalable alternative to the VCF File for genomics analysis
☆14 · Jan 5, 2017 · Updated 9 years ago
med-at-scale / high-health
Integrate the GA4GH schemas and probably a scala impl of the service.
☆14 · May 20, 2016 · Updated 10 years ago
hammerlab / spark-bam
Load genomic BAM files using Apache Spark
☆21 · Jun 17, 2018 · Updated 8 years ago
bigdatagenomics / cannoli
Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.
☆41 · Mar 17, 2026 · Updated 4 months ago
BD2KGenomics / cgl-docker-lib
☆21 · Jul 3, 2019 · Updated 7 years ago
nfergu / popstrat
Population Stratification Analysis on Genomics Data Using Deep Learning
☆25 · Sep 5, 2016 · Updated 9 years ago
shivaram / spark-ec2
Scripts used to set up a Spark cluster on EC2
☆21 · Mar 24, 2016 · Updated 10 years ago
Consonance / consonance
Core consonance utilities for scheduling, reporting on, and provisioning VMs for workflows
☆14 · Jun 27, 2018 · Updated 8 years ago
BD2KGenomics / toil-scripts
Toil workflows for common genomic pipelines
☆33 · Oct 3, 2019 · Updated 6 years ago
statgenetics / seqspark
SEQSpark documentation
☆18 · Nov 17, 2020 · Updated 5 years ago
bigdatagenomics / eggo
Ready-to-go Parquet-formatted public 'omics datasets
☆30 · Nov 2, 2015 · Updated 10 years ago
allenday / spark-genome-alignment-demo
An example of bioinformatics and big data tools playing nicely together
☆14 · May 17, 2016 · Updated 10 years ago
vgteam / GetBlunted
For bluntifying overlapped GFAs
☆13 · Jul 26, 2024 · Updated last year
BD2KGenomics / cgcloud
Image and VM management for Jenkins, Spark and Mesos clusters in EC2
☆22 · Jul 24, 2020 · Updated 5 years ago
lightning-viz / lightning-scala
Scala client for the Lightning data visualization server (WIP)
☆47 · Jun 25, 2019 · Updated 7 years ago
vgteam / toil-vg
Distributed and cloud computing framework for vg
☆23 · Apr 21, 2026 · Updated 3 months ago
hammerlab / immuno
Use somatic mutations to choose a personalized cancer vaccine (tumor-specific immunogenic peptides)
☆16 · Sep 23, 2016 · Updated 9 years ago
bigdatagenomics / bdg-formats
Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.
☆42 · Feb 13, 2026 · Updated 5 months ago
chapmanb / homebrew-cbl
Homebrew repository for CloudBioLinux: incubator for formulas to end up in homebrew-science
☆19 · Oct 17, 2016 · Updated 9 years ago
volkansevim / alpha-CENTAURI
A python package from Pacific Biosciences to analyze centromeric sequences
☆21 · Oct 7, 2015 · Updated 10 years ago
hammerlab / magic-rdds
Miscellaneous functionality for manipulating Apache Spark RDDs.
☆22 · Dec 29, 2018 · Updated 7 years ago
kelproject / kel-router
HTTP/TLS reverse proxy in Go
☆12 · May 10, 2016 · Updated 10 years ago
Intel-HLS / GenomicsDB
GenomicsDB
☆109 · Jan 3, 2023 · Updated 3 years ago
BRCAChallenge / brca-exchange
Overall management and deployment of the BRCA Exchange web portal and pipeline scripts
☆27 · Updated this week
feliperazeek / spark-algebird-amazon-wordcloud
Sample App. Amazon Product Descriptions Wordcloud. Spark Streaming, Algebird, Storehaus, Redis, Scala Scraper, OpenNLP, Play Framework, D…
☆12 · Nov 9, 2015 · Updated 10 years ago
Gaius-Augustus / clamsa
ClaMSA (Classify Multiple Sequence Alignments).
☆13 · Nov 21, 2024 · Updated last year
xenserver / xha
XenServer high availability daemon
☆13 · Mar 6, 2026 · Updated 4 months ago
felixcheung / vagrant-projects
Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR
☆34 · May 13, 2016 · Updated 10 years ago
vgteam / xg-old
succinct labeled graphs with collections and paths
☆15 · Nov 18, 2018 · Updated 7 years ago
yu-iskw / spark-dataframe-introduction
This is an introduction of Apache Spark DataFrames.
☆41 · Mar 12, 2015 · Updated 11 years ago
gage-russell / pandas-lineage
☆13 · Sep 19, 2022 · Updated 3 years ago
AtlasPilotPuppy / SparkAlgorithms
Additional useful algorithms that can be used with spark.
☆24 · Dec 24, 2014 · Updated 11 years ago
CanonicalLtd / canonical-kubernetes-third-party-integrations
Official repository for Canonical Kubernetes Third Party Integration Documentation
☆10 · Sep 23, 2018 · Updated 7 years ago
braingeneers / SIMS
SIMS: Scalable, Interpretable Models for Cell Annotation of large scale single-cell RNA-seq data
☆12 · Apr 8, 2026 · Updated 3 months ago
ga4gh / tool-registry-service-schemas
APIs for discovering genomics tools, their metadata and their containers
☆35 · Mar 28, 2026 · Updated 3 months ago
jasonbaldridge / twitter4j-tutorial
A simple tutorial application for working with Twitter4j using Scala.
☆14 · Feb 26, 2013 · Updated 13 years ago
jpeelle / paperchecklist
Checklist for scientific papers
☆21 · Sep 2, 2018 · Updated 7 years ago
mohae / feedlot
Generate Packer Templates from JSON or TOML definitions.
☆15 · Mar 14, 2017 · Updated 9 years ago