TheSparkBox is an all-in-one Spark deployment that you can use to fire up a local cluster.
☆12 · Jun 26, 2018 · Updated 7 years ago
Alternatives and similar repositories for TheSparkBox
Users who are interested in TheSparkBox are comparing it to the repositories listed below.
- MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion. ☆14 · Apr 12, 2022 · Updated 3 years ago
- Conformal Prediction in Scala ☆15 · Oct 13, 2020 · Updated 5 years ago
- Deploy Spark on OpenStack. Now! ☆11 · Feb 16, 2018 · Updated 8 years ago
- The Lean AI Stack provides a complete end-to-end solution for taking AI into production. ☆10 · Sep 8, 2020 · Updated 5 years ago
- ☆15 · Apr 2, 2025 · Updated 11 months ago
- Parallel Recipes: parallel workflow execution made easy ☆13 · Sep 1, 2015 · Updated 10 years ago
- Viral Identification and Discovery - A viral characterization pipeline built in Nextflow. ☆11 · May 19, 2020 · Updated 5 years ago
- Nextflow workflow for automatic repeat detection, classification and masking ☆13 · Feb 19, 2018 · Updated 8 years ago
- ☆13 · Nov 20, 2015 · Updated 10 years ago
- Trigger the Google Genomics Pipeline API with CWL ☆11 · Feb 7, 2017 · Updated 9 years ago
- Demonstrating the PRoot program ☆11 · Jul 29, 2016 · Updated 9 years ago
- Luslab Nextflow modules ☆14 · Apr 30, 2021 · Updated 4 years ago
- Minimal Docker image for bwa. Not developed any more. ☆11 · Apr 26, 2015 · Updated 10 years ago
- ☆43 · Apr 20, 2016 · Updated 9 years ago
- CPSign - an open source modeling software made for QSAR, written in Java ☆15 · Aug 17, 2024 · Updated last year
- A fast, easy way to present complex bioinformatics pipelines to biologists ☆11 · Sep 28, 2018 · Updated 7 years ago
- Introduction to predictive modeling in Spark with applications in pharmaceutical bioinformatics ☆39 · Feb 13, 2016 · Updated 10 years ago
- Library for indexing VCF files for random access searches by rsID ☆17 · Feb 2, 2026 · Updated last month
- Container-based Slurm cluster with support for running on multiple ssh-accessible computers. Currently it is based on podman, systemd, no… ☆25 · Dec 21, 2020 · Updated 5 years ago
- Documentation and wiki for the PhenoMeNal H2020 E-Infrastructure Project ☆22 · Mar 28, 2025 · Updated 11 months ago
- Heterogeneity-incorporating Workflow ApplicationMaster for YARN ☆26 · Oct 31, 2017 · Updated 8 years ago
- Jupyter kernel for the LAPPS Services DSL. ☆24 · May 3, 2018 · Updated 7 years ago
- Streaming relation (overlap, distance, KNN) of (any number of) sorted genomic interval sets. #golang ☆47 · Jul 12, 2020 · Updated 5 years ago
- An enterprise-ready and vendor-agnostic federated learning platform. ☆166 · Feb 3, 2026 · Updated 3 weeks ago
- A collection of publications on comparison of high-throughput sequencing technologies. ☆27 · Dec 3, 2025 · Updated 2 months ago
- A novel management, annotation, and machine learning framework for analyzing cancer mutations ☆31 · Jul 4, 2024 · Updated last year
- ALPACA is a caller for genomic variants (single nucleotide and small indels) from next-generation sequencing data that uses a novel algeb… ☆23 · Nov 19, 2024 · Updated last year
- UMCU Genetics Nextflow modules ☆30 · Oct 25, 2024 · Updated last year
- ☆28 · Nov 23, 2020 · Updated 5 years ago
- Whole Exome/Whole Genome Sequencing alignment pipeline ☆30 · Sep 18, 2024 · Updated last year
- Linter rules for Nextflow DSL scripts ☆34 · Feb 16, 2026 · Updated 2 weeks ago
- Import a CWL workflow specification to Nextflow script (experimental) ☆27 · Aug 9, 2018 · Updated 7 years ago
- A Scala-based DSL and framework for writing and executing bioinformatics pipelines as Directed Acyclic Graphs ☆69 · May 27, 2022 · Updated 3 years ago
- Scala framework for collecting performance metrics and conducting sound experimental benchmarking. ☆13 · Nov 19, 2025 · Updated 3 months ago
- ☆29 · Jun 12, 2024 · Updated last year
- CheckQC inspects the content of an Illumina runfolder and determines if it passes a set of quality criteria ☆29 · Nov 27, 2025 · Updated 3 months ago
- BigWig manipulation tools using libBigWig and htslib ☆30 · Aug 8, 2024 · Updated last year
- Rapid and robust analysis of RNA-Seq experiments. ☆32 · Apr 16, 2016 · Updated 9 years ago
- Query language for filtering SAM/BAM reads ☆31 · Oct 15, 2024 · Updated last year