sparkblastproject / v2
SparkBLAST is a parallelization of a sequence alignment application (BLAST) that employs cloud computing for the provisioning of computational resources and Apache Spark as the coordination framework. Here you will find suplementary material to the article entitled "SparkBLAST: Scalable BLAST processing using in-memory operations", submitted to …
☆9Updated 7 years ago
Alternatives and similar repositories for v2:
Users that are interested in v2 are comparing it to the libraries listed below
- Single Cell Multiplexed Imaging Jupyter Voila Dashboard using CODEX Data☆16Updated last year
- Batch scripts curating BioRxiv and PubMed articles by using Altmetric score.☆11Updated 4 years ago
- MOVIS: A Multi-Omics Software Solution for Multi-modal Time-Series Clustering, Embedding, and Visualizing Tasks, by Aleksandar Anžel, Dom…☆10Updated 2 years ago
- Teaching data science☆8Updated 8 years ago
- Snakemake-like pipeline manager for reproducible Jupyter Notebooks☆17Updated 3 years ago
- Here I show how to use Deep Learning for biological and biomedical Data Integration.☆11Updated 4 years ago
- Namespace encoding hierarchical relationships between proteins, protein families, and protein complexes.☆12Updated 4 years ago
- The Baseline Site Selection Tool implements simulation tools for clinical trial enrollment.☆18Updated 2 years ago
- Parallel Genomic Analysis Toolkit☆14Updated 6 years ago
- Deep learning-based prediction of regulatory genome sequences☆11Updated 4 years ago
- Benchmark for LLM-based Agents in Computational Biology☆26Updated last week
- Peax is a tool for interactive visual pattern search and exploration in epigenomic data based on unsupervised representation learning wit…☆68Updated 2 years ago
- Data stories☆10Updated 6 years ago
- Proteins as words, genomes as documents.☆20Updated 4 years ago
- Tips and tricks for plotting in python☆28Updated 6 years ago
- Pandas ExtensionDtypes for dealing with genomics data☆47Updated 4 months ago
- Tools for munging genomic data☆21Updated 5 years ago
- Implementation of LSTM for detecting regions of Neanderthal introgression in modern human genomes☆9Updated 5 years ago
- Explore biomolecular pathways in Reactome from the command-line or a Python script☆22Updated 5 months ago
- Major Histocompatibility Complex (MHC) Binding Affinity Prediction☆10Updated 3 years ago
- Recipes for bioinformatics analyses with scikit-bio☆41Updated 8 years ago
- Platform for integrating genomic analysis with Jupyter Notebooks.☆44Updated 7 months ago
- Code for Russell et al. "A large-scale analysis of bioinformatics code on GitHub"☆31Updated 5 years ago
- Efficiently keep track of changes to genomes☆38Updated 8 months ago
- Examples using Clustergrammer2 to explore high-dimensional datasets.☆40Updated 4 years ago
- Ultrafast consensus module for raw de novo genome assembly of long uncorrected reads☆16Updated 4 years ago
- Search public databases for given genotypic information☆11Updated 7 years ago
- jinja2-enabled jupyter notebooks☆37Updated 7 months ago
- bioinformatics visualization tools with pyviz/bokeh☆20Updated last year
- Methods for the parallel and distributed analysis and mining of the Protein Data Bank using MMTF and Apache Spark.☆68Updated last year