victorskl / genomic-bigdata-spark
Genomic BigData Warehousing with Apache Spark and LakeHouse Architecture
☆11Updated 2 years ago
Alternatives and similar repositories for genomic-bigdata-spark:
Users that are interested in genomic-bigdata-spark are comparing it to the libraries listed below
- VCF Observer is a VCF file analysis, comparison, and visualization tool.☆16Updated 2 months ago
- Tool for finding matches to degenerate sequence motifs in FASTA files.☆12Updated 11 months ago
- Tokenizers and Machine Learning Models for biological sequence data☆25Updated 5 months ago
- A provenance library for bioinformatics workflows 🧬 🔀 📝☆14Updated 3 years ago
- Viral Identification and Discovery - A viral characterization pipeline built in Nextflow.☆11Updated 4 years ago
- Deep learning library for biological sequences. Extension of Fastai and Pytorch.☆40Updated 2 weeks ago
- Pipeline for the identification of (coding) gene structures in draft genomes.☆27Updated 10 months ago
- Namespace encoding hierarchical relationships between proteins, protein families, and protein complexes.☆12Updated 4 years ago
- NEAT (NExt-generation Analysis Toolkit) simulates next-gen sequencing reads and can learn simulation parameters from real data.☆51Updated last week
- A repository for the GenGraph toolkit for the creation and manipulation of graph genomes☆51Updated 3 years ago
- toolkit for file system virtualisation of random access compressed FASTA, FAI, DICT & TWOBIT files☆22Updated 6 months ago
- Forensic analysis tool useful in backwards computing information from next-generation sequencing data.☆11Updated last week
- Feature Annotation Location Description Ontology☆34Updated 5 years ago
- LLM-based gene function enrichment tool☆10Updated 2 weeks ago
- A very simple BLAST filtering pipeline☆18Updated 10 years ago
- Integrative visualization of multiple omic datasets onto KEGG pathways.☆11Updated 3 years ago
- DuckDB Extension for working with bioinformatic data.☆14Updated last year
- Cython bindings and Python interface to trimAl, a tool for automated alignment trimming. Now with SIMD!☆23Updated this week
- Listing of GPU based bioinformatics software & sites & publications☆10Updated 3 years ago
- Galaxy on AWS Guidance provides all the infrastructure components required to run Galaxy in the cloud and are preconfigured with industry…☆15Updated last month
- Linter rules for Nextflow DSL scripts☆30Updated 2 months ago
- Experimental plugin to integrate GPT like prompt into Nextflow☆15Updated 10 months ago
- evaluating vcf parsing libraries☆18Updated 3 years ago
- The PanGenome Graph Builder☆14Updated 7 months ago
- MOVIS: A Multi-Omics Software Solution for Multi-modal Time-Series Clustering, Embedding, and Visualizing Tasks, by Aleksandar Anžel, Dom…☆10Updated 2 years ago
- 🧬 MSABrowser: dynamic and fast visualization of sequence alignments, variations, and annotations☆32Updated 9 months ago
- ViraPipe is distributed Apache Spark based metagenome analytics pipeline for scalable detection of pathogens from NGS data☆16Updated 6 years ago
- Library for visualising genomic features in Python.☆15Updated 7 years ago
- Oxford Nanopore HDF/Fast5 to CRAM conversion tool☆22Updated 5 years ago