victorskl / genomic-bigdata-spark
Genomic BigData Warehousing with Apache Spark and LakeHouse Architecture
☆11Updated last year
Related projects ⓘ
Alternatives and complementary repositories for genomic-bigdata-spark
- Namespace encoding hierarchical relationships between proteins, protein families, and protein complexes.☆12Updated 3 years ago
- Tool for finding matches to degenerate sequence motifs in FASTA files.☆12Updated 8 months ago
- Home of the Genomic Feature and Variation Ontology (GFVO)☆16Updated 3 years ago
- deploy a snakemake pipeline directly from version control (under development)☆21Updated 2 weeks ago
- PAgeRAnk-flux on Graphlet-guided network for multi-Omic data integratioN - Network Inference☆11Updated last week
- Very large scale k-mer counting and analysis on Apache Spark.☆18Updated 9 months ago
- Feature Annotation Location Description Ontology☆34Updated 4 years ago
- Forensic analysis tool useful in backwards computing information from next-generation sequencing data.☆12Updated this week
- Library for visualising genomic features in Python.☆15Updated 7 years ago
- A specification and Python implementation for representing variants from Multiplexed Assays of Variant Effect.☆11Updated 10 months ago
- Literature mining for T cell relations☆23Updated 2 years ago
- Useful scripts and tools related to alevin-fry☆9Updated 2 years ago
- Implementation of gene-level rare coding variant association tests targeting allelic series: cases where increasingly deleterious mutatio…☆12Updated this week
- Applied Statistics for High-Throughput Biology☆16Updated 3 months ago
- An option to spin cost effective EMR clusters in AWS with Hail and JupyterNotebook installed☆16Updated 4 years ago
- Standard for describing and searching biomedical data developed by the Global Alliance for Genomics & Health.☆24Updated 11 months ago
- BioThings API framework - Making high-performance API for biological annotation data☆46Updated this week
- HuBMAP Data Portal front end☆12Updated this week
- Target discovery platform for exploring rankings of genes, disease models, and other entities. @JKU-ICG @datavisyn☆12Updated 8 months ago
- Short course using RStudio for biological data analysis☆13Updated 2 years ago
- Job Manager API and UI for interacting with asynchronous batch jobs and workflows.☆26Updated 4 months ago
- Viral Identification and Discovery - A viral characterization pipeline built in Nextflow.☆11Updated 4 years ago
- Batch scripts curating BioRxiv and PubMed articles by using Altmetric score.☆11Updated 4 years ago
- Get a nicely-chunked local copy of the biomedical literature (to use for other projects)!☆13Updated 5 months ago
- jinja2-enabled jupyter notebooks☆35Updated 3 months ago
- Repository for development of the genomic module of the CDM.☆19Updated 5 years ago
- CPU and GPU deterministic and therefore fully reproducible machine learning pipelines using MLflow.☆46Updated last year
- 3D Genome Browser☆31Updated 2 years ago