dnbaker / bioseq
Tokenizers and Machine Learning Models for biological sequence data
☆22 · Updated last week
Related projects:
- A lightweight platform-accelerated library for biological motif scanning using position weight matrices. ☆39 · Updated 2 weeks ago
- GRAph-based Finding of Individual Motif Occurrences ☆27 · Updated 3 weeks ago
- A Generative Pre-Trained Transformer Package for Pangenomes ☆47 · Updated 4 months ago
- Intel Labs' open-source data science framework for accelerating digital biology ☆36 · Updated 2 weeks ago
- Bidirectional WFA (Paper) ☆40 · Updated 4 months ago
- ☆10 · Updated 3 months ago
- A framework for training graph neural networks to untangle assembly graphs obtained from OLC-based de novo genome assemblers. ☆24 · Updated 3 weeks ago
- A method for measuring allele-specific TL and characterizing telomere variant repeat (TVR) sequences from long reads. ☆12 · Updated last week
- Clair3-Trio: variant calling in trio using Nanopore long-reads ☆14 · Updated 5 months ago
- Toolkit for file system virtualisation of random access compressed FASTA, FAI, DICT & TWOBIT files ☆22 · Updated last month
- Fast, sensitive and accurate protein remote homology search on GPUs ☆15 · Updated 4 months ago
- Construct a Physical Map from Linked Reads ☆18 · Updated 5 months ago
- Proof-of-concept implementation of GWFA for sequence-to-graph alignment ☆56 · Updated 3 months ago
- Interpretable splicing model ☆18 · Updated last year
- pathoscore evaluates variant pathogenicity tools and scores. ☆21 · Updated 2 years ago
- This repository contains all the source files required to run DeLUCS, a deep learning clustering algorithm for DNA sequences. ☆24 · Updated 2 years ago
- ClairS-TO - a deep-learning method for tumor-only somatic variant calling ☆43 · Updated last week
- A tool for summarizing, extracting, generating and modifying DNA sequences. ☆23 · Updated 2 months ago
- Universal RObust Peak Annotator ☆15 · Updated 9 months ago
- A bit-packed k-mer representation (and relevant utilities) for Rust ☆47 · Updated 2 months ago
- A fast, AVX2 and ARM Neon accelerated FM index library ☆28 · Updated last month
- Ultra rapid nanopore whole genome sequencing pipeline, published in https://www.nature.com/articles/s41587-022-01221-5 ☆18 · Updated 2 months ago
- Diverse Genomic Embedding Benchmark ☆30 · Updated 2 weeks ago
- Learning to untangle genome assembly with graph neural networks. ☆69 · Updated last year
- A Python package for showing JBrowse views ☆23 · Updated 8 months ago
- Toolkit for training hyenaDNA-based autoregressive language models on DNA sequences. ☆24 · Updated 3 months ago
- ✂️ Deep learning-based splice site predictor that improves spliced alignments ☆32 · Updated 4 months ago
- A Python library for creating simulated regulatory DNA sequences ☆38 · Updated last year
- Catalogue of pairwise alignment algorithms and benchmarks ☆25 · Updated last month
- 🧬 MSABrowser: dynamic and fast visualization of sequence alignments, variations, and annotations ☆30 · Updated 4 months ago