dnbaker / bioseq
Tokenizers and Machine Learning Models for biological sequence data
☆24 · Updated last month
Related projects
Alternatives and complementary repositories for bioseq
- Bidirectional WFA (Paper) ☆41 · Updated 6 months ago
- A lightweight platform-accelerated library for biological motif scanning using position weight matrices. ☆41 · Updated this week
- GRAph-based Finding of Individual Motif Occurrences ☆28 · Updated 2 months ago
- Cython bindings and Python interface to MUSCLE v5, a highly efficient and accurate multiple sequence alignment software. ☆18 · Updated 6 months ago
- Intel lab's open sourced data science framework for accelerating digital biology ☆37 · Updated this week
- NEAT (NExt-generation Analysis Toolkit) simulates next-gen sequencing reads and can learn simulation parameters from real data. ☆50 · Updated last week
- toolkit for file system virtualisation of random access compressed FASTA, FAI, DICT & TWOBIT files ☆22 · Updated 3 months ago
- A genome browser in your Jupyter notebook ☆26 · Updated 7 months ago
- Learning to untangle genome assembly with graph neural networks. ☆71 · Updated this week
- Plot multiple sequence alignment (MSA) ☆12 · Updated last month
- A Generative Pre-Trained Transformer Package for Pangenomes ☆47 · Updated 6 months ago
- Cython bindings and Python interface to trimAl, a tool for automated alignment trimming. Now with SIMD! ☆20 · Updated 2 months ago
- A tool for simulating random mutations in any genome ☆36 · Updated 9 months ago
- A method for measuring allele-specific TL and characterizing telomere variant repeat (TVR) sequences from long reads. ☆12 · Updated last week
- Small variant, structural variant, and short tandem repeat phasing tool for PacBio HiFi reads ☆70 · Updated last month
- Deep learning embedding for nucleotide sequences ☆16 · Updated 7 months ago
- ☆9 · Updated 5 months ago
- ✂️ Deep learning-based splice site predictor that improves spliced alignments ☆36 · Updated 3 weeks ago
- A bit-packed k-mer representation (and relevant utilities) for rust ☆47 · Updated 4 months ago
- The PanGenome Graph Builder ☆14 · Updated 4 months ago
- A tool for summarizing, extracting, generating and modifying DNA sequences. ☆23 · Updated 3 weeks ago
- Polygraph evaluates and compares groups of nucleic acid sequences based on their sequence and functional content for effective design of … ☆27 · Updated this week
- Reference-guided multiple sequence alignment of viral genomes ☆61 · Updated 2 months ago
- SPROUT is a machine learning tool to predict the DNA repair outcome in CRISPR experiments. ☆15 · Updated 3 years ago
- Fast, sensitive and accurate protein remote homology search on GPUs ☆15 · Updated 6 months ago
- Clair3-Trio: variant calling in trio using Nanopore long-reads ☆14 · Updated 7 months ago
- RNA modification detection using Nanopore raw reads with Deep One Class classification ☆19 · Updated 3 years ago
- A python package for showing JBrowse views ☆23 · Updated 10 months ago
- ☆16 · Updated 8 months ago
- Dynamic, adaptive sampling during nanopore sequencing ☆27 · Updated last month