dnbaker / bioseq
Tokenizers and Machine Learning Models for biological sequence data
☆25 Updated 6 months ago
Alternatives and similar repositories for bioseq:
Users who are interested in bioseq are comparing it to the libraries listed below.
- Fast, sensitive and accurate protein remote homology search on GPUs ☆15 Updated 10 months ago
- Plot multiple sequence alignment (MSA) ☆14 Updated 6 months ago
- Learning to untangle genome assembly with graph neural networks. ☆72 Updated 4 months ago
- ☆9 Updated 9 months ago
- Cython bindings and Python interface to trimAl, a tool for automated alignment trimming. Now with SIMD! ☆23 Updated 3 weeks ago
- Intel Labs' open-source data science framework for accelerating digital biology ☆44 Updated last month
- Ledidi turns any machine learning model into a biological sequence editor, allowing you to design sequences with desired properties. ☆71 Updated 2 months ago
- A lightweight platform-accelerated library for biological motif scanning using position weight matrices. ☆43 Updated 3 weeks ago
- Diverse Genomic Embedding Benchmark ☆43 Updated last week
- Nextflow pipeline for the computation of structure-based MSAs with AlphaFold2 models ☆12 Updated 2 years ago
- VCF Observer is a VCF file analysis, comparison, and visualization tool. ☆17 Updated 2 months ago
- ☆31 Updated 7 months ago
- Polygraph evaluates and compares groups of nucleic acid sequences based on their sequence and functional content for effective design of … ☆28 Updated 2 months ago
- GRAph-based Finding of Individual Motif Occurrences ☆31 Updated 7 months ago
- Genomic sequence preprocessing toolkit ☆12 Updated last week
- A genome browser in your Jupyter notebook ☆30 Updated 3 months ago
- Cython bindings and Python interface to MUSCLE v5, a highly efficient and accurate multiple sequence alignment software. ☆19 Updated 10 months ago
- Ultra rapid nanopore whole genome sequencing pipeline, published in https://www.nature.com/articles/s41587-022-01221-5 ☆20 Updated 9 months ago
- A Generative Pre-Trained Transformer Package for Pangenomes ☆50 Updated 10 months ago
- Annotated sequence data ☆11 Updated last month
- JAX code for functional genomics ML ☆10 Updated 3 weeks ago
- Clair3-Trio: variant calling in trio using Nanopore long-reads ☆14 Updated 11 months ago
- GitHub repository for a multi-omics integration project involving genomics, epigenomics and transcriptomics ☆11 Updated 4 years ago
- NEAT (NExt-generation Analysis Toolkit) simulates next-gen sequencing reads and can learn simulation parameters from real data. ☆52 Updated 3 weeks ago
- Toolkit for file system virtualisation of random access compressed FASTA, FAI, DICT & TWOBIT files ☆22 Updated 7 months ago
- pathoscore evaluates variant pathogenicity tools and scores. ☆21 Updated 3 years ago
- Deep learning library for biological sequences. Extension of Fastai and Pytorch. ☆40 Updated last month
- Construct a Physical Map from Linked Reads ☆18 Updated 11 months ago
- Infer selection pressures on features of amino acid CDR3 sequences. ☆24 Updated 11 months ago
- A Python package for showing JBrowse views ☆23 Updated last year