Tokenizers and Machine Learning Models for biological sequence data
☆25Sep 27, 2024Updated last year
Alternatives and similar repositories for bioseq
Users that are interested in bioseq are comparing it to the libraries listed below
Sorting:
- C++ library for CUDA accelerated computation of Non-negative Matrix Factorizations (NMF)☆12Mar 22, 2017Updated 8 years ago
- ☆10Jun 16, 2022Updated 3 years ago
- DartMinHash: Fast Sketching for Weighted Sets☆12Dec 8, 2025Updated 2 months ago
- ☆12Sep 7, 2019Updated 6 years ago
- ☆14Jan 31, 2020Updated 6 years ago
- MISSION: Ultra Large-Scale Feature Selection using Count-Sketches☆13Oct 6, 2019Updated 6 years ago
- ☆13Jan 23, 2020Updated 6 years ago
- ☆62Sep 15, 2025Updated 5 months ago
- Implementation for sGLMM (Sparse graph-structured linear mixed model)☆15Feb 13, 2023Updated 3 years ago
- VariantStore: A Large-Scale Genomic Variant Search Index☆39Jul 9, 2021Updated 4 years ago
- Wavelet tree based on a fixed block boosting technique☆16May 18, 2021Updated 4 years ago
- ☆14Oct 14, 2020Updated 5 years ago
- Portable Crystal binary distributions for Linux on x86_64☆15Mar 22, 2021Updated 4 years ago
- Repository for modeling PRO-cap data with the BPNet-like model, ProCapNet.☆19Aug 28, 2025Updated 6 months ago
- Indexing & querying large assembly graphs -- in space, no one can hear you miao!☆118Jan 29, 2026Updated last month
- BlockPolish: accurate polishing of long-read assembly via block divide-and-conquer☆17Jun 15, 2023Updated 2 years ago
- Fast and accurate set similarity estimation via containment min hash☆42Jul 19, 2024Updated last year
- Fast and compact locality-preserving minimal perfect hashing for k-mer sets.☆43Nov 18, 2023Updated 2 years ago
- Deep learning model to predict degron sequences☆19Feb 24, 2023Updated 3 years ago
- Barcoded Molecular Families☆22Nov 20, 2017Updated 8 years ago
- ☆20Aug 18, 2020Updated 5 years ago
- Minhash and maxhash library in Python, combining flexibility, expressivity, and performance.☆22Dec 14, 2024Updated last year
- Pan-genome Seed Index☆20Mar 12, 2025Updated 11 months ago
- ☆21Jul 6, 2023Updated 2 years ago
- Texomer: Integrating Analysis of Cancer Genome and Transcriptome Sequencing Data☆21Aug 19, 2020Updated 5 years ago
- normalize, left-align, trim, validate and clean VCF files☆20Jul 22, 2015Updated 10 years ago
- Metabuli: specific and sensitive metagenomic classification via joint analysis of DNA and amino acid.☆166Jan 2, 2026Updated 2 months ago
- Nextflow pipeline to re-process all public single-cell RNA-seq data☆31Jul 9, 2025Updated 7 months ago
- Pan-Genomic Matching Statistics☆55Apr 3, 2024Updated last year
- An integrated high performance bioinformatics toolkit☆23Apr 24, 2019Updated 6 years ago
- Clustering the NCBI nr database with mmseq2 (90% length, 90% identity). Inspired by the NCBI's experimental ClusteredNR database.☆23Nov 7, 2025Updated 3 months ago
- A tool for merging large BWTs☆24Nov 26, 2020Updated 5 years ago
- Metalign: efficient alignment-based metagenomic profiling via containment min hash☆33Sep 12, 2023Updated 2 years ago
- ☆23Sep 27, 2019Updated 6 years ago
- Reference implementations of minimizer schemes to go with the mod-minimizers paper.☆28Apr 24, 2025Updated 10 months ago
- BagMinHash - Minwise Hashing Algorithm for Weighted Sets☆26Aug 26, 2020Updated 5 years ago
- ☆25Oct 26, 2021Updated 4 years ago
- ☆24Apr 2, 2021Updated 4 years ago
- ☆13Updated this week