qiaoqiaoLF / MxDNA
[NeurIPS 2024] Model Decides How to Tokenize: Adaptive DNA Sequence Tokenization with MxDNA
☆22 Updated 8 months ago
Alternatives and similar repositories for MxDNA
Users who are interested in MxDNA are comparing it to the libraries listed below.
- Benchmarking DNA Language Models on Biologically Meaningful Tasks ☆127 Updated last year
- Explore a comprehensive collection of basic theories, applications, papers, and best practices about Large Language Models (LLMs) in geno… ☆111 Updated last week
- [NeurIPS 2024] BEACON: Benchmark for Comprehensive RNA Tasks and Language Models ☆57 Updated last year
- ☆14 Updated last month
- ☆45 Updated 2 years ago
- ☆23 Updated 6 months ago
- ☆22 Updated 8 months ago
- Benchmarks for classification of genomic sequences ☆167 Updated 4 months ago
- Evaluating genomic sequence models for explaining personalized expression variation ☆19 Updated 2 years ago
- Primary RNA sequence model ☆41 Updated last year
- Inference and numerics for multi-hybrid AI model architectures ☆84 Updated 2 weeks ago
- The official code implementation for Chromoformer in PyTorch. (Lee et al., Nature Communications. 2022) ☆36 Updated 2 years ago
- ☆14 Updated 2 months ago
- [ISMB 2025] DNABERT_S: Learning Species-Aware DNA Embedding with Genome Foundation Models ☆118 Updated 10 months ago
- Computational Optimization of DNA Activity (CODA) ☆65 Updated 8 months ago
- Orthrus is a mature RNA model for RNA property prediction. It uses a mamba encoder backbone, a variant of state-space models specifical… ☆85 Updated 2 weeks ago
- RNA-seq prediction with deep convolutional neural networks. ☆216 Updated 4 months ago
- Repository for the paper: "SPACE: STRING proteins as complementary embeddings" ☆32 Updated 3 weeks ago
- ☆75 Updated last year
- ☆88 Updated 4 months ago
- PyTorch implementation of the Borzoi model from Calico, and Flashzoi, a 3x faster Borzoi enhancement. ☆89 Updated last month
- Toolkit for training hyenaDNA-based autoregressive language models on DNA sequences. ☆50 Updated last year
- Code repository for study "Evaluating the representational power of pre-trained DNA language models for regulatory genomics" ☆24 Updated last year
- ☆76 Updated last year
- Official repo for CellPLM: Pre-training of Cell Language Model Beyond Single Cells. ☆100 Updated last year
- Diffusion Model for Single-Cell Multiome Data Generation and Analysis ☆26 Updated 2 weeks ago
- For fine-tuning Enformer using paired WGS & gene expression data ☆23 Updated 4 months ago
- ☆42 Updated 5 months ago
- ☆122 Updated last month
- A collection of awesome bio-foundation models, including protein, RNA, DNA, gene, single-cell, and so on. ☆279 Updated 7 months ago