kuleshov-group / caduceus
Bi-Directional Equivariant Long-Range DNA Sequence Modeling
☆157Updated last month
Related projects ⓘ
Alternatives and complementary repositories for caduceus
- Benchmarking DNA Language Models on Biologically Meaningful Tasks☆96Updated last week
- Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena☆593Updated 4 months ago
- Repository for StripedHyena, a state-of-the-art beyond Transformer architecture☆267Updated 8 months ago
- Simplified Masked Diffusion Language Model☆202Updated this week
- Orthrus is a mature RNA model for RNA property prediction. It uses a mamba encoder backbone, a variant of state-space models specifical…☆36Updated last week
- BioDiscoveryAgent is an LLM-based AI agent for closed-loop design of genetic perturbation experiments☆24Updated 3 weeks ago
- (Unofficial) Implementation of dilated attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens" (https://arxiv.org/abs/2307…☆50Updated last year
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other h…☆51Updated last year
- ☆13Updated last month
- [NeurIPS 2023] Official codes of "MuSe-GNN: Learning Unified Gene Representation From Multimodal Biological Graph Data"☆26Updated 4 months ago
- Benchmarks for classification of genomic sequences☆116Updated 8 months ago
- A Protein Large Language Model for Multi-Task Protein Language Processing☆137Updated last month
- Implementation of Infini-Transformer in Pytorch☆104Updated last month
- ChatCell: Facilitating Single-Cell Analysis with Natural Language☆46Updated 8 months ago
- ☆176Updated 7 months ago
- ☆51Updated 2 months ago
- ProtMamba: a homology-aware but alignment-free protein state space model☆49Updated last week
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆102Updated 3 months ago
- A repository with exploration into using transformers to predict DNA ↔ transcription factor binding☆81Updated 2 years ago
- 🧬 Generative modeling of regulatory DNA sequences with diffusion probabilistic models 💨☆366Updated this week
- RNA foundation model☆203Updated 7 months ago
- A collection of awesome bio-foundation models, including protein, RNA, DNA, gene, single-cell, and so on.☆127Updated last week
- An annotated implementation of the Hyena Hierarchy paper☆31Updated last year
- BEACON: Benchmark for Comprehensive RNA Tasks and Language Models☆16Updated 3 months ago
- Primary RNA sequence model☆29Updated 5 months ago
- Repository for mRNA Paper and CodonBERT publication.☆112Updated 4 months ago
- [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models☆248Updated last week
- ☆10Updated 4 months ago
- Reading list for research topics in state-space models☆238Updated last week
- Protein structure datasets for machine learning.☆101Updated 4 months ago