kuleshov-group / caduceus
Bi-Directional Equivariant Long-Range DNA Sequence Modeling
☆187Updated 3 months ago
Alternatives and similar repositories for caduceus:
Users that are interested in caduceus are comparing it to the libraries listed below
- Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders☆170Updated 3 months ago
- Repository for StripedHyena, a state-of-the-art beyond Transformer architecture☆369Updated last year
- Pretraining infrastructure for multi-hybrid AI model architectures☆154Updated 2 weeks ago
- Benchmarking DNA Language Models on Biologically Meaningful Tasks☆114Updated 6 months ago
- [NeurIPS 2024] BEACON: Benchmark for Comprehensive RNA Tasks and Language Models☆38Updated 9 months ago
- Orthrus is a mature RNA model for RNA property prediction. It uses a mamba encoder backbone, a variant of state-space models specifical…☆62Updated 3 months ago
- ☆21Updated 2 months ago
- Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena☆673Updated 2 weeks ago
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other h…☆52Updated last year
- A Protein Large Language Model for Multi-Task Protein Language Processing☆175Updated last week
- [ICML 2024] VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling☆10Updated 7 months ago
- Benchmarks for classification of genomic sequences☆142Updated last month
- AI-Driven Digital Organism (AIDO) is a system of multiscale foundation models for predicting, simulating and programming biology at all l…☆85Updated 4 months ago
- Primary RNA sequence model☆37Updated 11 months ago
- ☆39Updated last year
- Nature Methods: RhoFold+, Accurate RNA 3D structure prediction using a language model-based deep learning approach☆142Updated last month
- Repository for mRNA Paper and CodonBERT publication.☆128Updated 10 months ago
- A collection of awesome bio-foundation models, including protein, RNA, DNA, gene, single-cell, and so on.☆221Updated last week
- 🧬 Generative modeling of regulatory DNA sequences with diffusion probabilistic models 💨☆389Updated this week
- Implementation of Chroma, generative models of protein using DDPM and GNNs, in Pytorch☆158Updated 2 years ago
- GenSLMs: Genome-scale language models reveal SARS-CoV-2 evolutionary dynamics☆128Updated 8 months ago
- Dirichlet Diffusion Score Model for Biological Sequence Generation.☆51Updated 2 months ago
- Official Implemetation of DPLM (ICML'24) - Diffusion Language Models Are Versatile Protein Learners☆159Updated last week
- Nature Methods: RNA foundation model (together with RhoFold)☆267Updated last week
- Official repository for the paper "Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval"☆151Updated last year
- Implementation of Enformer, Deepmind's attention network for predicting gene expression, in Pytorch☆482Updated 7 months ago
- RNA-seq prediction with deep convolutional neural networks.☆146Updated last month
- ChatCell: Facilitating Single-Cell Analysis with Natural Language☆49Updated last year
- [ICML-23 ORAL] ProtST: Multi-Modality Learning of Protein Sequences and Biomedical Texts☆97Updated last year
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆132Updated 3 months ago