kuleshov-group / caduceus
Bi-Directional Equivariant Long-Range DNA Sequence Modeling
☆171 Updated 2 weeks ago
Alternatives and similar repositories for caduceus:
Users who are interested in caduceus are comparing it to the repositories listed below.
- Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders ☆142 Updated 2 months ago
- Repository for StripedHyena, a state-of-the-art beyond Transformer architecture ☆331 Updated 10 months ago
- Benchmarking DNA Language Models on Biologically Meaningful Tasks ☆102 Updated 2 months ago
- My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other h… ☆52 Updated last year
- Orthrus is a mature RNA model for RNA property prediction. It uses a mamba encoder backbone, a variant of state-space models specifical… ☆52 Updated last week
- (Unofficial) Implementation of dilated attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens" (https://arxiv.org/abs/2307… ☆50 Updated last year
- Protein language model customized for antibodies ☆118 Updated last month
- Simplified Masked Diffusion Language Model ☆262 Updated 2 months ago
- ☆19 Updated 4 months ago
- A Protein Large Language Model for Multi-Task Protein Language Processing ☆159 Updated 3 weeks ago
- Official repository for the paper "Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval" ☆143 Updated last year
- ☆237 Updated 10 months ago
- Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena ☆629 Updated 7 months ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers" ☆119 Updated 5 months ago
- Contrasting Sequence with Structure: Pre-training Graph Representations with PLMs ☆23 Updated 10 months ago
- AFusion: AlphaFold 3 GUI & Toolkit with Visualization ☆90 Updated last month
- [NeurIPS 2024] BEACON: Benchmark for Comprehensive RNA Tasks and Language Models ☆26 Updated 5 months ago
- Benchmarks for classification of genomic sequences ☆126 Updated 11 months ago
- Official Implementation of DPLM (ICML'24) - Diffusion Language Models Are Versatile Protein Learners ☆107 Updated 2 months ago
- ☆132 Updated 10 months ago
- Implementation and replication of ProGen, Language Modeling for Protein Generation, in Jax ☆110 Updated 3 years ago
- A repository with exploration into using transformers to predict DNA ↔ transcription factor binding ☆82 Updated 2 years ago
- ChatCell: Facilitating Single-Cell Analysis with Natural Language ☆48 Updated 11 months ago
- Repository for mRNA Paper and CodonBERT publication. ☆118 Updated 7 months ago
- Dirichlet Diffusion Score Model for Biological Sequence Generation. ☆46 Updated 8 months ago
- A collection of awesome bio-foundation models, including protein, RNA, DNA, gene, single-cell, and so on. ☆175 Updated last month
- ☆96 Updated 8 months ago
- Nature Methods: RNA foundation model (together with RhoFold) ☆235 Updated 2 months ago
- Official repo for CellPLM: Pre-training of Cell Language Model Beyond Single Cells. ☆77 Updated 10 months ago
- Primary RNA sequence model ☆32 Updated 8 months ago