lucidrains / flash-genomics-model
My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other hierarchical methods)
☆52Updated last year
Alternatives and similar repositories for flash-genomics-model:
Users that are interested in flash-genomics-model are comparing it to the libraries listed below
- RITA is a family of autoregressive protein models, developed by LightOn in collaboration with the OATML group at Oxford and the Debora Ma…☆96Updated 2 years ago
- Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders☆163Updated last month
- Pretraining infrastructure for multi-hybrid AI model architectures☆129Updated 2 weeks ago
- An annotated implementation of the Hyena Hierarchy paper☆32Updated last year
- JAX/Flax implementation of the Hyena Hierarchy☆34Updated last year
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction☆31Updated 2 years ago
- A repository with exploration into using transformers to predict DNA ↔ transcription factor binding☆84Updated 2 years ago
- Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, …☆29Updated 3 years ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆87Updated 9 months ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆116Updated 5 months ago
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆49Updated 8 months ago
- Ledidi turns any machine learning model into a biological sequence editor, allowing you to design sequences with desired properties.☆71Updated 2 months ago
- Replication attempt for the Protein Folding Model described in https://www.biorxiv.org/content/10.1101/2021.08.02.454840v1☆37Updated 2 years ago
- Bi-Directional Equivariant Long-Range DNA Sequence Modeling☆178Updated 2 months ago
- Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk☆46Updated last year
- Implementation of Infini-Transformer in Pytorch☆109Updated 2 months ago
- some common Huggingface transformers in maximal update parametrization (µP)☆80Updated 3 years ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆45Updated last month
- Implementation and replication of ProGen, Language Modeling for Protein Generation, in Jax☆112Updated 3 years ago
- (Unofficial) Implementation of dilated attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens" (https://arxiv.org/abs/2307…☆50Updated last year
- Fork of HyenaDNA, a long-range genomic foundation model built with Hyena☆9Updated last year
- Code for CELL-E: Biological Zero-Shot Text-to-Image Synthesis for Protein Localization Prediction☆28Updated last year
- ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802☆93Updated last year
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence☆60Updated 3 years ago
- Implementation of Chroma, generative models of protein using DDPM and GNNs, in Pytorch☆158Updated 2 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆57Updated last year
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆44Updated last year
- Code for papers Linear Algebra with Transformers (TMLR) and What is my Math Transformer Doing? (AI for Maths Workshop, Neurips 2022)☆67Updated 7 months ago
- Triton Implementation of HyperAttention Algorithm☆47Updated last year
- Implementation of Denoising Diffusion for protein design, but using the new Equiformer (successor to SE3 Transformers) with some addition…☆56Updated 2 years ago