lucidrains / flash-genomics-model
My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other hierarchical methods)
☆52Updated last year
Alternatives and similar repositories for flash-genomics-model:
Users that are interested in flash-genomics-model are comparing it to the libraries listed below
- RITA is a family of autoregressive protein models, developed by LightOn in collaboration with the OATML group at Oxford and the Debora Ma…☆96Updated 2 years ago
- Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders☆166Updated 2 months ago
- JAX/Flax implementation of the Hyena Hierarchy☆34Updated last year
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction☆31Updated 2 years ago
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆49Updated 8 months ago
- An annotated implementation of the Hyena Hierarchy paper☆32Updated last year
- Implementation of GateLoop Transformer in Pytorch and Jax☆87Updated 9 months ago
- A repository with exploration into using transformers to predict DNA ↔ transcription factor binding☆84Updated 2 years ago
- Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk☆46Updated last year
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆44Updated last year
- Ledidi turns any machine learning model into a biological sequence editor, allowing you to design sequences with desired properties.☆72Updated last week
- Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, …☆29Updated 3 years ago
- ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802☆93Updated last year
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆50Updated 3 weeks ago
- some common Huggingface transformers in maximal update parametrization (µP)☆80Updated 3 years ago
- Replication attempt for the Protein Folding Model described in https://www.biorxiv.org/content/10.1101/2021.08.02.454840v1☆37Updated 2 years ago
- Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster☆64Updated this week
- Standalone Product Key Memory module in Pytorch - for augmenting Transformer models☆78Updated 8 months ago
- Experiments around a simple idea for inducing multiple hierarchical predictive model within a GPT☆211Updated 7 months ago
- Implementation of Infini-Transformer in Pytorch☆110Updated 3 months ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆99Updated 2 years ago
- Bi-Directional Equivariant Long-Range DNA Sequence Modeling☆178Updated 3 months ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆118Updated 6 months ago
- Your favourite classical machine learning algos on the GPU/TPU☆20Updated 3 months ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆57Updated last year
- A MAD laboratory to improve AI architecture designs 🧪☆109Updated 4 months ago
- ☆50Updated 4 months ago
- ☆34Updated last year
- Code for papers Linear Algebra with Transformers (TMLR) and What is my Math Transformer Doing? (AI for Maths Workshop, Neurips 2022)☆67Updated 8 months ago
- Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch☆226Updated 7 months ago