lucidrains / flash-genomics-model
My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other hierarchical methods)
☆52Updated last year
Alternatives and similar repositories for flash-genomics-model:
Users that are interested in flash-genomics-model are comparing it to the libraries listed below
- Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders☆159Updated last month
- An annotated implementation of the Hyena Hierarchy paper☆32Updated last year
- RITA is a family of autoregressive protein models, developed by LightOn in collaboration with the OATML group at Oxford and the Debora Ma…☆96Updated 2 years ago
- JAX/Flax implementation of the Hyena Hierarchy☆34Updated last year
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction☆31Updated 2 years ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆87Updated 8 months ago
- Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk☆46Updated last year
- Code for papers Linear Algebra with Transformers (TMLR) and What is my Math Transformer Doing? (AI for Maths Workshop, Neurips 2022)☆67Updated 7 months ago
- Implementation and replication of ProGen, Language Modeling for Protein Generation, in Jax☆112Updated 3 years ago
- Ledidi turns any machine learning model into a biological sequence editor, allowing you to design sequences with desired properties.☆70Updated 2 months ago
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆49Updated 7 months ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆116Updated 4 months ago
- A repository with exploration into using transformers to predict DNA ↔ transcription factor binding☆84Updated 2 years ago
- Bi-Directional Equivariant Long-Range DNA Sequence Modeling☆176Updated 2 months ago
- Replication attempt for the Protein Folding Model described in https://www.biorxiv.org/content/10.1101/2021.08.02.454840v1☆37Updated 2 years ago
- Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, …☆29Updated 3 years ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆44Updated last year
- Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch☆226Updated 6 months ago
- some common Huggingface transformers in maximal update parametrization (µP)☆80Updated 3 years ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆45Updated 3 weeks ago
- Standalone Product Key Memory module in Pytorch - for augmenting Transformer models☆78Updated 7 months ago
- Code for CELL-E: Biological Zero-Shot Text-to-Image Synthesis for Protein Localization Prediction☆28Updated last year
- ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802☆93Updated last year
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆99Updated 2 years ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆23Updated last month
- Experiments around a simple idea for inducing multiple hierarchical predictive model within a GPT☆209Updated 6 months ago
- Implementation of Infini-Transformer in Pytorch☆109Updated 2 months ago
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence☆60Updated 3 years ago
- Latent Diffusion Language Models☆68Updated last year