AryaAftab / rotary-embedding-tensorflow
Implementation of Rotary Embeddings, from the Roformer paper, in Tensorflow
☆12Updated 3 years ago
Alternatives and similar repositories for rotary-embedding-tensorflow:
Users that are interested in rotary-embedding-tensorflow are comparing it to the libraries listed below
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆14Updated 4 years ago
- This repository contains code for reproducing results in our paper Interpreting Potts and Transformer Protein Models Through the Lens of …☆58Updated 2 years ago
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Updated 3 years ago
- ☆34Updated 4 years ago
- ☆25Updated 2 years ago
- Tensorflow port implementation of Single Headed Attention RNN☆16Updated 5 years ago
- ☆17Updated 2 years ago
- UCSF BMI219 Deep Learning (2017), Coding example (Prediction of protein folding with RNN and CNN)☆16Updated 7 years ago
- A tool for predicting the effects of missense mutations on protein stability changes upon missense mutation using protein sequence only. …☆22Updated last year
- Simple implementations of attention modules adapted for the biological data domain.☆13Updated 4 months ago
- Code for the CHAMPS Predicting Molecular Properties Kaggle competition☆52Updated 5 years ago
- Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, …☆29Updated 3 years ago
- f-divergence based t-SNE☆16Updated 2 years ago
- ☆10Updated 2 years ago
- This Python package implements algorithms for multiviews (multimodals) learning☆14Updated 7 months ago
- ☆20Updated 8 years ago
- Implementation of "Semi-supervised learning of hierarchical representations of molecules using neural message passing" (arXiv:1711.10168)☆14Updated 6 years ago
- A molecule generation benchmarking platform☆13Updated 7 years ago
- Usable implementation of Mogrifier, a circuit for enhancing LSTMs and potentially other networks, from Deepmind☆18Updated 10 months ago
- ☆11Updated 6 years ago
- Optimization and Generalization Analysis of Transduction through Gradient Boosting and Application to Multi-scale Graph Neural Networks☆13Updated 4 years ago
- ☆11Updated 3 years ago
- ☆34Updated 3 years ago
- Unofficial Tensorflow-Keras implementation of Fastformer based on paper [Fastformer: Additive Attention Can Be All You Need](https://arxi…☆13Updated 3 years ago
- Histopathologic Cancer Detection model based on Kaggle Challenge https://www.kaggle.com/c/histopathologic-cancer-detection (top 1%)☆11Updated 4 years ago
- A simple Transformer where the softmax has been replaced with normalization☆19Updated 4 years ago
- To be a next-generation DL-based phenotype prediction from genome mutations.☆19Updated 3 years ago
- Gold medal #2 Kaggle "Predicting Molecular Properties" compatition☆61Updated 5 years ago
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction☆31Updated 2 years ago
- A TensorFlow implementation of the paper 'Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks'☆31Updated 11 months ago