OliverRichter / normalized-attention
Code for the paper "Normalized Attention Without Probability Cage"
☆17 · Nov 9, 2021 · Updated 4 years ago
Alternatives and similar repositories for normalized-attention
Users who are interested in normalized-attention are comparing it to the repositories listed below.
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability ☆21 · Jul 16, 2022 · Updated 3 years ago
- ☆11 · Jun 9, 2020 · Updated 5 years ago
- ☆13 · Nov 12, 2018 · Updated 7 years ago
- ☆12 · Mar 16, 2022 · Updated 3 years ago
- MXNet/Gluon implementation of L-GM-Loss ☆11 · Oct 17, 2018 · Updated 7 years ago
- A GPT, made only of MLPs, in Jax ☆59 · Jun 23, 2021 · Updated 4 years ago
- MXNet implementation of CapsNet ☆29 · Nov 29, 2017 · Updated 8 years ago
- ☆22 · May 3, 2022 · Updated 3 years ago
- A JAX nn library ☆22 · Sep 9, 2025 · Updated 5 months ago
- A simple Transformer where the softmax has been replaced with normalization ☆20 · Sep 11, 2020 · Updated 5 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation. ☆49 · Nov 30, 2021 · Updated 4 years ago
- ☆21 · Mar 14, 2021 · Updated 4 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆49 · Jan 27, 2022 · Updated 4 years ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization". ☆34 · Jun 11, 2025 · Updated 8 months ago
- The Codebase for Causal Distillation for Language Models (NAACL '22) ☆26 · May 1, 2022 · Updated 3 years ago
- Implementation of the nearest neighbour CLR paper ☆27 · Mar 17, 2022 · Updated 3 years ago
- An implementation of 2021 paper by Geoffrey Hinton: "How to represent part-whole hierarchies in a neural network" in Pytorch. ☆57 · Mar 29, 2021 · Updated 4 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch ☆25 · Jan 6, 2021 · Updated 5 years ago
- Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, … ☆39 · Aug 3, 2021 · Updated 4 years ago
- Converting EfficientNet to Pytorch for use with fastai ☆27 · Jun 5, 2019 · Updated 6 years ago
- Learning Generative Models across Incomparable Spaces (ICML 2019) ☆27 · Mar 11, 2020 · Updated 5 years ago
- Implements the SM3-II adaptive optimization algorithm for PyTorch. ☆33 · Sep 3, 2024 · Updated last year
- [CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning ☆34 · Oct 28, 2020 · Updated 5 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 · Feb 21, 2020 · Updated 5 years ago
- Graph-based and Transition-based dependency parsers based on BiLSTMs ☆30 · Jan 4, 2019 · Updated 7 years ago
- ☆13 · Dec 28, 2018 · Updated 7 years ago
- Detect and reconstruct transparent objects from scan shadows ☆10 · Sep 22, 2017 · Updated 8 years ago
- Deep Learning Part 2, 2019 edition - transcriptions, screenshots and notebooks ☆11 · Jul 19, 2019 · Updated 6 years ago
- ☆36 · Jul 22, 2019 · Updated 6 years ago
- Code for the papers "Induction of Subgoal Automata for Reinforcement Learning" (AAAI-20) and "Induction and Exploitation of Subgoal Autom… ☆13 · Aug 15, 2023 · Updated 2 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes. ☆12 · Jan 28, 2021 · Updated 5 years ago
- Pascal2 Harvest project QuEst ☆14 · Sep 15, 2014 · Updated 11 years ago
- This repository provides the code for replicating the experiments in the paper "Building One-Shot Semi-supervised (BOSS) Learning up to F… ☆36 · Aug 9, 2020 · Updated 5 years ago
- An implementation of U-Net using MXNet Gluon ☆11 · Apr 3, 2018 · Updated 7 years ago
- ☆13 · Jul 20, 2023 · Updated 2 years ago
- College project about article http://www.cs.ust.hk/~quan/publications/yuan-deblur-siggraph07.pdf ☆10 · Jan 25, 2013 · Updated 13 years ago
- Topic modelling and co-occurrence analysis of the bio-economy ☆10 · Jul 17, 2017 · Updated 8 years ago
- CLASP - Contrastive Language-Aminoacid Sequence Pretraining ☆143 · Sep 17, 2021 · Updated 4 years ago
- Temporal Context Network for Activity Localization in Videos ☆31 · Oct 25, 2017 · Updated 8 years ago