Code publication for the paper "Normalized Attention Without Probability Cage"
☆17Nov 9, 2021Updated 4 years ago
Alternatives and similar repositories for normalized-attention
Users who are interested in normalized-attention are comparing it to the repositories listed below
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability☆21Jul 16, 2022Updated 3 years ago
- ☆21Mar 15, 2023Updated 2 years ago
- 🖼️📊☆11Jun 9, 2020Updated 5 years ago
- ☆13Nov 12, 2018Updated 7 years ago
- ☆12Mar 16, 2022Updated 3 years ago
- MXNet/Gluon implementation of L-GM-Loss☆11Oct 17, 2018Updated 7 years ago
- MXNet implementation of CapsNet☆29Nov 29, 2017Updated 8 years ago
- ☆22May 3, 2022Updated 3 years ago
- Code for the paper "Query-Key Normalization for Transformers"☆52Mar 6, 2021Updated 4 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆48Nov 30, 2021Updated 4 years ago
- Maximal Mutual Information (MMI) Tagger☆25Jun 6, 2019Updated 6 years ago
- ☆20Mar 14, 2021Updated 4 years ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆34Jun 11, 2025Updated 8 months ago
- The Codebase for Causal Distillation for Language Models (NAACL '22)☆26May 1, 2022Updated 3 years ago
- Implementation of the nearest neighbour CLR paper☆27Mar 17, 2022Updated 3 years ago
- An implementation of 2021 paper by Geoffrey Hinton: "How to represent part-whole hierarchies in a neural network" in Pytorch.☆57Mar 29, 2021Updated 4 years ago
- Converting EfficientNet to Pytorch for use with fastai☆27Jun 5, 2019Updated 6 years ago
- Learning Generative Models across Incomparable Spaces (ICML 2019)☆28Mar 11, 2020Updated 5 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- ☆31Nov 26, 2021Updated 4 years ago
- Graph-based and Transition-based dependency parsers based on BiLSTMs☆30Jan 4, 2019Updated 7 years ago
- A Clojure library for deconstructing Korean unicode syllable characters into alphabet characters☆10Nov 22, 2021Updated 4 years ago
- ☆13Dec 28, 2018Updated 7 years ago
- Detect and reconstruct transparent objects from scan shadows☆10Sep 22, 2017Updated 8 years ago
- informal exposition of Weisfeiler-Leman similarity☆28Apr 30, 2021Updated 4 years ago
- Code for the 2019 TACL Paper "Trick Me If You Can: Human-in-the-loop Generation of Adversarial Question Answering Examples"☆36Jul 3, 2019Updated 6 years ago
- ☆37Jul 22, 2019Updated 6 years ago
- Pedagogical codebase for a simplified score-based generative model design, with training loop☆40Aug 28, 2021Updated 4 years ago
- Code repository for "AOWS: Adaptive and optimal network width search with latency constraints", CVPR 2020☆36Jun 19, 2020Updated 5 years ago
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆116Nov 30, 2022Updated 3 years ago
- Pascal2 Harvest project QuEst☆14Sep 15, 2014Updated 11 years ago
- CLASP - Contrastive Language-Aminoacid Sequence Pretraining☆142Sep 17, 2021Updated 4 years ago
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019☆10Jan 25, 2024Updated 2 years ago
- College project about the article http://www.cs.ust.hk/~quan/publications/yuan-deblur-siggraph07.pdf☆10Jan 25, 2013Updated 13 years ago
- ☆13Jul 20, 2023Updated 2 years ago
- This repository provides the code for replicating the experiments in the paper "Building One-Shot Semi-supervised (BOSS) Learning up to F…☆36Aug 9, 2020Updated 5 years ago
- An implementation of U-Net using MXNet Gluon☆11Apr 3, 2018Updated 7 years ago
- Topic modelling and co-occurrence analysis of the bio-economy☆10Jul 17, 2017Updated 8 years ago
- Code for the papers "Induction of Subgoal Automata for Reinforcement Learning" (AAAI-20) and "Induction and Exploitation of Subgoal Autom…☆13Aug 15, 2023Updated 2 years ago