OliverRichter/normalized-attention

Code publication to the paper "Normalized Attention Without Probability Cage"

☆17

Alternatives and similar repositories for normalized-attention

Users who are interested in normalized-attention are comparing it to the libraries listed below.

AranKomat / Metroplex
☆21 · Mar 15, 2023 · Updated 3 years ago
lucidrains / mlp-gpt-jax
A GPT, made only of MLPs, in Jax
☆59 · Jun 23, 2021 · Updated 5 years ago
photogeniq / image-encoders
🖼️📊
☆11 · Jun 9, 2020 · Updated 6 years ago
lucidrains / all-normalization-transformer
A simple Transformer where the softmax has been replaced with normalization
☆20 · Sep 11, 2020 · Updated 5 years ago
oguiza / DataAugmentation
☆12 · Mar 16, 2022 · Updated 4 years ago
halcy / tpuddim
☆22 · May 3, 2022 · Updated 4 years ago
PetarV- / X-CNN
Cross-modal convolutional neural networks
☆11 · Aug 29, 2017 · Updated 8 years ago
HA-Transformer / MAT
The implementation of multi-branch attentive Transformer (MAT).
☆33 · Aug 27, 2020 · Updated 5 years ago
nshepperd / jaxtorch
A JAX nn library
☆21 · Sep 9, 2025 · Updated 10 months ago
LeeJuly30 / L-GM-Loss-For-Gluon
MXNet/Gluon implementation of L-GM-Loss
☆11 · Oct 17, 2018 · Updated 7 years ago
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆49 · Jan 27, 2022 · Updated 4 years ago
leo-liuzy / probe-across-time
☆22 · Aug 31, 2021 · Updated 4 years ago
renll / SparseLT
[EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing
☆14 · Feb 10, 2023 · Updated 3 years ago
karlstratos / mmi-tagger
Maximal Mutual Information (MMI) Tagger
☆26 · Jun 6, 2019 · Updated 7 years ago
lucidrains / ESBN-pytorch
Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch
☆25 · Jan 6, 2021 · Updated 5 years ago
yaohungt / Capsules-Inverted-Attention-Routing
[ICLR'20] [PyTorch] Inverted Attention Routing for Capsules
☆29 · Feb 26, 2020 · Updated 6 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32 · Feb 21, 2020 · Updated 6 years ago
RedRyan111 / GLOM
An implementation of 2021 paper by Geoffrey Hinton: "How to represent part-whole hierarchies in a neural network" in Pytorch.
☆58 · Mar 29, 2021 · Updated 5 years ago
tonyduan / rs4a
Randomized Smoothing of All Shapes and Sizes (ICML 2020).
☆51 · Jul 23, 2020 · Updated 6 years ago
robert-giaquinto / gradient-boosted-normalizing-flows
We got a stew going!
☆27 · Oct 3, 2023 · Updated 2 years ago
facebookresearch / GraphLog
API for accessing the GraphLog dataset
☆91 · May 3, 2024 · Updated 2 years ago
uber-research / Synthetic-Petri-Dish
☆42 · May 18, 2020 · Updated 6 years ago
avisingh599 / cog
[CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning
☆35 · Oct 28, 2020 · Updated 5 years ago
conda-forge / jaxlib-feedstock
A conda-smithy repository for jaxlib.
☆17 · Jul 3, 2026 · Updated 3 weeks ago
WendyShang / flare
Reinforcement Learning with Latent Flow
☆43 · Mar 25, 2021 · Updated 5 years ago
Edward-Sun / structured-nart
☆15 · Dec 5, 2019 · Updated 6 years ago
SSS135 / aiqn-vae
VAE + Quantile Networks for MNIST
☆12 · Nov 29, 2018 · Updated 7 years ago
CHARM-Tx / linear_mem_attention_pytorch
Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch
☆12 · Jan 16, 2022 · Updated 4 years ago
timvieira / rl
Reference implementation of algorithms for reinforcement learning and Markov decision processes.
☆12 · Jan 28, 2021 · Updated 5 years ago
exalearn / covid-drug-design
Code and analyses related to the ExaLearn drug design efforts
☆11 · Sep 30, 2020 · Updated 5 years ago
keep-smile-001 / opentqa
opentqa is an open framework for textbook question answering, which includes xtqa, mcan, cmr, mfb, and mutan.
☆11 · Mar 27, 2021 · Updated 5 years ago
flowersteam / geppg
☆36 · Aug 10, 2018 · Updated 7 years ago
yzh119 / BPT
Source code of paper "BP-Transformer: Modelling Long-Range Context via Binary Partitioning"
☆127 · Apr 5, 2021 · Updated 5 years ago
microsoft / EA-VQ-VAE
This repo provides the code for the ACL 2020 paper "Evidence-Aware Inferential Text Generation with Vector Quantised Variational AutoEnco…
☆57 · Nov 22, 2020 · Updated 5 years ago
gd-zhang / noisy-quadratic-model
Large-batch Training, Neural Network Optimization
☆10 · Nov 8, 2019 · Updated 6 years ago
MicPie / clasp
CLASP - Contrastive Language-Aminoacid Sequence Pretraining
☆142 · Sep 17, 2021 · Updated 4 years ago
bunnech / gwgan
Learning Generative Models across Incomparable Spaces (ICML 2019)
☆29 · Mar 11, 2020 · Updated 6 years ago
j-towns / vdvae-jax
Very deep VAEs in JAX/Flax
☆47 · Jun 16, 2021 · Updated 5 years ago
moiseshorta / MelGAN-VC
MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms
☆12 · Nov 25, 2021 · Updated 4 years ago