michaelsdr / sinkformers
Transformers with doubly stochastic attention
☆45 Updated 2 years ago
Alternatives and similar repositories for sinkformers
Users who are interested in sinkformers are comparing it to the libraries listed below.
- Euclidean Wasserstein-2 optimal transportation ☆47 Updated last year
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior. ☆40 Updated last year
- Stochastic Normalizing Flows ☆76 Updated 3 years ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023) ☆124 Updated last year
- ☆17 Updated last year
- Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper ☆104 Updated 3 years ago
- Sequence Modeling with Structured State Spaces ☆64 Updated 2 years ago
- Implicit Convolutional Kernels for Steerable CNNs [NeurIPS'23] ☆29 Updated 3 months ago
- Code for the paper https://arxiv.org/abs/2205.14987v2 ☆50 Updated last year
- ☆53 Updated 10 months ago
- ☆32 Updated last year
- [ICML'21] Improved Contrastive Divergence Training of Energy Based Models ☆63 Updated 3 years ago
- ☆34 Updated 2 years ago
- Improved sampling via learned diffusions (ICLR2024) and an optimal control perspective on diffusion-based generative modeling (TMLR2024) ☆62 Updated 2 months ago
- Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations" ☆41 Updated 2 years ago
- Laplace Redux -- Effortless Bayesian Deep Learning ☆42 Updated 2 years ago
- Implementation of Action Matching for the Schrödinger equation ☆24 Updated last year
- ☆33 Updated 2 years ago
- Squared Non-monotonic Probabilistic Circuits ☆22 Updated 4 months ago
- ☆34 Updated 2 months ago
- Neural Diffusion Processes ☆80 Updated 10 months ago
- ☆64 Updated last year
- Official release of code for "Oops I Took A Gradient: Scalable Sampling for Discrete Distributions" ☆54 Updated last year
- This repository contains code for applying Riemannian geometry in machine learning. ☆77 Updated 3 years ago
- Implementation of Action Matching ☆44 Updated 2 years ago
- Code for the paper: Rotating Features for Object Discovery ☆52 Updated 9 months ago
- A minimalist implementation of score-based diffusion model ☆127 Updated 3 years ago
- Source code of "What can linearized neural networks actually say about generalization?" ☆20 Updated 3 years ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch ☆100 Updated 2 years ago
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling" ☆82 Updated 2 years ago