michaelsdr / sinkformersLinks
Transformers with doubly stochastic attention
☆47Updated 3 years ago
Alternatives and similar repositories for sinkformers
Users that are interested in sinkformers are comparing it to the libraries listed below
Sorting:
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆126Updated last year
- Sequence Modeling with Structured State Spaces☆66Updated 3 years ago
- Official release of code for "Oops I Took A Gradient: Scalable Sampling for Discrete Distributions"☆56Updated 2 years ago
- Euclidean Wasserstein-2 optimal transportation☆47Updated 2 years ago
- Neural Diffusion Processes☆81Updated last year
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆41Updated last year
- Implicit Convolutional Kernels for Steerable CNNs [NeurIPS'23]☆29Updated 6 months ago
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling"☆84Updated 2 years ago
- Improved sampling via learned diffusions (ICLR2024) and an optimal control perspective on diffusion-based generative modeling (TMLR2024)☆66Updated 6 months ago
- Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper☆105Updated 3 years ago
- ☆64Updated last year
- This repository contains code for applying Riemannian geometry in machine learning.☆77Updated 4 years ago
- Gaussian-Bernoulli Restricted Boltzmann Machines☆105Updated 2 years ago
- ☆38Updated 2 years ago
- Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations"☆41Updated 2 years ago
- A minimalist implementation of score-based diffusion model☆129Updated 4 years ago
- Stochastic Normalizing Flows☆77Updated 3 years ago
- Official code for "Maximum Likelihood Training of Score-Based Diffusion Models", NeurIPS 2021 (spotlight)☆149Updated 3 years ago
- Implementation of Action Matching for the Schrödinger equation☆23Updated 2 years ago
- [ICML'21] Improved Contrastive Divergence Training of Energy Based Models☆65Updated 3 years ago
- ☆54Updated last year
- Code repository of the paper "CKConv: Continuous Kernel Convolution For Sequential Data" published at ICLR 2022. https://arxiv.org/abs/21…☆123Updated 2 years ago
- ☆31Updated last year
- Official Release of "Learning the Stein Discrepancy for Training and Evaluating Energy-Based Models without Sampling"☆49Updated 5 years ago
- ☆38Updated 2 years ago
- ☆17Updated 2 years ago
- PyTorch implementation of Algorithm 1 of "On the Anatomy of MCMC-Based Maximum Likelihood Learning of Energy-Based Models"☆38Updated last year
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆101Updated 2 years ago
- Laplace Redux -- Effortless Bayesian Deep Learning☆42Updated 3 months ago
- JAX exponential map normalising flows on sphere☆17Updated 4 years ago