michaelsdr / sinkformers
Transformers with doubly stochastic attention
☆40Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for sinkformers
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆38Updated 9 months ago
- Euclidean Wasserstein-2 optimal transportation☆44Updated last year
- ☆33Updated last year
- Laplace Redux -- Effortless Bayesian Deep Learning☆38Updated last year
- [NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models☆43Updated last year
- Official repository of Implicit Neural Convolutional Kernels for Steerable CNNs, Zhdanov et al.☆26Updated 8 months ago
- Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations"☆40Updated last year
- ☆31Updated 5 months ago
- Kernel Stein Discrepancy Descent : a method to sample from unnormalized densities☆21Updated 7 months ago
- PyTorch implementation for our ICLR 2024 paper "Diffusion Generative Flow Samplers: Improving learning signals through partial trajectory…☆22Updated 11 months ago
- ☆16Updated last year
- ☆23Updated 3 years ago
- Improved sampling via learned diffusions (ICLR2024) and an optimal control perspective on diffusion-based generative modeling (TMLR2024)☆52Updated 2 months ago
- Neural Diffusion Processes☆73Updated 3 months ago
- ☆20Updated last month
- Sequence Modeling with Structured State Spaces☆60Updated 2 years ago
- ☆49Updated 3 years ago
- [ICML'21] Improved Contrastive Divergence Training of Energy Based Models☆62Updated 2 years ago
- Tensorflow implementation and notebooks for Implicit Maximum Likelihood Estimation☆68Updated 2 years ago
- ☆52Updated 3 months ago
- scipy linear operators for the Hessian, Fisher/GGN, and more in PyTorch☆18Updated 2 weeks ago
- ☆31Updated 4 years ago
- Code for the paper https://arxiv.org/abs/2205.14987v2☆43Updated 7 months ago
- ☆62Updated 9 months ago
- JAX exponential map normalising flows on sphere☆17Updated 4 years ago
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling"☆76Updated last year
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆26Updated 3 years ago
- Official release of code for "Oops I Took A Gradient: Scalable Sampling for Discrete Distributions"☆52Updated last year
- Code for "Invariance Learning in Deep Neural Networks with Differentiable Laplace Approximations"☆23Updated 2 years ago
- Code for "The Intrinsic Dimension of Images and Its Impact on Learning" - ICLR 2021 Spotlight https://openreview.net/forum?id=XJk19XzGq2J☆65Updated 7 months ago