michaelsdr / sinkformers
Transformers with doubly stochastic attention
☆46 Updated 2 years ago
Alternatives and similar repositories for sinkformers
Users who are interested in sinkformers are comparing it to the libraries listed below.
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023) ☆125 Updated last year
- Code repository of the paper "CKConv: Continuous Kernel Convolution For Sequential Data" published at ICLR 2022. https://arxiv.org/abs/21… ☆123 Updated 2 years ago
- Neural Diffusion Processes ☆81 Updated last year
- Sequence Modeling with Structured State Spaces ☆65 Updated 3 years ago
- Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper ☆105 Updated 3 years ago
- Euclidean Wasserstein-2 optimal transportation ☆47 Updated last year
- Code for the paper https://arxiv.org/abs/2205.14987v2 ☆53 Updated last year
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch ☆101 Updated 2 years ago
- ☆63 Updated last year
- A minimalist implementation of score-based diffusion model ☆128 Updated 3 years ago
- Implicit Convolutional Kernels for Steerable CNNs [NeurIPS'23] ☆29 Updated 5 months ago
- Code for paper "Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions" ☆90 Updated 4 years ago
- Official code for "Maximum Likelihood Training of Score-Based Diffusion Models", NeurIPS 2021 (spotlight) ☆147 Updated 3 years ago
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior. ☆41 Updated last year
- Score-Based Generative Modeling with Critically-Damped Langevin Diffusion ☆197 Updated last year
- Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations" ☆41 Updated 2 years ago
- This repository contains code for applying Riemannian geometry in machine learning. ☆77 Updated 4 years ago
- Official release of code for "Oops I Took A Gradient: Scalable Sampling for Discrete Distributions" ☆56 Updated 2 years ago
- Official Release of "Learning the Stein Discrepancy for Training and Evaluating Energy-Based Models without Sampling" ☆49 Updated 5 years ago
- ☆163 Updated 2 years ago
- Gaussian-Bernoulli Restricted Boltzmann Machines ☆104 Updated 2 years ago
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling" ☆85 Updated 2 years ago
- [NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models ☆45 Updated 2 years ago
- ☆17 Updated 2 years ago
- ☆32 Updated last year
- Meta Optimal Transport ☆103 Updated 2 years ago
- ☆54 Updated last year
- Improved sampling via learned diffusions (ICLR2024) and an optimal control perspective on diffusion-based generative modeling (TMLR2024) ☆65 Updated 4 months ago
- Stochastic Normalizing Flows ☆78 Updated 3 years ago
- NF-Layers for constructing neural functionals. ☆87 Updated last year