michaelsdr / sinkformersLinks

Transformers with doubly stochastic attention

☆46

Alternatives and similar repositories for sinkformers

Users that are interested in sinkformers are comparing it to the libraries listed below

Sorting:

thjashin / multires-conv
Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)
☆125Updated last year
dwromero / ckconv
Code repository of the paper "CKConv: Continuous Kernel Convolution For Sequential Data" published at ICLR 2022. https://arxiv.org/abs/21…
☆123Updated 2 years ago
vdutor / neural-diffusion-processes
Neural Diffusion Processes
☆81Updated last year
ag1988 / dss
Sequence Modeling with Structured State Spaces
☆65Updated 3 years ago
AlexiaJM / score_sde_fast_sampling
Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper
☆105Updated 3 years ago
facebookresearch / w2ot
Euclidean Wasserstein-2 optimal transportation
☆47Updated last year
andrew-cr / tauLDR
Code for the paper https://arxiv.org/abs/2205.14987v2
☆53Updated last year
lucidrains / gated-state-spaces-pytorch
Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch
☆101Updated 2 years ago
wgrathwohl / VERA
☆63Updated last year
CW-Huang / sdeflow-light
A minimalist implementation of score-based diffusion model
☆128Updated 3 years ago
maxxxzdn / implicit-steerable-kernels
Implicit Convolutional Kernels for Steerable CNNs [NeurIPS'23]
☆29Updated 5 months ago
didriknielsen / argmax_flows
Code for paper "Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions"
☆90Updated 4 years ago
yang-song / score_flow
Official code for "Maximum Likelihood Training of Score-Based Diffusion Models", NeurIPS 2021 (spotlight)
☆147Updated 3 years ago
GFNOrg / GFlowNet-EM
Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.
☆41Updated last year
nv-tlabs / CLD-SGM
Score-Based Generative Modeling with Critically-Damped Langevin Diffusion
☆197Updated last year
YannDubs / Invariant-Self-Supervised-Learning
Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations"
☆41Updated 2 years ago
georgiosarvanitidis / geometric_ml
This repository contains code for applying Riemannian geometry in machine learning.
☆77Updated 4 years ago
wgrathwohl / GWG_release
Official release of code for "Oops I Took A Gradient: Scalable Sampling for Discrete Distributions"
☆56Updated 2 years ago
wgrathwohl / LSD
Official Release of "Learning the Stein Discrepancy for Training and Evaluating Energy-Based Models without Sampling"
☆49Updated 5 years ago
ctlllll / SGConv
☆163Updated 2 years ago
DSL-Lab / GRBM
Gaussian-Bernoulli Restricted Boltzmann Machines
☆104Updated 2 years ago
GFNOrg / EB_GFN
Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling"
☆85Updated 2 years ago
yilundu / ebm_compositionality
[NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models
☆45Updated 2 years ago
cscarv / riemannian-metric-learning-ot
☆17Updated 2 years ago
necludov / wl-mechanics
☆32Updated last year
facebookresearch / meta-ot
Meta Optimal Transport
☆103Updated 2 years ago
MichaelArbel / GeneralizedEBM
☆54Updated last year
juliusberner / sde_sampler
Improved sampling via learned diffusions (ICLR2024) and an optimal control perspective on diffusion-based generative modeling (TMLR2024)
☆65Updated 4 months ago
noegroup / stochastic_normalizing_flows
Stochastic Normalizing Flows
☆78Updated 3 years ago
AllanYangZhou / nfn
NF-Layers for constructing neural functionals.
☆87Updated last year