michaelsdr / sinkformers

Transformers with doubly stochastic attention
40Updated 2 years ago

Related projects: