wiedersehne / Paramixer

Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention (CVPR 2022)
19Updated last year

Related projects: