Markin-Wang / MixViT

[Pattern Recognition] Mix-ViT: Mixing Attentive Vision Transformer for Ultra-Fine-Grained Visual Categorization.
20Updated last year

Related projects: