CatworldLee / Gaussian-Mixture-Mask-Attention
☆8Updated 2 months ago
Alternatives and similar repositories for Gaussian-Mixture-Mask-Attention:
Users that are interested in Gaussian-Mixture-Mask-Attention are comparing it to the libraries listed below
- Pytorch Implementation of CLIP-Lite | Accepted at AISTATS 2023☆12Updated last year
- ☆24Updated last year
- Log-Polar Space Convolution for Convolutional Neural Networks☆11Updated 2 years ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆16Updated 2 weeks ago
- Original PyTorch implementation of the paper "Semantic Segmentation under Adverse Conditions: A Weather and Nighttime-aware Synthetic Dat…☆22Updated 10 months ago
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆28Updated 3 months ago
- Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping☆17Updated 2 years ago
- [WACV2023] This is the official PyTorch impelementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…☆12Updated last year
- 🔥 🔥 [WACV2024] Mini but Mighty: Finetuning ViTs with Mini Adapters☆18Updated 6 months ago
- MIMIC: Masked Image Modeling with Image Correspondences☆16Updated 6 months ago
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Updated last month
- [ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆23Updated last year
- [ICRA 2024] Official Implementation of the Paper "Parameter-efficient Prompt Learning for 3D Point Cloud Understanding"☆20Updated 11 months ago
- Multivariate Learned Adaptive Noise for Diffusion Models☆15Updated 3 weeks ago
- Code for "RankDNN: Learning to Rank for Few-shot Learning" accepted to AAAI 2023☆12Updated 9 months ago
- An official PyTorch implementation for CLIPPR☆29Updated last year
- ☆32Updated last year
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆18Updated last year
- TiC: Exploring Vision Transformer in Convolution☆11Updated last year
- SkyScenes: A Synthetic Dataset for Aerial Scene Understanding☆17Updated 3 months ago
- ☆19Updated 3 years ago
- Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".☆9Updated last year
- GIFT: Generative Interpretable Fine-Tuning☆19Updated 3 months ago
- Unofficial reimplementation of ViR: Vision Retention Networks by Hatamizadeh et. al. (https://arxiv.org/abs/2310.19731)☆18Updated 5 months ago
- HGRN2: Gated Linear RNNs with State Expansion☆52Updated 4 months ago
- Pytorch implementation of Mix-Shifting-MLP (MS-MLP)☆16Updated 2 years ago
- ☆14Updated 6 months ago
- [CVPRW 2023, Best Paper Award] DeFlow: Self-supervised 3D Motion Estimation of Debris Flow☆30Updated last year
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year