CatworldLee / Gaussian-Mixture-Mask-Attention
☆8 Updated 7 months ago
Alternatives and similar repositories for Gaussian-Mixture-Mask-Attention
Users who are interested in Gaussian-Mixture-Mask-Attention are comparing it to the libraries listed below.
- Code for paper "Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI" ☆11 Updated last year
- Code for "Don't trust your eyes: on the (un)reliability of feature visualizations" (ICML 2024) ☆32 Updated last year
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models ☆30 Updated 8 months ago
- ☆24 Updated last year
- Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks ☆14 Updated 2 weeks ago
- Pytorch Implementation of CLIP-Lite | Accepted at AISTATS 2023 ☆13 Updated 2 years ago
- Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts" ☆15 Updated 3 months ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" ☆14 Updated 4 months ago
- A token pruning method that accelerates ViTs for various tasks while maintaining high performance. ☆14 Updated 5 months ago
- codes for paper "Interpretability-Aware Vision Transformer" ☆23 Updated last year
- Official implementation of the WACV 2025 paper "3D Part Segmentation via Geometric Aggregation of 2D Visual Features" ☆19 Updated 2 weeks ago
- Official implementation for "Knowledge Distillation with Refined Logits". ☆14 Updated 10 months ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models. ☆18 Updated 6 months ago
- [WACV2023] This is the official PyTorch implementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt… ☆12 Updated 2 years ago
- HGRN2: Gated Linear RNNs with State Expansion ☆55 Updated 10 months ago
- ☆13 Updated 2 years ago
- [BMVC 2022] Information Theoretic Representation Distillation ☆18 Updated last year
- [ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation”(https://arxiv.org/abs/2305.… ☆38 Updated 2 years ago
- Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (h… ☆14 Updated 9 months ago
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L… ☆41 Updated 7 months ago
- Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping ☆17 Updated 2 years ago
- ☆10 Updated 3 months ago
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron… ☆16 Updated 7 months ago
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation ☆12 Updated last year
- ☆9 Updated 2 years ago
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts) ☆22 Updated 10 months ago
- Implementation of the model "Hedgehog" from the paper: "The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry" ☆14 Updated last year
- More dimensions = More fun ☆22 Updated 11 months ago
- Official repo of Progressive Data Expansion: data, code and evaluation ☆29 Updated last year
- ☆12 Updated 3 weeks ago