CatworldLee / Gaussian-Mixture-Mask-Attention
☆9Updated 5 months ago
Alternatives and similar repositories for Gaussian-Mixture-Mask-Attention:
Users who are interested in Gaussian-Mixture-Mask-Attention are comparing it to the libraries listed below.
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels"☆13Updated 2 months ago
- Implementation of the model "Hedgehog" from the paper: "The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry"☆13Updated last year
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆18Updated 3 months ago
- HGRN2: Gated Linear RNNs with State Expansion☆54Updated 7 months ago
- [ACL 2023] Code for the paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation” (https://arxiv.org/abs/2305.…☆38Updated last year
- Collect papers about Mamba (a selective state space model).☆14Updated 8 months ago
- Decoupled Kullback-Leibler Divergence Loss (DKL), NeurIPS 2024 / Generalized Kullback-Leibler Divergence Loss (GKL)☆43Updated last week
- [ICLR 2023] “Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆24Updated 2 years ago
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆29Updated 6 months ago
- [WACV2023] This is the official PyTorch implementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…☆12Updated 2 years ago
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆16Updated 5 months ago
- Official implementation for "Knowledge Distillation with Refined Logits".☆13Updated 7 months ago
- Official code for the paper "Attention as a Hypernetwork"☆27Updated 9 months ago
- PyTorch implementation of CLIP-Lite | Accepted at AISTATS 2023☆13Updated 2 years ago
- SkyScenes: A Synthetic Dataset for Aerial Scene Understanding☆18Updated 6 months ago
- [ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yue…☆11Updated 3 weeks ago
- Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping☆17Updated 2 years ago
- This repository provides a multi task benchmark for instance segmentation, depth estimation, and 3D object detection.☆14Updated last year
- Official implementation of the WACV 2025 paper "3D Part Segmentation via Geometric Aggregation of 2D Visual Features"☆16Updated last month
- A simpler PyTorch + Zeta implementation of the paper: "SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series…☆28Updated 5 months ago
- Unofficial implementation of the paper "Exploring the Space of Key-Value-Query Models with Intention"☆11Updated last year
- TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment☆9Updated last month
- Survey of small language models☆14Updated 8 months ago
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆35Updated 4 months ago
- Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆15Updated last month
- Official code for the paper "Image generation with shortest path diffusion" accepted at ICML 2023.☆23Updated last year