savadikarc / gift
GIFT: Generative Interpretable Fine-Tuning
☆17 Updated 2 months ago
Related projects:
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)" ☆23 Updated 3 months ago
- Benchmarking Attention Mechanism in Vision Transformers. ☆16 Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation ☆35 Updated last year
- ☆20 Updated 9 months ago
- REVO-LION: Evaluating and Refining Vision-Language Instruction Tuning Datasets ☆11 Updated 11 months ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model ☆15 Updated 5 months ago
- Code for paper "Unsegment Anything by Simulating Deformation" (CVPR 2024) ☆21 Updated 3 months ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones". ☆22 Updated 7 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks ☆28 Updated 3 months ago
- ☆16 Updated 2 years ago
- BESA is a differentiable weight pruning technique for large language models. ☆12 Updated 6 months ago
- Stay tuned! ☆11 Updated 5 months ago
- TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment ☆9 Updated 8 months ago
- ☆19 Updated last year
- OpenMMLab Detection Toolbox and Benchmark for V3Det ☆15 Updated 5 months ago
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis" ☆26 Updated 3 months ago
- ☆36 Updated 4 months ago
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning ☆25 Updated last month
- DiverGen (CVPR 2024) & BSGAL (ICML 2024) ☆33 Updated 3 weeks ago
- ☆12 Updated last month
- Sambor: Boosting Segment Anything Model Towards Open-Vocabulary Learning ☆30 Updated 9 months ago
- ☆16 Updated this week
- PyTorch code for Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers ☆27 Updated 2 weeks ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter". ☆15 Updated last year
- (ICLR 2024, CVPR 2024) SparseFormer ☆62 Updated 5 months ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers ☆27 Updated last year
- Video Diffusion State Space Models ☆19 Updated 5 months ago
- ☆27 Updated 5 months ago
- An mmcv logger hook that can push model results to WeChat ☆20 Updated last year
- ☆17 Updated last week