LeapLabTHU / UniTTALinks
☆17Updated 7 months ago
Alternatives and similar repositories for UniTTA
Users that are interested in UniTTA are comparing it to the libraries listed below
Sorting:
- ☆13Updated 9 months ago
- [NeurIPS 2022] Latency-aware Spatial-wise Dynamic Networks☆24Updated 2 years ago
- Jittor implementation of Vision Transformer with Deformable Attention☆31Updated 3 years ago
- Official implementation of Dynamic Perceiver☆43Updated last year
- [NeurIPS 2024] ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis☆24Updated 10 months ago
- [ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators☆45Updated last year
- [ICML 2024] SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning☆30Updated last year
- [ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation☆34Updated last year
- Official repository of Uni-AdaFocus (TPAMI 2024).☆49Updated 9 months ago
- [IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition☆52Updated 6 months ago
- ☆42Updated 9 months ago
- ☆27Updated 3 years ago
- Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Atte…☆19Updated last year
- [ECCV 2022] Learning to Weight Samples for Dynamic Early-exiting Networks☆36Updated 2 years ago
- Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL☆23Updated last year
- [IEEE TIP] Fine-grained Recognition with Learnable Semantic Data Augmentation☆30Updated last year
- CODA: Repurposing Continuous VAEs for Discrete Tokenization☆28Updated 3 months ago
- IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance, ICCV 2025☆28Updated last week
- Code for paper: Nullu: Mitigating Object Hallucinations in Large Vision-Language Models via HalluSpace Projection☆42Updated 6 months ago
- ☆36Updated 2 years ago
- Repository of GridMix (ICLR 2025)☆30Updated 6 months ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆90Updated 2 years ago
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆46Updated last year
- [CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding☆151Updated last year
- ☆16Updated 10 months ago
- [CVPR 2022] Official repository of AdaFocusV2.☆90Updated 9 months ago
- Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)☆37Updated last year
- Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning☆22Updated 2 years ago
- [TPAMI 2024] Probabilistic Contrastive Learning for Long-Tailed Visual Recognition☆84Updated last year
- (CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction☆130Updated 7 months ago