liuzuyan / ElasticCache
[ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache
☆46Updated last month
Related projects: ⓘ
- ☆39Updated 3 months ago
- Code for paper:An Information Flow Perspective for Exploring Large Vision Language Models on Reasoning Tasks☆58Updated 3 weeks ago
- TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding [ACM MM'21]☆32Updated 2 years ago
- This is the official reproduction of Qihoo-T2X.☆75Updated last week
- A comprehensive collection of resources focused on addressing and understanding hallucination phenomena in MLLMs.☆37Updated 4 months ago
- Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models☆81Updated 5 months ago
- This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tas…☆60Updated 3 weeks ago
- Code for WS3DPT☆76Updated 3 months ago
- Rethinking Video-Text Understanding Retrieval from Counterfactually Augmented Data☆47Updated last month
- [TCSVT 2024] Official PyTorch implementation of the paper "MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Hum…☆22Updated last month
- Official implementation of "Generating images with 3D annotations using diffusion models".☆58Updated 3 weeks ago
- WorldGPT: Empowering LLM as Multimodal World Model☆116Updated last month
- 🐱🐶🐲🐮🐷Implementation of DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer☆37Updated this week
- AI2-THOR Data Collection Tool Based On Keyboard Interaction☆54Updated 2 months ago
- The official generation code and toolkits of VDW dataset (ICCV 2023)☆37Updated 2 months ago
- Intra-class Adaptive Augmentation with Neighbor Correction for Deep Metric Learning, IEEE Transactions on Multimedia (T-MM), 2022☆35Updated last year
- (ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator☆77Updated 3 months ago
- [ECCV 2024] InterFusion: Text-Driven Generation of 3D Human-Object Interaction☆49Updated 2 months ago
- Painting 3D Nature in 2D: View Synthesis of Natural Scenes From a Single Semantic Mask☆41Updated last year
- Official Implementation for "Mask-based modeling for Neural Radiance Fields" (ICLR 2024)☆46Updated 3 months ago
- Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]☆106Updated 5 months ago
- [MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval☆135Updated 3 weeks ago
- [ICLR 2023] Official Tensorflow implementation of "Distributionally Robust Post-hoc Classifiers under Prior Shifts"☆39Updated 7 months ago
- [ICML2022 Long Talk] Official Pytorch implementation of "To Smooth or Not? When Label Smoothing Meets Noisy Labels"☆114Updated 2 years ago
- A Pytorch implementation of ICML 2022 paper "NP-Match: When Neural Processes meet Semi-Supervised Learning"☆127Updated 11 months ago
- u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model☆135Updated 2 months ago
- SpecRef: A Fast Training-free Baseline of Specific Reference-Condition Real Image Editing☆39Updated 7 months ago
- The official project website of "Augmentation-free Dense Contrastive Distillation for Efficient Semantic Segmentation" (Af-DCD for short,…☆19Updated 5 months ago
- ☆49Updated 2 months ago
- Accepted to CVPR 2024, "Interactive continual learning: Fast and slow thinking"☆119Updated 2 months ago