This repo holds the official code and data for "Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human Intentions", which is accepted by ACL 2024 (Findings).
☆16May 21, 2024Updated last year
Alternatives and similar repositories for IVG
Users that are interested in IVG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch implementation of the paper "Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner"☆15Aug 9, 2023Updated 2 years ago
- Med-DANet Series (ECCV 2022 & WACV 2024)☆13Jan 2, 2024Updated 2 years ago
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆72Jun 3, 2024Updated last year
- ☆13Oct 30, 2023Updated 2 years ago
- ☆22May 16, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Entity-Aware and Motion-Aware Transformers for Language-driven Action Localization(IJCAI-22)☆12Oct 11, 2022Updated 3 years ago
- ☆13Jul 20, 2024Updated last year
- Official PyTorch implementation Source code for Weakly Supervised Video Scene Graph Generation via Natural Language Supervision, accepted…☆24Jun 13, 2025Updated 9 months ago
- [CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.☆33Jul 12, 2023Updated 2 years ago
- ChatBridge, an approach to learning a unified multimodal model to interpret, correlate, and reason about various modalities without rely…☆54Sep 4, 2023Updated 2 years ago
- ☆31Nov 17, 2024Updated last year
- [ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."☆26Jan 21, 2026Updated 2 months ago
- Welcome to the official repository of Emotion-Qwen.☆26Jun 10, 2025Updated 9 months ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Jun 20, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10Jan 9, 2025Updated last year
- Adaptive FSS has been Accepted by AAAI 2024. A Novel Few-Shot Segmentation Framework via Prototype Enhancement☆43Mar 11, 2024Updated 2 years ago
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025☆32Feb 22, 2026Updated last month
- Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"☆16Apr 22, 2024Updated last year
- The official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". …☆64Nov 5, 2024Updated last year
- Code for ChordSync, a conformer-based audio-to-chord synchroniser☆13Oct 17, 2025Updated 5 months ago
- https://arxiv.org/abs/2102.12594☆14Oct 3, 2023Updated 2 years ago
- [TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset☆307Dec 25, 2024Updated last year
- [ICLR 2025] Diffusion Feedback Helps CLIP See Better☆301Jan 23, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆32Mar 25, 2024Updated 2 years ago
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆25Mar 16, 2026Updated last week
- PyTorch implementation of Data2Vec self-supervised approach for vision use cases.☆18Oct 7, 2022Updated 3 years ago
- ☆12Jan 4, 2022Updated 4 years ago
- [WACV 2024 Oral] Rethinking Visibility in Human Pose Estimation: Occluded Pose Reasoning via Transformers☆14Jul 6, 2024Updated last year
- ☆16Jan 6, 2025Updated last year
- ☆33Sep 27, 2024Updated last year
- UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning☆161Jun 2, 2025Updated 9 months ago
- Natural language is not enough: Benchmarking multi-modal generative AI for Verilog generation (ICCAD 2024)☆38Jun 17, 2025Updated 9 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- An efficient GRPO training util.☆55Jun 13, 2025Updated 9 months ago
- ☆16Jun 5, 2023Updated 2 years ago
- This is some implements of pattern classificaion course including perceptron,relaxation procedure,MSE,Fisher,Ho-kashyap,SVM,KNN☆13May 29, 2018Updated 7 years ago
- ☆21Oct 10, 2023Updated 2 years ago
- Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation☆17Nov 20, 2022Updated 3 years ago
- Official implementation of the paper 'Perception-Distortion Balanced ADMM Optimization for Single-Image Super-Resolution' in ECCV 2022☆18Aug 9, 2022Updated 3 years ago
- ☆61Oct 13, 2023Updated 2 years ago