[CVPR 2024] Code for "Improved Visual Grounding through Self-Consistent Explanations".
☆27Mar 1, 2024Updated 2 years ago
Alternatives and similar repositories for SelfEQ
Users that are interested in SelfEQ are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆47Jul 17, 2025Updated 9 months ago
- Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)☆28Jun 21, 2024Updated last year
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆23Feb 26, 2025Updated last year
- [AAAI 2024]Weakly Supervised Multimodal Affordance Grounding for Egocentric Images☆13Nov 10, 2024Updated last year
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆62Aug 3, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code of "What Images are More Memorable to Machines?"☆15Feb 13, 2023Updated 3 years ago
- Codes for the AAAI 2023 paper (Oral) "Efficient Mirror Detection via Multi-level Heterogeneous Learning" https://arxiv.org/pdf/2211.1564…☆13Jan 18, 2023Updated 3 years ago
- [CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want☆14Jan 5, 2025Updated last year
- Official Implementation of "Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Func…☆31Dec 2, 2024Updated last year
- [ICCV 2023] Distilling Coarse-to-fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding☆14Oct 2, 2024Updated last year
- Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation (CVPR24)☆11Jun 16, 2024Updated last year
- This is the repo for the work "Where and What: Driver Attention-based Object Detection".☆10May 10, 2022Updated 3 years ago
- Towards Efficient Shapley Value Estimation via Cross-contribution Maximization☆14Jul 8, 2022Updated 3 years ago
- SIEVE: Multimodal Dataset Pruning using Image-Captioning Models (CVPR 2024)☆19Apr 28, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆44Apr 16, 2026Updated 3 weeks ago
- [ICLR 2026] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning☆43Feb 22, 2026Updated 2 months ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated 2 years ago
- Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models"☆12Nov 26, 2024Updated last year
- 华中科技大学计算机视觉课程实验☆16Sep 29, 2024Updated last year
- A reading list of papers about Visual Grounding.☆31Aug 24, 2022Updated 3 years ago
- [AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images, AAAI, 2024☆70Apr 9, 2024Updated 2 years ago
- ☆14Jul 13, 2021Updated 4 years ago
- ☆21Apr 2, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official implement of "Point Long-Term Locality-Aware Transformer for Point Cloud Video Understanding"☆28Mar 24, 2026Updated last month
- Official repository of the UPAR dataset for pedestrian attribute recognition and attribute-based person retrieval☆16Jan 22, 2024Updated 2 years ago
- ☆24Jul 8, 2023Updated 2 years ago
- [AAAI 2025] Video Diffusion Models are Strong Video Inpainter☆17Jul 21, 2025Updated 9 months ago
- Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)☆11Jun 12, 2020Updated 5 years ago
- Pytorch code for NODIS: Neural Ordinary Differential Scene Understanding, ECCV2020☆12Aug 28, 2020Updated 5 years ago
- ☆42Jul 14, 2025Updated 9 months ago
- [CVPR 2024] 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surfaces☆26Mar 28, 2024Updated 2 years ago
- Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language"☆16Oct 22, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- code for affordance-r1☆71Dec 21, 2025Updated 4 months ago
- Thermal Indoor Motion Dataset☆16Apr 27, 2023Updated 3 years ago
- [NeurIPS 2024] NeuMA: Neural Material Adaptor for Visual Grounding of Intrinsic Dynamics☆60May 29, 2025Updated 11 months ago
- A curated list of the Video Summarization subject which is a computer science using machine learning and deep learning☆42May 29, 2020Updated 5 years ago
- A personal reimplementation with TensorFlow of NIPS2018 paper: Joint Autoregressive and Hierarchical Priors for Learned Image Compression☆15Jan 17, 2023Updated 3 years ago
- EANN(Pytorch)☆10Mar 12, 2022Updated 4 years ago
- Human Pose Classification☆17Feb 19, 2023Updated 3 years ago