Code for ACM MM 2024 paper "A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning"
☆20Dec 5, 2024Updated last year
Alternatives and similar repositories for BDoG
Users that are interested in BDoG are comparing it to the libraries listed below.
- iterative shrinking for referring expression grounding using deep reinforcement learning☆14Nov 27, 2021Updated 4 years ago
- a collaborative agent-based workflow designed for NL2Vis task☆19Mar 6, 2025Updated last year
- [CVPR 2023] Code for "Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations"☆19Oct 10, 2023Updated 2 years ago
- Code for COLING 2020 paper "Controllable Abstractive Sentence Summarization with Guiding Entities"☆12Dec 24, 2020Updated 5 years ago
- ☆13Jan 5, 2022Updated 4 years ago
- ☆11Aug 20, 2025Updated 7 months ago
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆13Nov 1, 2025Updated 5 months ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆17Aug 15, 2025Updated 7 months ago
- MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)☆91Dec 18, 2025Updated 3 months ago
- A collection of research papers related to Natural Language Reasoning☆11May 27, 2022Updated 3 years ago
- Implementation of the Paper Scene-Graph ViT☆10Dec 20, 2024Updated last year
- [TCybern2019] Brain MRI Super-Resolution based on Thick-Section MR Images from Two Planes.☆11Jul 19, 2023Updated 2 years ago
- [AAAI'26] Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augm…☆12Dec 5, 2025Updated 4 months ago
- An Implementation of Deep Exhaustive Model for Nested NER☆15Jul 19, 2019Updated 6 years ago
- ☆36Jan 9, 2026Updated 3 months ago
- [CVPR2023] Context De-confounded Emotion Recognition☆18Jul 23, 2023Updated 2 years ago
- Official code of the MSF model for GZSSAR (ICIG 2023)☆14Jan 3, 2026Updated 3 months ago
- Pytorch implementation of Detective☆12Jul 11, 2024Updated last year
- Code and data for COLING2024 paper "Characteristic AI Agents via Large Language Models".☆25Nov 29, 2024Updated last year
- Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions☆28Feb 11, 2026Updated 2 months ago
- ☆13Apr 1, 2022Updated 4 years ago
- ☆15Feb 11, 2025Updated last year
- Pytorch implementation of "Diversified in-domain synthesis with efficient fine-tuning for few-shot classification"☆17Mar 25, 2024Updated 2 years ago
- A video question answering dataset that focuses on the dynamics properties of objects (velocity, acceleration) and their collisions withi…☆19Apr 23, 2025Updated 11 months ago
- ☆14Jan 31, 2024Updated 2 years ago
- Official implementation of "Continual Learning by Modeling Intra-Class Variation" (MOCA). [TMLR 2023]☆16Mar 3, 2023Updated 3 years ago
- [EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information☆12Oct 11, 2024Updated last year
- Python code to implement DeIL, a CLIP based approach for open-world few-shot learning.☆18Nov 4, 2024Updated last year
- This is code for Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model (ACM MM 2024)☆12Aug 27, 2024Updated last year
- FIGR-8, but images in .SVG vector graphics format☆15Feb 16, 2019Updated 7 years ago
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- Code for ACM MM 2021 Paper "Multimodal Relation Extraction with Efficient Graph Alignment".☆111Aug 2, 2022Updated 3 years ago
- [NAACL 2025] Guiding Large Language Models in Code Execution with Fine-grained Multimodal Chain-of-Thought Reasoning☆12Feb 9, 2025Updated last year
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"☆11Sep 3, 2024Updated last year
- ☆15May 23, 2022Updated 3 years ago
- A Chinese discourse parser based on CDTB☆14Jun 2, 2019Updated 6 years ago
- [AAAI 2024] MESED: A Multi-modal Entity Set Expansion Dataset with Fine-grained Semantic Classes and Hard Negative Entities☆16Apr 26, 2024Updated last year
- Model proposed in Sequence to Backward and Forward Sequences: A Content-Introducing Approach to Generative Short-Text Conversation☆18Aug 17, 2017Updated 8 years ago
- This is the official implementation of our PrOmpt cLass lEarning (POLE).☆12Jan 21, 2024Updated 2 years ago