mlvlab / DialogGSRLinks
Official Implementation (Pytorch) of the "Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation", EMNLP 2024 (main)
☆11Updated 5 months ago
Alternatives and similar repositories for DialogGSR
Users that are interested in DialogGSR are comparing it to the libraries listed below
Sorting:
- ☆17Updated 2 years ago
- Archive for AI grand challenge☆21Updated 2 years ago
- ☆17Updated 2 years ago
- ☆17Updated 2 years ago
- Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…☆21Updated 6 months ago
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".☆28Updated 5 months ago
- ☆12Updated 3 years ago
- Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023)☆76Updated 4 months ago
- Official implementation of paper "OED: Towards One-stage End-to-End Dynamic Scene Graph Generation".☆20Updated last year
- Official PyTorch implementation Source code for Weakly Supervised Video Scene Graph Generation via Natural Language Supervision, accepted…☆20Updated last month
- ☆11Updated 3 years ago
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination☆13Updated 6 months ago
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…☆18Updated last year
- Official PyTorch implementation Source code for LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation, accepted at …☆108Updated last year
- [CVPR 2024] Official repository of ST_GT☆9Updated 10 months ago
- MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)☆35Updated last year
- ☆12Updated last year
- ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio☆46Updated 3 weeks ago
- ☆13Updated 5 months ago
- Official pytorch implementation of 'Relation-aware Language-Graph Transformer for Question Answering' (AAAI 2023)☆17Updated 2 years ago
- Code for paper "Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation (ECCV 2024)"☆26Updated 2 weeks ago
- ☆12Updated 7 months ago
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.☆26Updated 5 months ago
- Video Chain of Thought, Codes for ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"☆157Updated 5 months ago
- Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering [ACM MM'24]☆12Updated last year
- Official PyTorch code of GroundVQA (CVPR'24)☆61Updated 10 months ago
- [ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos☆66Updated 3 weeks ago
- [CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding☆90Updated 3 months ago
- [CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training☆28Updated 4 months ago
- Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".☆52Updated last year