rlqja1107 / NL-VSGG
Official PyTorch implementation (source code) for "Weakly Supervised Video Scene Graph Generation via Natural Language Supervision", accepted at ICLR 2025
☆23 Updated 6 months ago
Alternatives and similar repositories for NL-VSGG
Users who are interested in NL-VSGG are comparing it to the repositories listed below.
- Official PyTorch implementation Source code for Adaptive Self-Training Framework for Fine-grained Scene Graph generation (ST-SGG), accept… ☆21 Updated last year
- Code for paper "Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation (ECCV 2024)" ☆26 Updated 5 months ago
- Hetsgg ☆29 Updated 2 years ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight) ☆20 Updated 5 months ago
- Official PyTorch implementation Source code for LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation, accepted at … ☆113 Updated last year
- Official implementation of paper "OED: Towards One-stage End-to-End Dynamic Scene Graph Generation". ☆24 Updated last year
- ☆17 Updated 2 years ago
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision ☆12 Updated 2 years ago
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization". ☆31 Updated 9 months ago
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination ☆17 Updated 11 months ago
- Official PyTorch Implementation of RA-TTA (ICLR25) ☆21 Updated 8 months ago
- ☆21 Updated 3 years ago
- Official Implementation (Pytorch) of the "Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation", EMNLP 2024 (main… ☆12 Updated 9 months ago
- [NeurIPS'2023] Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models ☆22 Updated 2 months ago
- The official implementation of paper "Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval" accepted by NeurIPS… ☆27 Updated last year
- Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti… ☆23 Updated 11 months ago
- [ECCV 2024] Code for the paper "Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network" ☆16 Updated last year
- [CVPR 2024] Official repository of ST_GT ☆10 Updated last year
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20… ☆18 Updated last year
- [ICCV'2023] Compositional Feature Augmentation for Unbiased Scene Graph Generation ☆15 Updated 2 years ago
- Code for paper 'Leveraging Predicate and Triplet Learning for Scene Graph Generation'. (CVPR 2024) ☆32 Updated 4 months ago
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models" ☆15 Updated 2 years ago
- Official Pytorch Implementation of the framework TEMPURA proposed in our paper Unbiased Scene Graph Generation in Videos accepted by CVPR… ☆23 Updated 4 months ago
- The official source code for "3D Interaction Geometric Pre-training for Molecular Relational Learning" ☆21 Updated 3 months ago
- Training and testing code from our CVPR 2023 paper "Are Deep Neural Networks SMARTer than Second Graders?" ☆11 Updated 2 years ago
- Missing Modality Generation for Recommendation ☆33 Updated 8 months ago
- [ICML 2024] "Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models" ☆57 Updated last year
- Vision Relation Transformer for Unbiased Scene Graph Generation (ICCV 2023) ☆22 Updated 2 years ago
- The official source code for "Vision Language Model is NOT All You Need: Augmentation Strategies for Molecule Language Model". ☆14 Updated last year
- ☆14 Updated last year