mlvlab / Drone_task4Links
☆12Updated 3 years ago
Alternatives and similar repositories for Drone_task4
Users that are interested in Drone_task4 are comparing it to the libraries listed below
Sorting:
- ☆17Updated 2 years ago
- ☆17Updated 2 years ago
- ☆17Updated 2 years ago
- ☆11Updated 3 years ago
- Archive for AI grand challenge☆21Updated 2 years ago
- 2021 Drone AI challenge☆16Updated 3 years ago
- Official Implementation (Pytorch) of the "Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation", EMNLP 2024 (main…☆11Updated 5 months ago
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.☆28Updated 6 months ago
- [CVPR 2024 Best paper award candidate] EGTR: Extracting Graph from Transformer for Scene Graph Generation☆123Updated last year
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".☆29Updated 5 months ago
- The official code for Devil's on the Edges: Selective Quad Attention for Scene Graph Generation, CVPR2023.☆23Updated 2 years ago
- Official implementation of paper "OED: Towards One-stage End-to-End Dynamic Scene Graph Generation".☆20Updated last year
- Official PyTorch implementation Source code for LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation, accepted at …☆110Updated last year
- Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…☆21Updated 7 months ago
- Repository of "Improving Cross-Modal Retrieval With Set of Diverse Embeddings" (CVPR'23, Highlight)☆41Updated last year
- [ECCV 2024 (Oral)] Towards Scene Graph Anticipation☆17Updated 9 months ago
- [ACL Main 2025] I0T: Embedding Standardization Method Towards Zero Modality Gap☆11Updated 2 months ago
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'☆32Updated 6 months ago
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…☆18Updated last year
- [ECCV 2024 Oral] Official implementation of the paper "DEVIAS: Learning Disentangled Video Representations of Action and Scene"☆24Updated 10 months ago
- Official PyTorch implementation Source code for Weakly Supervised Video Scene Graph Generation via Natural Language Supervision, accepted…☆20Updated 2 months ago
- [ICCV-2025] Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs☆38Updated last month
- Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)☆26Updated last year
- Code for paper "Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation (ECCV 2024)"☆26Updated last month
- Official repository of the "ReSTR: Convolution-Free Referring Image Segmentation Using Transformers (CVPR'22)"☆13Updated 8 months ago
- Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023)☆76Updated 5 months ago
- Official repository of the "Shatter and Gather: Learning Referring Image Segmentation with Text Supervision (ICCV'23)"☆39Updated last year
- [ECCV 2024 Best Paper Candidate] Implementation of "Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Vi…☆76Updated last month
- [CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection☆26Updated 11 months ago
- This is the official repository for the paper "Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World"…☆47Updated last year