mlvlab / Drone_Task1
☆11 Updated 3 years ago
Alternatives and similar repositories for Drone_Task1
Users who are interested in Drone_Task1 are comparing it to the repositories listed below
- ☆12 Updated 3 years ago
- ☆17 Updated 2 years ago
- ☆17 Updated 2 years ago
- ☆17 Updated 2 years ago
- Archive for AI grand challenge ☆21 Updated 2 years ago
- 2021 Drone AI challenge ☆16 Updated 3 years ago
- Official Implementation (Pytorch) of the "Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation", EMNLP 2024 (main… ☆11 Updated 5 months ago
- Official PyTorch Implementation for CVPR2022 paper "Consistency Learning via Decoding Path Augmentation for Transformers in Human Object … ☆9 Updated 3 years ago
- [ECCV 2024] Official PyTorch implementation of "HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts" ☆19 Updated 8 months ago
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature. ☆26 Updated 5 months ago
- official PyTorch implementation for "Discovering an inference recipe for weakly-supervised object localization" ☆17 Updated last year
- [AAAI 2025] ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Mode… ☆18 Updated 10 months ago
- MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023) ☆35 Updated last year
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization". ☆28 Updated 5 months ago
- Code for paper "Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation (ECCV 2024)" ☆26 Updated 2 weeks ago
- This is the official repository for the paper "Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World"… ☆47 Updated last year
- [CVPR 2024] Official repository of ST_GT ☆9 Updated 10 months ago
- [AAAI2024] BOK-VQA : Bilingual Outside Knowledge-based Visual Question Answering via Graph Representation Pretraining ☆2 Updated last year
- [AAAI-24] VVS : Video-to-Video Retrieval With Irrelevant Frame Suppression ☆20 Updated last year
- Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024]. ☆28 Updated last year
- Official PyTorch implementation Source code for Weakly Supervised Video Scene Graph Generation via Natural Language Supervision, accepted… ☆20 Updated last month
- ☆12 Updated last year
- [CVPR 2024] Official implementation of the paper "TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate E… ☆26 Updated last year
- Official PyTorch implementation Source code for LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation, accepted at … ☆108 Updated last year
- Official implementation of paper "OED: Towards One-stage End-to-End Dynamic Scene Graph Generation". ☆20 Updated last year
- [CVPR'2022 Oral] The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation ☆32 Updated last year
- Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer". ☆52 Updated last year
- [ACL 2024] FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model ☆16 Updated 3 months ago
- Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023) ☆76 Updated 4 months ago
- Official repository for HOTR: End-to-End Human-Object Interaction Detection with Transformers (CVPR'21, Oral Presentation) ☆152 Updated 2 years ago