mlvlab / Drone_Task1Links
☆11Updated 4 years ago
Alternatives and similar repositories for Drone_Task1
Users that are interested in Drone_Task1 are comparing it to the libraries listed below
Sorting:
- ☆12Updated 4 years ago
- ☆16Updated 2 years ago
- ☆16Updated 2 years ago
- ☆16Updated 2 years ago
- Archive for AI grand challenge☆20Updated 2 years ago
- 2021 Drone AI challenge☆16Updated 4 years ago
- Official Implementation (Pytorch) of the "Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation", EMNLP 2024 (main…☆12Updated 10 months ago
- [ACL Main 2025] I0T: Embedding Standardization Method Towards Zero Modality Gap☆12Updated 7 months ago
- [ECCV 2024] Official PyTorch implementation of "HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts"☆20Updated last year
- Code for paper "Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation (ECCV 2024)"☆26Updated 6 months ago
- [ACL 2024] FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model☆17Updated 9 months ago
- Official implementation of project Honeybee (CVPR 2024)☆464Updated last year
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization".☆32Updated 10 months ago
- [CVPR 2025] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆16Updated 4 months ago
- MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)☆35Updated last year
- [AAAI 2025] ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Mode…☆26Updated last year
- Kyung Hee University Vision and Learning Reading Group☆45Updated this week
- Official PyTorch implementation Source code for LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation, accepted at …☆114Updated last year
- official PyTorch implementation for "Discovering an inference recipe for weakly-supervised object localization"☆17Updated last year
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…☆18Updated last year
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.☆34Updated 11 months ago
- [WACV 2026] MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval☆13Updated 4 months ago
- [CVPR 2024] Official repository of ST_GT☆10Updated last year
- [ Text Analytics ] 법률 도메인 특화 한국어 기반 LLM 개발☆14Updated 4 months ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆20Updated 6 months ago
- [EMNLP 2024 Industry track] MERLIN : Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank P…☆14Updated 11 months ago
- Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].☆30Updated last year
- Official PyTorch implementation Source code for Weakly Supervised Video Scene Graph Generation via Natural Language Supervision, accepted…☆23Updated 7 months ago
- [ACL 2024 Findings] Official PyTorch Implementation code for realizing the technical part of CoLLaVO: Crayon Large Language and Vision mO…☆99Updated last year
- Official PyTorch implementation of Extract Free Dense Misalignment from CLIP (AAAI'25)☆25Updated 9 months ago