mlvlab / Drone_Task1
☆11 · Updated 2 years ago
Related projects:
- ☆12 · Updated 2 years ago
- 2021 Drone AI challenge ☆16 · Updated 2 years ago
- ☆17 · Updated last year
- ☆17 · Updated last year
- ☆17 · Updated last year
- Archive for AI grand challenge ☆21 · Updated last year
- Official PyTorch Implementation for CVPR2022 paper "Consistency Learning via Decoding Path Augmentation for Transformers in Human Object … ☆8 · Updated 2 years ago
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20… ☆18 · Updated 4 months ago
- MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023) ☆32 · Updated 4 months ago
- The official code for Devil's on the Edges: Selective Quad Attention for Scene Graph Generation, CVPR2023. ☆22 · Updated last year
- Official PyTorch implementation of "Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relati… ☆24 · Updated 5 months ago
- A New Benchmark for Scene Graph Generation, targeting real-world applications ☆32 · Updated last month
- [ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition" ☆15 · Updated last month
- Official implementation of paper "OED: Towards One-stage End-to-End Dynamic Scene Graph Generation". ☆12 · Updated 5 months ago
- [ICCV'2023] Compositional Feature Augmentation for Unbiased Scene Graph Generation ☆13 · Updated 9 months ago
- Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023) ☆71 · Updated last month
- ☆75 · Updated 2 years ago
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization". ☆22 · Updated 3 weeks ago
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos' ☆20 · Updated 9 months ago
- Repository of "Improving Cross-Modal Retrieval With Set of Diverse Embeddings" (CVPR'23, Highlight) ☆36 · Updated 10 months ago
- This is the official repository for the paper "Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World"… ☆41 · Updated 6 months ago
- Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer". ☆32 · Updated 4 months ago
- ☆27 · Updated this week
- Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024]. ☆21 · Updated 3 months ago
- Code for CVPR23 paper: Learning to Generate Language-supervised and Open-vocabulary Scene Graph using Pre-trained Visual-Semantic Space ☆35 · Updated 11 months ago
- ☆19 · Updated last year
- ☆20 · Updated 2 weeks ago
- ☆28 · Updated last year
- Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023) ☆46 · Updated last year
- Official repository of the "Shatter and Gather: Learning Referring Image Segmentation with Text Supervision (ICCV'23)" ☆32 · Updated 7 months ago