mlvlab / Drone_task4
☆12 Updated 3 years ago
Alternatives and similar repositories for Drone_task4:
Users who are interested in Drone_task4 are comparing it to the repositories listed below.
- ☆11 Updated 3 years ago
- ☆17 Updated last year
- 2021 Drone AI challenge ☆16 Updated 3 years ago
- ☆17 Updated last year
- ☆17 Updated last year
- Archive for AI grand challenge ☆21 Updated last year
- Official Implementation (Pytorch) of the "Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation", EMNLP 2024 (main… ☆11 Updated 3 weeks ago
- Official PyTorch Implementation for CVPR2022 paper "Consistency Learning via Decoding Path Augmentation for Transformers in Human Object … ☆8 Updated 2 years ago
- Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti… ☆17 Updated 2 months ago
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20… ☆18 Updated 11 months ago
- Official implementation of CVPR 2024 paper "Prompt Learning via Meta-Regularization". ☆27 Updated 3 weeks ago
- Official Implementation (Pytorch) of "Super-class guided Transformer for Zero-Shot Attribute Classification", AAAI 2025 ☆11 Updated 2 months ago
- MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023) ☆33 Updated 11 months ago
- Official implementation of paper "OED: Towards One-stage End-to-End Dynamic Scene Graph Generation". ☆18 Updated last year
- Official PyTorch implementation of "Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relati… ☆35 Updated 11 months ago
- Official PyTorch implementation Source code for Weakly Supervised Video Scene Graph Generation via Natural Language Supervision, accepted… ☆10 Updated last month
- [CVPR 2024 Best paper award candidate] EGTR: Extracting Graph from Transformer for Scene Graph Generation ☆104 Updated 9 months ago
- Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer". ☆46 Updated 10 months ago
- Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024]. ☆26 Updated 9 months ago
- The official code for Devil's on the Edges: Selective Quad Attention for Scene Graph Generation, CVPR2023. ☆22 Updated last year
- Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers". ☆35 Updated 11 months ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023) ☆38 Updated last year
- [CVPR 2024] Official repository of ST_GT ☆9 Updated 6 months ago
- Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023) ☆74 Updated last week
- ☆12 Updated 4 months ago
- Video-Text Representation Learning via Differentiable Weak Temporal Alignment (CVPR 2022) ☆16 Updated 11 months ago
- ☆21 Updated 2 years ago
- Code for paper "Semantic Diversity-aware Prototype-based Learning for Unbiased Scene Graph Generation (ECCV 2024)" ☆21 Updated 2 weeks ago
- ☆16 Updated this week
- Code for paper 'Leveraging Predicate and Triplet Learning for Scene Graph Generation'. (CVPR 2024) ☆27 Updated 2 weeks ago