Tanveer81 / RGNet
This is the official implementation of RGNet: A Unified Retrieval and Grounding Network for Long Videos
☆14 Updated 3 weeks ago
Alternatives and similar repositories for RGNet:
Users who are interested in RGNet compare it with the repositories listed below.
- The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024) ☆29 Updated 11 months ago
- [CVPR 2024] Do you remember? Dense Video Captioning with Cross-Modal Memory Retrieval ☆53 Updated 9 months ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)" ☆18 Updated last month
- With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning. ICCV 2023 ☆16 Updated 9 months ago
- An official implementation for MS-DETR in ACL'23 ☆16 Updated last year
- Official PyTorch code of GroundVQA (CVPR'24) ☆56 Updated 6 months ago
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight ☆37 Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes! ☆24 Updated 4 months ago
- [2023 ACL] CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding ☆30 Updated last year
- Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop) ☆30 Updated last year
- [AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval