Qinying-Liu / OpenWTALLinks
a unified and simple codebase for weakly-supervised temporal action localization
☆19Updated 2 years ago
Alternatives and similar repositories for OpenWTAL
Users that are interested in OpenWTAL are comparing it to the libraries listed below
Sorting:
- ☆15Updated last year
- (CVPR2024) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization☆19Updated last year
- Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".☆74Updated last year
- CLIP-Driven Fine-grained Text-Image Person Re-identification☆57Updated 2 years ago
- Paper Reading of IMCC groups.☆18Updated last month
- [AAAI 2024] Official implementation of "Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation"☆38Updated last year
- [ECCV 2024 oral] -C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition☆37Updated 11 months ago
- Accepted by ICCV2023, Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-bas…☆103Updated last year
- MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023☆80Updated 2 years ago
- Code for our paper "HyRSM++: Hybrid Relation Guided Temporal Set Matching for Few-shot Action Recognition".☆13Updated 2 years ago
- ☆94Updated 2 years ago
- [ECCV 2022] Dual-Evidential Learning for Weakly-supervised Temporal Action Localization☆49Updated last year
- [CVPR 2022] Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization☆48Updated 2 years ago
- Code for CVPR23 Highlight "I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification"…☆20Updated 2 years ago
- [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition☆90Updated 10 months ago
- This is a summary of research on noisy correspondence. There may be omissions. If anything is missing please get in touch with us. Our em…☆73Updated 2 weeks ago
- ☆27Updated last year
- [CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception☆36Updated 2 years ago
- [ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition☆16Updated last year
- Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval --ICCV2023 Oral☆91Updated 2 years ago
- ☆77Updated 3 weeks ago
- ☆48Updated 2 years ago
- [ICCV-2023] The official code of Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation☆138Updated 4 months ago
- ☆22Updated 2 years ago
- Code for "CARIS: Context-Aware Referring Image Segmentation" [ACM MM2023]☆27Updated 11 months ago
- ☆14Updated last year
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆55Updated last week
- [TPAMI 2024] This is the official Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding"…☆26Updated 6 months ago
- [CVPR 2024 Accepted] TaskWeave: Decoupling and Inter-Task Feedback for Joint Moment Retrieval and Highlight Detection☆27Updated last year
- [ICLR 2025] Official repository of "Learning Clustering-based Prototypes for Compositional Zero-shot Learning"☆20Updated 8 months ago