musicalOffering / ActionSwitch-releaseLinks
☆10Updated 11 months ago
Alternatives and similar repositories for ActionSwitch-release
Users that are interested in ActionSwitch-release are comparing it to the libraries listed below
Sorting:
- Official repository of ECCV 2024 paper - "HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization"☆17Updated 10 months ago
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆9Updated 4 months ago
- ☆20Updated 11 months ago
- [ECCV 2024] Elysium: Exploring Object-level Perception in Videos via MLLM☆78Updated 8 months ago
- [ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"☆68Updated 4 months ago
- "Visual Prompt Selection for In-Context Learning Segmentation Framework"☆15Updated 7 months ago
- Improving Mamaba performance on Video Understanding task☆39Updated 8 months ago
- [CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos☆11Updated last year
- [NAACL 2024] Z-GMOT: Zero-shot Generic Multiple Object Tracking☆11Updated last year
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆17Updated 11 months ago
- [AAAI 2025] Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆23Updated 6 months ago
- ☆17Updated last year
- Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆52Updated last month
- UniMD: Towards Unifying Moment retrieval and temporal action Detection☆51Updated last year
- Code for Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos☆9Updated 10 months ago
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆34Updated 4 months ago
- [ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation☆33Updated 7 months ago
- Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers".☆39Updated last year
- Are Synthetic Data Useful for Egocentric Hand-Object Interaction Detection? [ECCV, 2024]☆13Updated 6 months ago
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'☆32Updated 4 months ago
- [NeurIPS 2023] The official implementation of SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation☆32Updated last year
- Transactions on Multimedia (TMM25)☆15Updated 3 months ago
- [CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detect…☆57Updated 3 months ago
- [ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning☆36Updated 3 months ago
- ☆10Updated 9 months ago
- Large-Vocabulary Video Instance Segmentation dataset☆89Updated last year
- [CVPR 2025 🔥]A Large Multimodal Model for Pixel-Level Visual Grounding in Videos☆74Updated 3 months ago
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆70Updated last year
- Offical repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal Prompting☆41Updated 2 weeks ago
- ☆12Updated 3 months ago