[CVPR 2024] Action-slot: Visual Action-centric Representations for Atomic Activity Recognition in Traffic Scenes
☆24Apr 28, 2025Updated 10 months ago
Alternatives and similar repositories for Action-slot
Users that are interested in Action-slot are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICRA 2024] RiskBench: A Scenario-based Benchmark for Risk Identification☆19Mar 21, 2025Updated last year
- ☆16Jan 30, 2024Updated 2 years ago
- Official PyTorch implementation of "GaussianLSS - Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting" …☆166Jul 9, 2025Updated 8 months ago
- ☆16Nov 14, 2023Updated 2 years ago
- [CVPR2025] Official implementation of RAM☆29Nov 4, 2025Updated 4 months ago
- The official project for the paper: Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation, CVPR 2022☆14Nov 9, 2022Updated 3 years ago
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.☆47Oct 14, 2024Updated last year
- [ECCV'24] Self-training Room Layout Estimation via Geometry-aware Ray-casting☆15Jan 20, 2025Updated last year
- [AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval☆20May 10, 2024Updated last year
- ☆10Aug 9, 2023Updated 2 years ago
- Official toolkit for Multi-View Layout Estimation Challenge in OmniCV workshop at CVPR'23.☆16Jun 1, 2023Updated 2 years ago
- [ECCV2022] 3D-PL: Domain Adaptive Depth Estimation with 3D-aware Pseudo-Labeling☆17Sep 20, 2022Updated 3 years ago
- ACMMM 2025☆17Dec 11, 2025Updated 3 months ago
- Repository for our paper "Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities"☆34Feb 12, 2025Updated last year
- [IROS 2021] ADD: A Fine-grained Dynamic Inference Architecture for Semantic Image Segmentation☆10May 3, 2022Updated 3 years ago
- [NeurIPS'22] 360-MLC: Multi-view Layout Consistency for Self-training and Hyper-parameter Tuning☆14Apr 3, 2025Updated 11 months ago
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP☆16Apr 17, 2025Updated 11 months ago
- [EMNLP25 Main]The official code of "Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval"☆22Mar 11, 2026Updated last week
- End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021☆18Oct 24, 2021Updated 4 years ago
- Noisy-Correspondence Learning for Text-to-Image Person Re-identification (CVPR 2024 Pytorch Code)☆115Nov 28, 2024Updated last year
- Scene Parsing with Global Context Embedding, ICCV 2017☆22Feb 28, 2018Updated 8 years ago
- [2026 AAAI] Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation☆19Nov 8, 2025Updated 4 months ago
- The code of the paper "Negative Pre-aware for Noisy Cross-modal Matching" in AAAI 2024.☆30Jul 2, 2025Updated 8 months ago
- [NeurIPS 2023] Self-supervised Object-Centric Learning for Videos☆32Nov 28, 2024Updated last year
- Multi-Organ Foundation Model for Universal Ultrasound Image Segmentation with Task Prompt and Anatomical Prior☆16Sep 30, 2024Updated last year
- Official PyTorch implementation of: "Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in V…☆14Aug 29, 2022Updated 3 years ago
- [ECCV 2024] Official Implementation of "Appearance-Based Refinement for Object-Centric Motion Segmentation" Junyu Xie, Weidi Xie, Andrew …☆13Oct 23, 2024Updated last year
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆34Mar 7, 2025Updated last year
- [CVPR2024] UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity☆79Sep 28, 2024Updated last year
- Incorporating Neuro-Inspired Adaptability for Continual Learning in Artificial Intelligence☆28Dec 12, 2023Updated 2 years ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆24Mar 6, 2026Updated 2 weeks ago
- 【CVPR 2025】Chat-based Person Retrieval via Dialogue-Refined Cross-Modal Alignment☆35Sep 17, 2025Updated 6 months ago
- ☆14Jan 5, 2022Updated 4 years ago
- Source code related to the research paper entitled RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Ri…☆12Mar 10, 2024Updated 2 years ago
- Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation (CVPR24)☆11Jun 16, 2024Updated last year
- This is the official implementation of "Fuzzy Multimodal Learning for Trusted Cross-modal Retrieval" (CVPR 2025)☆39Nov 16, 2025Updated 4 months ago
- Code for "ATTA: Anomaly-aware Test-Time Adaptation for Out-of-Distribution Detection in Segmentation" (NeurIPS 23)☆14Apr 12, 2024Updated last year
- Unseen Object Segmentation in Videos via Transferable Representations, ACCV 2018 (oral)☆25Apr 21, 2021Updated 4 years ago
- Code for "Unsupervised Space-Time Network for Temporally-Consistent Segmentation of Multiple Motions." (CVPR 2023)☆11Jun 15, 2023Updated 2 years ago