olga-zats / DIFF_MANTA
[CVPR 2025] MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Anticipation
☆21Updated 6 months ago
Alternatives and similar repositories for DIFF_MANTA
Users that are interested in DIFF_MANTA are comparing it to the libraries listed below
- Official implementation of the paper "Hierarchical Vector Quantization for Unsupervised Action Segmentation"☆22Updated 8 months ago
- [ECCV2024] Gated Temporal Action Anticipation for Stochastic Long-Term Anticipation☆22Updated 6 months ago
- [ICIP2023] Code for the paper 'Action Anticipation with Goal Consistency'☆12Updated last year
- A curated list of awesome temporal action segmentation resources.☆233Updated last year
- Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners"☆357Updated last week
- The suite of modeling video with Mamba☆282Updated last year
- A curated list of awesome self-supervised learning methods in videos☆158Updated last week
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking☆718Updated last year
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆134Updated 2 years ago
- Code for Diffusion Action Segmentation (ICCV 2023)☆68Updated 2 years ago
- [ICCV 2023] Latent Action Composition for Skeleton-based Action Segmentation☆21Updated 2 years ago
- [ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding☆1,040Updated last year
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆81Updated 6 months ago
- [ICCV 2023] How Much Temporal Long-Term Context is Needed for Action Segmentation?☆50Updated last year
- [CVPR'25] SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction☆19Updated 4 months ago
- OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.☆304Updated 7 months ago
- The official PyTorch implementation of the IEEE/CVF International Conference on Computer Vision (ICCV) '23 paper Multimodal Motion Condit…☆88Updated last year
- [CVPR 2024] - Official code for the paper "Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation"☆45Updated last year
- PyTorch implementation of a collection of scalable Video Transformer Benchmarks.☆305Updated 3 years ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆100Updated last year
- ☆21Updated this week
- ☆40Updated last year
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆61Updated 10 months ago
- Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos☆27Updated last year
- Visualizing the learned space-time attention using Attention Rollout☆40Updated 3 years ago
- Official PyTorch implementation of the paper "Hyperbolic Self-paced Learning for Self-supervised Skeleton-based Action Representations" (…☆21Updated last year
- Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"☆114Updated 3 months ago
- Awesome Online Action Detection☆71Updated 10 months ago
- [ACM MM 2024] Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer☆20Updated 4 months ago
- xLSTM as Generic Vision Backbone☆491Updated last month