Mark12Ding / STA
Code for Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation (ICCV 2023)
☆23Updated last year
Alternatives and similar repositories for STA:
Users that are interested in STA are comparing it to the libraries listed below
- (ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"☆25Updated last month
- Latent Motion Token as the Bridging Language for Robot Manipulation☆79Updated last week
- [arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"☆127Updated 8 months ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆30Updated 10 months ago
- [CVPR 2024 Champions][ICLR 2025] Solutions for EgoVis Chanllenges in CVPR 2024☆124Updated 2 weeks ago
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)☆43Updated 8 months ago
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆37Updated last month
- ☆46Updated 3 months ago
- Accepted by CVPR 2024☆33Updated 10 months ago
- 🔍 Explore Egocentric Vision: research, data, challenges, real-world apps. Stay updated & contribute to our dynamic repository! Work-in-p…☆106Updated 4 months ago
- [CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.☆71Updated 3 weeks ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆98Updated 4 months ago
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]☆96Updated 8 months ago
- AnyBimanual: Transfering Unimanual Policy for General Bimanual Manipulation☆66Updated 2 months ago
- For Ego4D VQ3D Task☆19Updated last year
- [CVPR 2023 Hightlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos☆33Updated last year
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆44Updated 3 months ago
- [CVPR 2024] Data and benchmark code for the EgoExoLearn dataset☆55Updated 6 months ago
- ☆94Updated 7 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆84Updated last year
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (arXiv 2025)☆24Updated last week
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆121Updated 3 weeks ago
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022☆54Updated 5 months ago
- Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)☆40Updated 11 months ago
- ☆26Updated this week
- This the official repository of OCL (ICCV 2023).☆19Updated last year
- Accepted by CVPR 2023☆41Updated 11 months ago
- ☆21Updated 10 months ago
- ☆37Updated last week
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆47Updated 3 months ago