Mark12Ding / STA
Code for Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation (ICCV 2023)
☆23Updated last year
Alternatives and similar repositories for STA:
Users that are interested in STA are comparing it to the libraries listed below
- (ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"☆21Updated last week
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆36Updated this week
- Latent Motion Token as the Bridging Language for Robot Manipulation☆72Updated last week
- Accepted by CVPR 2024☆31Updated 9 months ago
- ☆43Updated 2 months ago
- AnyBimanual: Transfering Unimanual Policy for General Bimanual Manipulation☆61Updated last month
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆82Updated last year
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment☆13Updated last month
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)☆42Updated 6 months ago
- [ICLR 2024 Spotlight] Unified Human-Scene Interaction via Prompted Chain-of-Contacts☆187Updated 3 months ago
- Preview code of ECCV'24 paper "Distill Gold from Massive Ores" (BiLP)☆24Updated 7 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆46Updated 2 months ago
- [arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"☆122Updated 6 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆90Updated 3 months ago
- Official implementation of Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement.☆26Updated 5 months ago
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022☆54Updated 3 months ago
- For Ego4D VQ3D Task☆19Updated last year
- official implementation of CVPR 23 paper "M3Video: Masked Motion Modeling for Self-Supervised Video Representation Learning"☆50Updated last year
- Accepted by CVPR 2023☆36Updated 10 months ago
- Official implementation of Language Conditioned Spatial Relation Reasoning for 3D Object Grounding (NeurIPS'22).☆58Updated 2 years ago
- ☆23Updated 6 months ago
- Bidirectional Mapping between Action Physical-Semantic Space☆30Updated 5 months ago
- [CVPR 2024 Champions] Solutions for EgoVis Chanllenges in CVPR 2024☆119Updated 7 months ago
- This the official repository of OCL (ICCV 2023).☆19Updated 10 months ago
- ☆10Updated last year
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆41Updated last month
- ☆13Updated 6 months ago
- Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and…☆89Updated 3 weeks ago
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"☆114Updated 2 months ago