Max-Fu / tvl
☆65Updated last month
Alternatives and similar repositories for tvl:
Users that are interested in tvl are comparing it to the libraries listed below
- [CVPR 2024] Binding Touch to Everything: Learning Unified Multimodal Tactile Representations☆38Updated last month
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated 5 months ago
- ☆44Updated 3 months ago
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆36Updated 2 weeks ago
- Latent Motion Token as the Bridging Language for Robot Manipulation☆74Updated last month
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆42Updated 2 months ago
- ☆64Updated 6 months ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆29Updated 9 months ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆121Updated last week
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆93Updated 4 months ago
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning☆48Updated last month
- ☆93Updated 6 months ago
- Egocentric Video Understanding Dataset (EVUD)☆27Updated 8 months ago
- ☆73Updated 6 months ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆25Updated 10 months ago
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment☆13Updated 2 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆45Updated 3 months ago
- ☆15Updated 4 months ago
- Language Repository for Long Video Understanding☆31Updated 8 months ago
- Repository for "General Flow as Foundation Affordance for Scalable Robot Learning"☆46Updated 2 months ago
- ☆68Updated 3 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆82Updated last year
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆34Updated last month
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆32Updated last year
- AnyBimanual: Transfering Unimanual Policy for General Bimanual Manipulation☆65Updated last month
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆43Updated 8 months ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆177Updated last month
- ☆17Updated 8 months ago