Max-Fu / tvl
☆65Updated last week
Alternatives and similar repositories for tvl:
Users that are interested in tvl are comparing it to the libraries listed below
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated 4 months ago
- Binding Touch to Everything: Learning Unified Multimodal Tactile Representations☆28Updated 2 weeks ago
- ☆43Updated 2 months ago
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆41Updated last month
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆36Updated this week
- Latent Motion Token as the Bridging Language for Robot Manipulation☆72Updated last week
- ☆61Updated 5 months ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆114Updated last month
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆154Updated 3 weeks ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆76Updated 2 weeks ago
- ☆73Updated 5 months ago
- ☆91Updated 6 months ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆24Updated 8 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆82Updated last year
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆90Updated 3 months ago
- Official implementation of "Self-Improving Video Generation"☆60Updated last month
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment☆13Updated last month
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆32Updated 3 weeks ago
- AnyBimanual: Transfering Unimanual Policy for General Bimanual Manipulation☆61Updated last month
- ☆66Updated 2 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆54Updated 3 months ago
- Dreamitate: Real-World Visuomotor Policy Learning via Video Generation (CoRL 2024)☆43Updated 7 months ago
- Official code for MotionBench☆24Updated last month
- Code for Stable Control Representations☆23Updated last month
- Efficiently apply modification functions to RLDS/TFDS datasets.☆24Updated 8 months ago
- ☆15Updated 3 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks☆41Updated 2 months ago
- ☆17Updated 7 months ago
- ☆61Updated 4 months ago
- ☆21Updated 3 weeks ago