Video-as-Agent / VideoAgent
Official implementation of "Self-Improving Video Generation"
☆60Updated last month
Alternatives and similar repositories for VideoAgent:
Users that are interested in VideoAgent are comparing it to the libraries listed below
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆114Updated last month
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆41Updated last month
- ☆73Updated 5 months ago
- ☆112Updated last month
- ☆91Updated 6 months ago
- Latent Motion Token as the Bridging Language for Robot Manipulation☆72Updated last week
- ElasticTok: Adaptive Tokenization for Image and Video☆54Updated 3 months ago
- ☆61Updated 5 months ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆154Updated 3 weeks ago
- ☆65Updated last week
- ☆15Updated 3 months ago
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆58Updated 4 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆78Updated 2 weeks ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆76Updated 2 weeks ago
- Codebase for HiP☆88Updated last year
- Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).☆131Updated 5 months ago
- Reward Guided Latent Consistency Distillation☆21Updated 4 months ago
- ☆51Updated 4 months ago
- ☆18Updated 7 months ago
- ☆116Updated last year
- Benchmarking physical understanding in generative video models☆116Updated this week