Video-as-Agent / VideoAgent
Official implementation of "Self-Improving Video Generation"
☆62 · Updated 3 weeks ago
Alternatives and similar repositories for VideoAgent:
Users who are interested in VideoAgent are comparing it to the repositories listed below.
- ☆121 · Updated 2 months ago
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration". ☆43 · Updated 2 months ago
- ☆75 · Updated 7 months ago
- ☆16 · Updated 4 months ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223 ☆121 · Updated 2 weeks ago
- Latent Motion Token as the Bridging Language for Robot Manipulation ☆77 · Updated this week
- ElasticTok: Adaptive Tokenization for Image and Video ☆61 · Updated 4 months ago
- Reward Guided Latent Consistency Distillation ☆21 · Updated 5 months ago
- ☆67 · Updated 6 months ago
- ☆94 · Updated 7 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics ☆87 · Updated last week
- ☆26 · Updated 3 months ago
- ☆67 · Updated last month
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization ☆94 · Updated last month
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models ☆29 · Updated 10 months ago
- Codebase for HiP ☆88 · Updated last year
- ☆18 · Updated 8 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks ☆48 · Updated 3 months ago
- Code for Stable Control Representations ☆24 · Updated 2 months ago
- ☆27 · Updated last month
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student ☆63 · Updated 2 weeks ago
- ☆51 · Updated 6 months ago
- [ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment ☆36 · Updated last year
- ☆46 · Updated 3 months ago
- Official repository of S-Agents: Self-organizing Agents in Open-ended Environment ☆21 · Updated last year
- ☆43 · Updated last month