Video-as-Agent / VideoAgent
Official implementation of "Self-Improving Video Generation"
☆49Updated this week
Related projects ⓘ
Alternatives and complementary repositories for VideoAgent
- ☆68Updated 2 months ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆68Updated last week
- ☆61Updated last week
- ☆43Updated 2 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆55Updated last month
- Codebase for HiP☆86Updated 10 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆31Updated last week
- ☆67Updated this week
- Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).☆69Updated 2 months ago
- ☆76Updated 2 months ago
- [ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"☆77Updated 4 months ago
- ☆60Updated 4 months ago
- ☆44Updated last month
- Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models☆33Updated last month
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆54Updated last month
- Reward Guided Latent Consistency Distillation☆15Updated last month
- ☆27Updated last week
- ☆35Updated 3 months ago
- ☆29Updated 2 weeks ago
- ☆45Updated 3 weeks ago
- ☆16Updated 4 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆78Updated 9 months ago
- [NeurIPS 2024] GenRL: Multimodal foundation world models allow grounding language and video prompts into embodied domains, by turning the…☆58Updated 3 months ago
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆25Updated 6 months ago
- ☆46Updated 4 months ago
- Reading list for research topics in intuitive physics for artificial cognition.☆17Updated 2 years ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated last month
- This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Co…☆71Updated 4 months ago
- Transformer implementation for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"☆55Updated 3 weeks ago
- [ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆67Updated last month