Official implementation of "Self-Improving Video Generation"
☆77Apr 25, 2025Updated 10 months ago
Alternatives and similar repositories for VideoAgent
Users that are interested in VideoAgent are comparing it to the libraries listed below
Sorting:
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration☆60May 4, 2025Updated 9 months ago
- This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.☆116Nov 26, 2024Updated last year
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.☆16Aug 30, 2024Updated last year
- A custom node extension for ComfyUI that integrates Google's Veo 2 text-to-video generation capabilities.☆32Apr 12, 2025Updated 10 months ago
- Jupyter notebooks for PuLID face transfer with Flux.1 dev. Able to run on Google Colab Free Tier☆18Dec 18, 2024Updated last year
- Benchmarking physical understanding in generative video models☆247Feb 2, 2026Updated 3 weeks ago
- Official Implementation of paper "Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models"☆55Jan 28, 2025Updated last year
- [NeurIPS 2024] Official Implementation of GrounDiT☆59Dec 12, 2024Updated last year
- ☆78May 23, 2025Updated 9 months ago
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆25Apr 14, 2025Updated 10 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆309Mar 12, 2025Updated 11 months ago
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi…☆30Oct 30, 2025Updated 4 months ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆20Jan 26, 2025Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆81Apr 10, 2024Updated last year
- Implementation of Zero-Shot Contrastive Loss for Text-Guided Diffusion Image Style Transfer.☆24May 16, 2024Updated last year
- ☆24Feb 21, 2025Updated last year
- Txt2Img | Img2Img | + Multiple LoRAs, All in one jupyter notebook for Flux.1 dev/schnell. Able to run on Google Colab Free Tier☆21Dec 3, 2024Updated last year
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆248Apr 25, 2024Updated last year
- ☆21Jan 26, 2026Updated last month
- Advanced drum machine for ComfyUI featuring a 64-step sequencer, custom sample support, and retro hardware aesthetics.☆20Jan 19, 2026Updated last month
- PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos☆48Jul 3, 2025Updated 7 months ago
- [AAAI 2026] Official implementation of DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation☆78Jun 11, 2025Updated 8 months ago
- [CVPR 2023] Official PyTorch implementation of MoStGAN-V☆24Jun 15, 2023Updated 2 years ago
- [ACM Multimedia 2025 Datasets Track] EditWorld: Simulating World Dynamics for Instruction-Following Image Editing☆139Aug 2, 2025Updated 7 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆112Dec 4, 2025Updated 2 months ago
- This is official repository of Physics-AD☆18Updated this week
- AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback☆17Oct 15, 2025Updated 4 months ago
- ☆19Apr 23, 2025Updated 10 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆52Dec 5, 2024Updated last year
- Code release for: Controllable Layer Decomposition for Reversible Multi-Layer Image Generation☆42Dec 7, 2025Updated 2 months ago
- Codebase for the paper HawkI: HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Jun 5, 2024Updated last year
- [ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos"☆22Nov 23, 2025Updated 3 months ago
- Pytorch implementatoin of the components mentioned in deep dynamic characters☆32Mar 27, 2024Updated last year
- Navigate dreamscapes with a click – your chosen point guides the drone’s flight in a thrilling visual journey.☆48Sep 2, 2025Updated 6 months ago
- ☆163Jan 6, 2025Updated last year
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆24Jun 24, 2024Updated last year
- NOVA-3D: Non-overlapped Views for 3D Anime Character Reconstruction☆26Mar 14, 2024Updated last year
- Unofficial implementation of Layer Diffuse in diffusers☆28Apr 3, 2024Updated last year
- Code repository for T2V-Turbo and T2V-Turbo-v2☆314Jan 31, 2025Updated last year