HITsz-TMG / FilmAgent
Resources of our paper "FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces". New versions in the making!
☆898Updated last week
Alternatives and similar repositories for FilmAgent:
Users that are interested in FilmAgent are comparing it to the libraries listed below
- Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Diffusion Transformer Networks☆1,092Updated this week
- Build multimodal language agents for fast prototype and production☆2,101Updated last week
- Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation☆3,493Updated this week
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models☆888Updated last month
- Video generation from text&image, 1st-gen☆791Updated 2 weeks ago
- [CVPR'25]Tora: Trajectory-oriented Diffusion Transformer for Video Generation☆1,077Updated this week
- Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".☆1,050Updated 7 months ago
- Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥☆545Updated last month
- MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators☆1,272Updated 3 weeks ago
- Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language m…☆4,407Updated this week
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…☆1,055Updated 3 weeks ago
- Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai☆2,669Updated this week
- [CVPR 2025] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion☆907Updated this week
- Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。☆1,649Updated last month
- The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"☆694Updated last month
- Customized ID Consistent for human☆939Updated last week
- Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance☆4,165Updated 7 months ago
- [NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions☆1,041Updated 4 months ago
- [LCLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation☆930Updated 4 months ago
- [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation☆1,021Updated 3 months ago
- ☆789Updated 2 months ago
- Align Anything: Training All-modality Model with Feedback☆2,486Updated this week
- [ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation☆304Updated last week
- [NeurIPS 2024🔥] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation☆964Updated 2 months ago