JavisVerse / JavisGPTLinks
[NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"
☆68Updated 3 weeks ago
Alternatives and similar repositories for JavisGPT
Users that are interested in JavisGPT are comparing it to the libraries listed below
Sorting:
- VideoCoF: Unified Video Editing with Temporal Reasoner☆129Updated 3 weeks ago
- An official implementation of SwapAnyone.☆73Updated 10 months ago
- Animate Any Character in Any World☆88Updated 3 weeks ago
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation☆111Updated last month
- A Unified Visual Generator with Interleaved OmniModal Context☆167Updated 3 weeks ago
- Blending Custom Photos with Video Diffusion Transformers☆48Updated last year
- ☆132Updated 7 months ago
- Official PyTorch implementation of the paper "FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing"☆76Updated last month
- Official implementation of Progressive Detail Injection for Training-Free Semantic Binding in Text-to-Image Generation☆31Updated 5 months ago
- ☆227Updated 6 months ago
- The official implementation of ”RepVideo: Rethinking Cross-Layer Representation for Video Generation“☆123Updated last year
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models☆152Updated 4 months ago
- The official UniVerse-1 code.☆119Updated 3 months ago
- ☆92Updated 4 months ago
- FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation☆78Updated 5 months ago
- [Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning☆158Updated 4 months ago
- DreamStyle: A Unified Framework for Video Stylization☆107Updated 3 weeks ago
- [AAAI 2026] Official implementation of DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation☆77Updated 7 months ago
- ☆107Updated 4 months ago
- Scaling Zero-Shot Reference-to-Video Generation☆63Updated last month
- 👋 Dataset and Benchmark code for EgoEdit☆105Updated last month
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".☆123Updated last month
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆107Updated last month
- ☆29Updated 10 months ago
- Official Implementation of ReCo: Region-Constraint In-Context Generation for Instructional Video Editing☆139Updated 2 weeks ago
- [AAAI 2026] UltraGen☆79Updated 3 months ago
- Krea Realtime 14B. An open-source realtime AI video model.☆464Updated 2 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆100Updated 3 months ago
- [CVPR 2025] Official implementation of ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way☆46Updated 3 months ago
- [ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning☆211Updated 2 months ago