[NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"
☆73Feb 26, 2026Updated 3 months ago
Alternatives and similar repositories for JavisGPT
Users that are interested in JavisGPT are comparing it to the libraries listed below.
- [arXiv 2025.12] Animate Any Character in Any World☆97Mar 10, 2026Updated 2 months ago
- A framework for camera-controllable image editing using unified geometric guidance and video models.☆66Apr 28, 2026Updated last month
- [CVPR 2026] 👋 Dataset and Benchmark code for EgoEdit☆147Apr 5, 2026Updated last month
- [AAAI 2026] UltraGen☆78Feb 1, 2026Updated 3 months ago
- The official repository of EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model (CVPRW 2026)☆48Apr 19, 2026Updated last month
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control☆184Dec 11, 2025Updated 5 months ago
- [CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling☆84Mar 2, 2026Updated 2 months ago
- DreamStyle: A Unified Framework for Video Stylization☆119Jan 7, 2026Updated 4 months ago
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"☆43Jan 29, 2026Updated 4 months ago
- [Arxiv 2026] ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning☆85Mar 26, 2026Updated 2 months ago
- UniMesh: Unifying 3D Mesh Understanding and Generation☆56May 8, 2026Updated 2 weeks ago
- Resilient multi-LLM orchestration with built-in failure handling, rate limits, retries, and circuit breaker.☆45Mar 23, 2026Updated 2 months ago
- Official repository of paper "ProEdit: Inversion-based Editing From Prompts Done Right"☆117Feb 5, 2026Updated 3 months ago
- Official code for SongEcho☆63Mar 3, 2026Updated 2 months ago
- ☆88Mar 16, 2026Updated 2 months ago
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆326Dec 15, 2025Updated 5 months ago
- Official Implementation of MultiWorld: Scalable Multi-Agent Multi-View Video World Models☆214May 12, 2026Updated 2 weeks ago
- SpotEdit: Selective Region Editing in Diffusion Transformers☆190Jan 5, 2026Updated 4 months ago
- Audio-video joint generation☆57Nov 27, 2025Updated 6 months ago
- A Unified Visual Generator with Interleaved OmniModal Context☆222Mar 5, 2026Updated 2 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]☆82Mar 3, 2026Updated 2 months ago
- [ICML 2026] Transform Trained Transformer for Accelerating Native 4K Video Generation☆41Dec 16, 2025Updated 5 months ago
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆43Mar 24, 2026Updated 2 months ago
- [Official Repo] SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing☆204Apr 13, 2026Updated last month
- ☆333Jan 24, 2026Updated 4 months ago
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models☆22Dec 21, 2025Updated 5 months ago
- ☆17Sep 23, 2022Updated 3 years ago
- [ICLR2026] Any-to-Bokeh is a novel one-step video bokeh framework that converts arbitrary input videos into temporally coherent, depth-aw…☆138Feb 4, 2026Updated 3 months ago
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation☆72Apr 28, 2026Updated last month
- DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer☆639May 22, 2026Updated last week
- End2End Virtual Try-on with Visual Reference, CVPR2026☆64Apr 18, 2026Updated last month
- OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer☆232Apr 15, 2026Updated last month
- Python package for Zuna, an EEG foundation model for inference.☆297May 8, 2026Updated 2 weeks ago
- This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)☆25Dec 7, 2023Updated 2 years ago
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆43Jan 17, 2025Updated last year
- Official code repository of '3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model'☆58Mar 20, 2026Updated 2 months ago
- Official Pytorch Implementation for "Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising"☆357May 12, 2026Updated 2 weeks ago
- ☆200Mar 11, 2026Updated 2 months ago
- Code for "Adversarial Attack Generation Empowered by Min-Max Optimization", NeurIPS 2021☆20Dec 6, 2021Updated 4 years ago