[NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"
☆72Feb 26, 2026Updated 2 months ago
Alternatives and similar repositories for JavisGPT
Users that are interested in JavisGPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Animate Any Character in Any World☆97Mar 10, 2026Updated last month
- A framework for camera-controllable image editing using unified geometric guidance and video models.☆59Apr 28, 2026Updated last week
- [CVPR 2026] 👋 Dataset and Benchmark code for EgoEdit☆143Apr 5, 2026Updated last month
- [CVPR 2026] SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time☆110Apr 15, 2026Updated 3 weeks ago
- ☆89Feb 4, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control☆180Dec 11, 2025Updated 4 months ago
- [Arxiv 2026] ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning☆81Mar 26, 2026Updated last month
- [CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling☆82Mar 2, 2026Updated 2 months ago
- DreamStyle: A Unified Framework for Video Stylization☆119Jan 7, 2026Updated 4 months ago
- UniMesh: Unifying 3D Mesh Understanding and Generation☆47Apr 29, 2026Updated last week
- Resilient multi-LLM orchestration with in-built failure handing, rate limits, retries, and circuit breaker.☆43Mar 23, 2026Updated last month
- Official repository of paper "ProEdit: Inversion-based Editing From Prompts Done Right"☆116Feb 5, 2026Updated 3 months ago
- Official code for SongEcho☆59Mar 3, 2026Updated 2 months ago
- ☆86Mar 16, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆327Dec 15, 2025Updated 4 months ago
- SpotEdit:Selective Region Editing in Diffusion Transformers☆188Jan 5, 2026Updated 4 months ago
- Code for paper "CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering" [NeurIPS 2025 (spotlight)]☆75Aug 2, 2025Updated 9 months ago
- ☆38Dec 16, 2025Updated 4 months ago
- Audio-video joint generation☆57Nov 27, 2025Updated 5 months ago
- A Unified Visual Generator with Interleaved OmniModal Context☆216Mar 5, 2026Updated 2 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]☆81Mar 3, 2026Updated 2 months ago
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆42Mar 24, 2026Updated last month
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models☆22Dec 21, 2025Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation☆411Updated this week
- [Official Repo] SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing☆200Apr 13, 2026Updated 3 weeks ago
- ☆332Jan 24, 2026Updated 3 months ago
- [ICLR2026] Any-to-Bokeh is a novel one-step video bokeh framework that converts arbitrary input videos into temporally coherent, depth-aw…☆135Feb 4, 2026Updated 3 months ago
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation☆72Apr 28, 2026Updated last week
- DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer☆625Mar 13, 2026Updated last month
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".☆148Dec 18, 2025Updated 4 months ago
- OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer☆232Apr 15, 2026Updated 3 weeks ago
- Python package for Zuna, an EEG foundation model for inference.☆287Mar 6, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)☆25Dec 7, 2023Updated 2 years ago
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆43Jan 17, 2025Updated last year
- Official code repository of '3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model'☆56Mar 20, 2026Updated last month
- ☆196Mar 11, 2026Updated last month
- Official Implementation of SAGE-GRPO:Manifold-Aware Exploration for Reinforcement Learning in Video Generation☆116Apr 2, 2026Updated last month
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"☆57Feb 2, 2026Updated 3 months ago
- ☆30May 7, 2025Updated 11 months ago