[NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"
☆75Feb 26, 2026Updated 3 months ago
Alternatives and similar repositories for JavisGPT
Users who are interested in JavisGPT are comparing it to the libraries listed below.
- A framework for camera-controllable image editing using unified geometric guidance and video models.☆66Apr 28, 2026Updated last month
- [CVPR 2026] 👋 Dataset and Benchmark code for EgoEdit☆150Apr 5, 2026Updated 2 months ago
- [AAAI 2026] UltraGen☆78Feb 1, 2026Updated 4 months ago
- The official repository of EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model (CVPRW 2026)☆49Apr 19, 2026Updated last month
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control☆187Dec 11, 2025Updated 6 months ago
- DreamStyle: A Unified Framework for Video Stylization☆119Jan 7, 2026Updated 5 months ago
- UniMesh: Unifying 3D Mesh Understanding and Generation☆57May 8, 2026Updated last month
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"☆43Jan 29, 2026Updated 4 months ago
- Resilient multi-LLM orchestration with built-in failure handling, rate limits, retries, and circuit breaker.☆47Jun 1, 2026Updated 2 weeks ago
- Official repository of paper "ProEdit: Inversion-based Editing From Prompts Done Right"☆117Feb 5, 2026Updated 4 months ago
- Official code for SongEcho☆64Mar 3, 2026Updated 3 months ago
- ☆88Mar 16, 2026Updated 3 months ago
- SpotEdit: Selective Region Editing in Diffusion Transformers☆194Jan 5, 2026Updated 5 months ago
- Code for paper "CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering" [NeurIPS 2025 (spotlight)]☆77Aug 2, 2025Updated 10 months ago
- A Unified Visual Generator with Interleaved OmniModal Context☆228Mar 5, 2026Updated 3 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]☆87Mar 3, 2026Updated 3 months ago
- [ICML 2026] Transform Trained Transformer for Accelerating Native 4K Video Generation☆41Dec 16, 2025Updated 6 months ago
- Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"☆43Mar 24, 2026Updated 2 months ago
- [Official Repo] SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing☆211Apr 13, 2026Updated 2 months ago
- ☆333Jan 24, 2026Updated 4 months ago
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models☆27Dec 21, 2025Updated 5 months ago
- [ICLR2026] Any-to-Bokeh is a novel one-step video bokeh framework that converts arbitrary input videos into temporally coherent, depth-aw…☆140Feb 4, 2026Updated 4 months ago
- [CVPR 2026] Scaling Zero-Shot Reference-to-Video Generation☆75Apr 28, 2026Updated last month
- DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer☆648May 22, 2026Updated 3 weeks ago
- End2End Virtual Try-on with Visual Reference, CVPR2026☆68Apr 18, 2026Updated 2 months ago
- We propose a novel modular framework that learns to dynamically mix low-rank adapters (LoRAs) to improve visual analogy learning, enablin…☆73Apr 12, 2026Updated 2 months ago
- Python package for Zuna, an EEG foundation model for inference.☆301May 8, 2026Updated last month
- This repository is for The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV 2023)☆25Dec 7, 2023Updated 2 years ago
- Official code repository of '3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model'☆58Mar 20, 2026Updated 2 months ago
- ☆30May 7, 2025Updated last year
- Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation☆710Jun 9, 2026Updated last week
- Official implementation of "Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model"☆267Apr 25, 2026Updated last month
- [CVPR 2026] Official code and models for Video Encoder-only Mask Transformer (VidEoMT).☆238Jun 8, 2026Updated last week
- Implementation of an X86 mini OS from scratch. Reference: https://github.com/yyu/osfs00☆11Jan 9, 2023Updated 3 years ago
- When real time Yoga Position classification meets GNN☆11Sep 17, 2023Updated 2 years ago
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆29Feb 11, 2026Updated 4 months ago
- ☆34Dec 29, 2025Updated 5 months ago
- ☆12Nov 12, 2024Updated last year
- Official Implementation of SAGE-GRPO: Manifold-Aware Exploration for Reinforcement Learning in Video Generation☆124Apr 2, 2026Updated 2 months ago