JavisVerse/JavisGPT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JavisVerse/JavisGPT)

JavisVerse / JavisGPT

[NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"

☆75

Alternatives and similar repositories for JavisGPT

Users that are interested in JavisGPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

iSEE-Laboratory / ProEdit
View on GitHub
Official repository of paper "ProEdit: Inversion-based Editing From Prompts Done Right"
☆116Feb 5, 2026Updated 5 months ago
kszpxxzmc / ViSAudio
View on GitHub
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
☆117Dec 11, 2025Updated 7 months ago
ZheningHuang / SpaceTimePilot
View on GitHub
[CVPR 2026] SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time
☆123May 17, 2026Updated 2 months ago
LJungang / RTV-Bench
View on GitHub
[NeurIPS 2025] 𝓡𝓣𝓥-𝓑𝓮𝓷𝓬𝓱: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video.
☆33Jan 15, 2026Updated 6 months ago
Biangbiang0321 / SpotEdit
View on GitHub
SpotEdit:Selective Region Editing in Diffusion Transformers
☆196Jul 8, 2026Updated 3 weeks ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
LemonSky1995 / DreamStyle
View on GitHub
DreamStyle: A Unified Framework for Video Stylization
☆124Jan 7, 2026Updated 6 months ago
KYRIE-LI11 / VideoMark
View on GitHub
☆23Aug 23, 2025Updated 11 months ago
CVC2233 / AndroTMem
View on GitHub
AndroTMem: From Interaction Trajectories to Anchored Memory in Long-Horizon GUI Agents
☆25Jul 5, 2026Updated 3 weeks ago
dingyue772 / OmniSIFT
View on GitHub
[ICML2026] OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models
☆26May 21, 2026Updated 2 months ago
jamichss / Stream-DiffVSR
View on GitHub
The official repository of paper "Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion"
☆310Jan 12, 2026Updated 6 months ago
emjay73 / InfCam
View on GitHub
☆90May 13, 2026Updated 2 months ago
NVlabs / LoRWeB
View on GitHub
We propose a novel modular framework that learns to dynamically mix low-rank adapters (LoRAs) to improve visual analogy learning, enablin…
☆75Jun 22, 2026Updated last month
snowflakewang / CustomX
View on GitHub
[ECCV 2026] CustomX: Unified Character, Action, and Scene Customization in Video World Models
☆96Jun 25, 2026Updated last month
yangdongchao / UniAudio2Demo
View on GitHub
☆26Feb 10, 2026Updated 5 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
wren93 / tuna
View on GitHub
☆94Apr 29, 2026Updated 2 months ago
sjtuplayer / Harmony
View on GitHub
Audio-video joint generation
☆58Nov 27, 2025Updated 8 months ago
SOTAMak1r / VINO-code
View on GitHub
A Unified Visual Generator with Interleaved OmniModal Context
☆232Mar 5, 2026Updated 4 months ago
GAIR-NLP / LiveTalk
View on GitHub
☆328Jan 2, 2026Updated 6 months ago
TingtingLiao / mimix
View on GitHub
☆83Oct 13, 2025Updated 9 months ago
HiDream-ai / ReCo
View on GitHub
[ICML 2026] ReCo: In-Context Generation with Regional Constraints for Instructional Video Editing
☆171May 26, 2026Updated 2 months ago
snap-research / EgoEdit
View on GitHub
[CVPR 2026] 👋 Dataset and Benchmark code for EgoEdit
☆155Apr 5, 2026Updated 3 months ago
KlingAIResearch / SVG-T2I
View on GitHub
[Arxiv 2025] Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder…
☆152Dec 18, 2025Updated 7 months ago
HKUST-LongGroup / SwiftI2V
View on GitHub
[arXiv 2026] Project page for paper "SwiftI2V: Efficient High-Resolution Image-to-Video Generation via Conditional Segment-wise Generatio…
☆86May 8, 2026Updated 2 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
JavisVerse / JavisDiT
View on GitHub
[ICLR 2026] Official implementation of JavisDiT and JavisDiT++ series.
☆376Mar 29, 2026Updated 4 months ago
JiazheWei / PosterCopilot
View on GitHub
☆198Dec 10, 2025Updated 7 months ago
ZhaoJingjing713 / Spatia
View on GitHub
[CVPR2026] Long-horizon, spatially consistent video generation enabled by persistent 3D scene point clouds and dynamic-static disentangle…
☆220May 12, 2026Updated 2 months ago
AIGeeksGroup / UniMesh
View on GitHub
UniMesh: Unifying 3D Mesh Understanding and Generation
☆57Jul 14, 2026Updated 2 weeks ago
Francis-Rings / FlashPortrait
View on GitHub
[CVPR2026]We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length vide…
☆479Feb 21, 2026Updated 5 months ago
XiaokunSun / MorphAny3D
View on GitHub
[CVPR 2026] Official repo of "MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing“
☆110Apr 13, 2026Updated 3 months ago
LaVi-Lab / Rethink_CoT_Video
View on GitHub
Official code for "Rethinking Chain-of-Thought Reasoning for Videos"
☆21Dec 14, 2025Updated 7 months ago
mo230761 / UniGeo
View on GitHub
A framework for camera-controllable image editing using unified geometric guidance and video models.
☆65Jun 25, 2026Updated last month
360CVGroup / RefTon
View on GitHub
End2End Virtual Try-on with Visual Reference, CVPR2026
☆72Apr 18, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
prs-eth / stereospace
View on GitHub
☆75May 4, 2026Updated 2 months ago
KIMGEONUNG / VideoFrom3D
View on GitHub
[SIGGRAPH-ASIA 2025] Official implementation of "VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Model…
☆139Mar 19, 2026Updated 4 months ago
wsntxxn / UniFlow-Audio
View on GitHub
☆74Jul 17, 2026Updated last week
blurgyy / CoMPaSS
View on GitHub
[ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models
☆94Sep 11, 2025Updated 10 months ago
SihuiJi / LayerFlow
View on GitHub
[SIGGRAGH'25] Official repository of LayerFlow: A Unified Model for Layer-aware Video Generation
☆95Aug 18, 2025Updated 11 months ago
XingtongGe / Salt
View on GitHub
🧂 [ECCV 2026] Salt: Self-Consistent Distribution Matching with Cache-Aware Training for Fast Video Generation
☆16Apr 6, 2026Updated 3 months ago
stdstu12 / YUME
View on GitHub
The official code of Yume
☆679Jan 14, 2026Updated 6 months ago