QuanjianSong / UniVSTLinks
[TPAMI 2025] Official Pytorch Code of the Paper "UniVST: A Unified Framework for Training-free Localized Video Style Transfer"
☆77Updated 2 weeks ago
Alternatives and similar repositories for UniVST
Users that are interested in UniVST are comparing it to the libraries listed below
Sorting:
- Inference pipeline for some Text-to-Image metrics.☆94Updated 2 months ago
- To support and further the research in the field of portrait animation , we are excited to launch PhotoPoster, an open project for pose-d…☆226Updated last year
- [ICLR 2025] InstantSwap: This repo is the official implementation of "InstantSwap: Fast Customized Concept Swapping across Sharp Shape Di…☆110Updated 8 months ago
- Teaching LMMs for Image Quality Scoring and Interpreting☆93Updated 7 months ago
- [CVPR 2025] Official implementation of "Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation"☆283Updated 5 months ago
- [ICCV 2025] Official code for "Align Your Rhythm: Generating Highly Aligned Dance Poses with Gating-Enhanced Rhythm-Aware Feature Represe…☆83Updated 2 weeks ago
- [ACMMM 2025] "Set You Straight: Auto-Steering Denoising Trajectories to Sidestep Unwanted Concepts" (Official Implementation)☆78Updated 4 months ago
- [CVPR'25] Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception☆15Updated last month
- [ICLR 2025 spotlight] 3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation☆261Updated 5 months ago
- [NeurIPS 2024] NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing☆168Updated 2 months ago
- About Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a …☆82Updated this week
- [NeurIPS`25] TC-Light: Temporally Coherent Generative Rendering for Realistic World Transfer☆96Updated last month
- Official implementation of the paper "GenCompositor: Generative Video Compositing with Diffusion Transformer"☆128Updated last month
- My personal tech notes on SDN, P4, INT, Go, and beyond — sharing what I learn from both research and hands-on development.☆136Updated this week
- Official Implementation of ECCV2024 paper: Chat Edit 3D: Interactive 3D Scene Editing via Large Language Model☆312Updated 5 months ago
- A curated list of papers on reinforcement learning for video generation☆218Updated 2 weeks ago
- CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation (ICML2025)☆117Updated 3 months ago
- Official Pytorch Code of the Paper "LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation"☆30Updated 3 months ago
- [NeurIPS 2024] EnsIR: An Ensemble Algorithm for Image Restoration via Gaussian Mixture Models☆49Updated last year
- The repository for this project is the code implementation of the paper MHIAIFormer: Multihead Interacted and Adaptive Integrated Transfo…☆22Updated 4 months ago
- Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)☆164Updated 11 months ago
- The code of paper "DeFillet: Detection and Removal of Fillet Regions in Polygonal CAD Models" , ACM Transactions on Graphics (SIGGRAPH 20…☆86Updated last week
- 【🚧 项目目前尚处于开发阶段,暂未完成开发,请过段时间再来看吧】寒霜物联 —— 支持轻量化快速接入的 IoT 设备统一接入平台☆387Updated 3 weeks ago
- Official repo for 'Large Multimodal Models Evaluation: A Survey'☆93Updated last week
- 这是一个专为开发企业级MCP server而设计的通用开发框架☆178Updated 6 months ago
- ☆176Updated 7 months ago
- In 2024, the strongest open-source implementation of asymmetric magvit_v2 supports inference code but excludes VQVAE. It supports the joi…☆151Updated last year
- VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model☆345Updated 7 months ago
- 使用推特API监控喊单KOL,并把内容推送到飞书☆110Updated last year
- This is the official code for the paper Tailor3D☆180Updated last year