franciszzj / Saber
Scaling Zero-Shot Reference-to-Video Generation
☆62 Dec 11, 2025 Updated 2 months ago
Alternatives and similar repositories for Saber
Users who are interested in Saber are comparing it to the libraries listed below.
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI ☆68 Jan 15, 2026 Updated last month
- DreamStyle: A Unified Framework for Video Stylization ☆110 Jan 7, 2026 Updated last month
- [AAAI 2026] UltraGen ☆78 Feb 1, 2026 Updated 2 weeks ago
- 👋 Dataset and Benchmark code for EgoEdit ☆106 Dec 11, 2025 Updated 2 months ago
- OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models ☆153 Sep 24, 2025 Updated 4 months ago
- Reflection Removal through Efficient Adaptation of Diffusion Transformers ☆117 Dec 5, 2025 Updated 2 months ago
- ☆86 Feb 4, 2026 Updated last week
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation" ☆229 Aug 22, 2025 Updated 5 months ago
- This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025. ☆16 Oct 20, 2025 Updated 3 months ago
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization". ☆168 Feb 4, 2026 Updated last week
- ☆17 Jan 17, 2025 Updated last year
- Cost-Sensitive Toolpath Agent for Multi-turn Image Editing ☆25 Mar 26, 2025 Updated 10 months ago
- Long-horizon, spatially consistent video generation enabled by persistent 3D scene point clouds and dynamic-static disentanglement. ☆167 Dec 18, 2025 Updated last month
- Animate Any Character in Any World ☆88 Jan 9, 2026 Updated last month
- ☆11 Sep 12, 2025 Updated 5 months ago
- [ICLR 2026] Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing ☆29 Feb 6, 2026 Updated last week
- A Unified Visual Generator with Interleaved OmniModal Context ☆180 Updated this week
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control ☆166 Dec 11, 2025 Updated 2 months ago
- ☆23 Mar 25, 2024 Updated last year
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation' ☆37 Dec 30, 2025 Updated last month
- ☆92 Dec 28, 2025 Updated last month
- ☆19 May 17, 2025 Updated 8 months ago
- Custom node for ComfyUI. Add useful nodes related to prompt. ☆25 Mar 30, 2025 Updated 10 months ago
- Official PyTorch implementation of "ViBiDSampler: Enhancing Video Interpolation Using Bidirectional Diffusion Sampler" ☆21 Jan 25, 2025 Updated last year
- https://little-misfit.github.io/GRAG-Image-Editing/ ☆116 Nov 27, 2025 Updated 2 months ago
- CARI4D: Category Agnostic 4D Reconstruction of Human-Object Interaction ☆103 Dec 24, 2025 Updated last month
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (ICLR 2026) ☆41 Jul 10, 2025 Updated 7 months ago
- End2End Virtual Try-on with Visual Reference ☆57 Nov 19, 2025 Updated 2 months ago
- Official repository for CVPR 2025 paper PERSE: Personalized 3D Generative Avatars from A Single Portrait ☆130 Jul 28, 2025 Updated 6 months ago
- OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer ☆215 Jan 26, 2026 Updated 2 weeks ago
- ☆131 Dec 24, 2025 Updated last month
- One-shot and Few-shot 3D Editing without Per-Scene Optimization ☆167 Aug 21, 2025 Updated 5 months ago
- A ComfyUI node for Maya1, a 3B-parameter speech model built for expressive voice generation with rich human emotion and precise voice des… ☆57 Nov 11, 2025 Updated 3 months ago
- Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation ☆28 Apr 11, 2025 Updated 10 months ago
- [NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance ☆563 Jan 5, 2026 Updated last month
- Official code for StoryMem: Multi-shot Long Video Storytelling with Memory ☆644 Jan 22, 2026 Updated 3 weeks ago
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture." ☆61 Dec 16, 2025 Updated last month
- Code release for AccDiffusionV2 (TPAMI) ☆35 Nov 4, 2025 Updated 3 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w… ☆13 Jun 28, 2025 Updated 7 months ago