☆49Feb 9, 2026Updated 3 weeks ago
Alternatives and similar repositories for VIBE
Users that are interested in VIBE are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] Official code for ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation☆33Oct 17, 2025Updated 4 months ago
- ☆16Sep 1, 2025Updated 6 months ago
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆35Nov 25, 2024Updated last year
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull☆13Oct 9, 2023Updated 2 years ago
- [ICCV'25] ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment☆36Oct 5, 2025Updated 5 months ago
- Tiny AutoEncoder for Stable Diffusion Videos☆36Oct 5, 2024Updated last year
- ☆18Oct 21, 2024Updated last year
- Program that enables seamless interaction with your documents through an advanced vector database and the power of Large Language Model (…☆18Sep 12, 2023Updated 2 years ago
- Dungeon procedural generator similar to whatabou's "One Page Dungeon"☆50Jan 4, 2026Updated 2 months ago
- [ACM MM24 Poster] Official implementation of paper "MVPbev: Multi-view Perspective Image Generation from BEV with Test-time Controllabili…☆20Sep 6, 2025Updated 5 months ago
- VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning☆59Nov 4, 2025Updated 4 months ago
- ControlNet control image preprocess library☆15Feb 27, 2023Updated 3 years ago
- ComfyUI-HiggsAudio is now available in ComfyUI, Higgs Audio v2 is a text-audio foundation model from Boson AI.☆22Jul 26, 2025Updated 7 months ago
- ☆20Jun 26, 2024Updated last year
- ☆11Sep 12, 2025Updated 5 months ago
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆19Dec 28, 2024Updated last year
- Official code for paper: F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Aggregative Gaussian Splatting☆50Mar 11, 2025Updated 11 months ago
- finetune your florence2 model easy☆21Jul 27, 2024Updated last year
- [SIGGRAPHASIA2025] InfiniHuman: Infinite 3D Human Creation with Precise Control☆84Oct 14, 2025Updated 4 months ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆16Feb 4, 2024Updated 2 years ago
- The official code for paper "GPSToken: Gaussian Parameterized Spatially-adaptive Tokenization for Image Representation and Generation"☆49Sep 28, 2025Updated 5 months ago
- Frame Interpolation Refined with Stable Diffusion via Control Net☆24Jul 5, 2023Updated 2 years ago
- [CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"☆23Mar 18, 2025Updated 11 months ago
- ☆19Jul 11, 2024Updated last year
- Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip☆37Jan 27, 2026Updated last month
- Official code for our Paper "SSL: A Self-similarity Loss for Improving Generative Image Super-resolution" in ACMMM 2024☆50Jun 1, 2025Updated 9 months ago
- ☆25Mar 30, 2025Updated 11 months ago
- Toward Generalizing Visual Brain Decoding to Unseen Subjects☆28May 14, 2025Updated 9 months ago
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control☆97May 13, 2025Updated 9 months ago
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions☆45Jun 11, 2025Updated 8 months ago
- Music production for silent film clips.☆32Apr 30, 2025Updated 10 months ago
- [ICCV 2025] Deeply Supervised Flow-Based Generative Models☆28Jun 26, 2025Updated 8 months ago
- [SIGGRAPH-ASIA 2025] Official implementation of "VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Model…☆126Oct 27, 2025Updated 4 months ago
- ☆33Aug 9, 2024Updated last year
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆24Jun 24, 2024Updated last year
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"☆30Apr 27, 2024Updated last year
- A modular graph based DataSet implementation for Pytorch☆37Updated this week
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆62Dec 16, 2025Updated 2 months ago
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆25Jul 2, 2024Updated last year