snap-research/VIMI

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/snap-research/VIMI)

snap-research / VIMI

☆13

Alternatives and similar repositories for VIMI

Users that are interested in VIMI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

google-deepmind / wyd-benchmark
View on GitHub
☆28Mar 3, 2025Updated last year
LingjieKong-fdu / CustAny
View on GitHub
Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral)
☆47Apr 10, 2025Updated last year
zwl666666 / infusion
View on GitHub
Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting
☆14Dec 19, 2025Updated 7 months ago
akhilkedia / TranformersGetStable
View on GitHub
[ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"
☆11Jul 19, 2024Updated 2 years ago
lambert-x / VideoAuteur
View on GitHub
VideoAuteur: Towards Long Narrative Video Generation
☆44Oct 22, 2025Updated 9 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ziqipang / ADDP
View on GitHub
[ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
☆15Jul 4, 2025Updated last year
zipengxuc / PPE
View on GitHub
Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L…
☆37Apr 13, 2022Updated 4 years ago
yigu1008 / Diffusion-RPO
View on GitHub
☆15Mar 30, 2025Updated last year
ljzycmd / SCD
View on GitHub
Consistent Human Image and Video Generation with Spatially Conditioned Diffusion
☆16Sep 1, 2025Updated 10 months ago
wuxiaofei01 / PFVG
View on GitHub
☆20Dec 24, 2025Updated 6 months ago
wtybest / EnMMDiT
View on GitHub
[TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation
☆15Mar 7, 2026Updated 4 months ago
abdo-eldesokey / latentman
View on GitHub
This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]
☆22Jul 21, 2024Updated 2 years ago
TerminologyHub / termhub-in-5-minutes
View on GitHub
Developer project for getting basic API integrations working in under 5 minutes
☆11May 22, 2026Updated 2 months ago
SobeyMIL / TVG
View on GitHub
code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"
☆50Aug 19, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
rezashkv / diffusion_pruning
View on GitHub
[ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.
☆15Feb 1, 2025Updated last year
Adamdad / vico
View on GitHub
Vico: Compositional Video Generation as Flow Equalization
☆59Nov 15, 2024Updated last year
videodreamer23 / videodreamer23.github.io
View on GitHub
☆31Nov 7, 2023Updated 2 years ago
snap-research / AVLink
View on GitHub
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
☆17Aug 3, 2025Updated 11 months ago
sjz5202 / LLaVA-Reward
View on GitHub
Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation
☆26Jul 30, 2025Updated 11 months ago
zipengxuc / PPE-Pytorch
View on GitHub
Pytorch Implementation for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pr…
☆28Jul 31, 2022Updated 3 years ago
Kebii / Freehand-Genshin-Diffusion
View on GitHub
Transferring Genshin PVs into a freehand style with Diffusion Model.
☆10Jun 5, 2024Updated 2 years ago
KwonGihyun / TweedieMix
View on GitHub
Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)
☆62Jan 22, 2025Updated last year
snakeye / FPC1020-Arduino
View on GitHub
Testing FPC1020 fingerprint sensors with Arduino
☆10Mar 25, 2020Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
INFINIQ-AI1 / CLIPVQDiffusion
View on GitHub
official implementation of "CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusi…
☆19Sep 5, 2024Updated last year
SHI-Labs / T2I-Copilot
View on GitHub
T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)
☆56Oct 6, 2025Updated 9 months ago
shengliu66 / FractionalReason
View on GitHub
Official github repo for "Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute"
☆17Jun 30, 2025Updated last year
Qichuzyy / POA
View on GitHub
Official implementation of ECCV24 paper: POA
☆24Aug 8, 2024Updated last year
kyegomez / AudioMamba
View on GitHub
Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch
☆15Updated this week
cfeng16 / GPS2Pix
View on GitHub
[CVPR 2025] GPS as a Control Signal for Image Generation
☆25Mar 18, 2025Updated last year
KaiyueSun98 / T2I-Personalization-with-AR
View on GitHub
☆47Apr 20, 2025Updated last year
Yikai-Wang / SeMani
View on GitHub
Official code for SeMani (CVPR 2020 oral and Journal extension)
☆25Dec 4, 2023Updated 2 years ago
chainstacklabs / chainstack-dlp-browser-extension
View on GitHub
Chrome extension that redacts potentially sensitive information before querying ChatGPT
☆13Aug 10, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
UCSB-AI / via-video
View on GitHub
☆25May 12, 2026Updated 2 months ago
chenllliang / DreamEngine
View on GitHub
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!
☆123Mar 4, 2025Updated last year
anvilco / grpc-lb-example
View on GitHub
An example of load-balancing a gRPC service through Docker
☆10Jun 11, 2026Updated last month
AlonzoLeeeooo / LCDG
View on GitHub
The official code implementation of "LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis".
☆37Dec 11, 2025Updated 7 months ago
GongyeLiu / Awesome-Alignment-of-Diffusion-Models
View on GitHub
paper collection: alignment of diffusion models
☆29Mar 6, 2026Updated 4 months ago
llyx97 / FETV
View on GitHub
[NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin L…
☆56Mar 4, 2024Updated 2 years ago
TencentARC / Video-Holmes
View on GitHub
[ECCV 2026] Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?
☆95Jul 13, 2025Updated last year