jsonBackup / Latent-CLIP-DemoLinks

☆14

Alternatives and similar repositories for Latent-CLIP-Demo

Users that are interested in Latent-CLIP-Demo are comparing it to the libraries listed below

Sorting:

threedle / hyperfields
☆21Updated 7 months ago
top-yun / SPARK
A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.
☆18Updated 6 months ago
AniAggarwal / ecad
Code for Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
☆25Updated 3 weeks ago
CompVis / DisCLIP
[AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?
☆17Updated 6 months ago
SerChirag / rs-imle
RS-IMLE
☆41Updated 7 months ago
kyegomez / KosmosG
My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"
☆14Updated 8 months ago
zhangjiewu / awesome-t2i-eval
A curated list of papers and resources for text-to-image evaluation.
☆29Updated last year
kyegomez / MAGVIT2
Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"
☆15Updated 8 months ago
elad-amrani / xtra
PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025
☆12Updated 4 months ago
NVlabs / GSPN
[CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network
☆100Updated 3 weeks ago
WalBouss / MaskInversion
☆26Updated 9 months ago
jialuli-luka / SELMA
Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
☆34Updated last year
xie-lab-ml / IV-mixed-Sampler
[ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis
☆33Updated 5 months ago
mrazhou / SeTa
[CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"
☆18Updated 4 months ago
oooolga / Ctrl-V
👆Pytorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion"
☆27Updated 8 months ago
viiika / HumanEdit
[CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…
☆30Updated 2 months ago
BeyondScene / BeyondScene
[ECCV 2024] BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion
☆21Updated last year
MarkXCloud / CSpD
The official repo of continuous speculative decoding
☆27Updated 3 months ago
kaist-cvml / scribble-guided-diffusion
[ICIP 2025] Scribble-Guided Diffusion for Training-free Text-to-Image Generation
☆22Updated 9 months ago
Corleone-Huang / DynamicVectorQuantization
☆21Updated 2 years ago
lzzcd001 / nabla-gfn
Official Implementation of Nabla-GFlowNet (ICLR 2025)
☆24Updated 2 months ago
JohannesTheo / trapped-in-texture-bias
Official code release for the paper Trapped in texture bias? A large scale comparison of deep instance segmentation, accepted at ECCV 202…
☆16Updated last year
marco-garosi / COPS
Official implementation of the WACV 2025 paper "3D Part Segmentation via Geometric Aggregation of 2D Visual Features"
☆19Updated last month
HanSolo9682 / CounterCurate
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆18Updated last year
HITsz-TMG / Agentic-CIGEval
Code of our paper "A Unified Agentic Framework for Evaluating Conditional Image Generation".
☆25Updated 3 months ago
VISION-SJTU / VidToMe
[CVPR 2024] VidToMe: Video Token Merging for Zero-Shot Video Editing
☆19Updated last year
helia95 / VASE
☆15Updated last year
AIGCResearch / styleme3d
Official repo for StyleMe3D
☆23Updated 2 months ago
OliverRensu / MVAR
☆70Updated 8 months ago
rhfeiyang / art-free-diffusion
Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge"
☆31Updated 3 months ago