theAdamColton / vq-clipLinks

Train vector quantized CLIP models using pytorch lightning

☆20

Alternatives and similar repositories for vq-clip

Users that are interested in vq-clip are comparing it to the libraries listed below

Sorting:

MarkXCloud / CSpD
The official repo of continuous speculative decoding
☆27Updated 4 months ago
hila-chefer / Conceptor
Official implementation of the paper The Hidden Language of Diffusion Models
☆74Updated last year
philippe-eecs / small-vision
A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.
☆34Updated last year
OliverRensu / MVAR
☆70Updated 8 months ago
pinterest / atg-research
☆62Updated this week
chenllliang / DnD-Transformer
[ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…
☆76Updated 7 months ago
j-min / VPGen
Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
☆56Updated 2 years ago
mhh0318 / UniD3
☆54Updated 2 years ago
wmn-231314 / diffusion-data-constraint
☆41Updated 2 weeks ago
TIGER-AI-Lab / VIEScore
Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…
☆50Updated 8 months ago
Lucky-Lance / TerDiT
TerDiT: Ternary Diffusion Models with Transformers
☆71Updated last year
ruocwang / dpo-diffusion
[ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google
☆60Updated 11 months ago
kyegomez / MAGVIT2
Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"
☆15Updated 8 months ago
philippe-eecs / vitok
☆34Updated 2 months ago
aszala / VPEval
VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)
☆45Updated last year
huggingface / amused
☆86Updated last year
jialuli-luka / SELMA
Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
☆34Updated last year
FutureXiang / edm2
Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"
☆34Updated last year
nanlliu / Unsupervised-Compositional-Concepts-Discovery
[ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models
☆84Updated last year
measure-infinity / mulan-code
☆41Updated last year
luping-liu / LongAlign
The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)
☆76Updated 3 months ago
HelmholtzAI-FZJ / flex_gen
☆17Updated 6 months ago
g-luo / vlm_cross_modal_reps
Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025
☆29Updated 3 months ago
NVlabs / T-Stitch
[ICLR 2025] Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…
☆103Updated last year
DAMO-NLP-SG / DiGIT
[NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
☆69Updated 9 months ago
drx-code / EquivariantModeling
Official PyTorch implementation of the paper "Equivariant Image Modeling"(https://arxiv.org/abs/2503.18948)
☆34Updated 3 months ago
TIGER-AI-Lab / Vamba
Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers" [ICCV 2025]
☆78Updated last week
LINs-lab / GMem
[Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models
☆39Updated 4 months ago
eclipse-t2i / eclipse-inference
[CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"
☆64Updated last year
visual-gen / semanticist
(ICCV 2025) "Principal Components" Enable A New Language of Images
☆54Updated last week