Official implementation of "CLIP-VQDiffusion: Language Free Training of Text To Image generation using CLIP and vector quantized diffusion model"
☆19Sep 5, 2024Updated last year
Alternatives and similar repositories for CLIPVQDiffusion
Users that are interested in CLIPVQDiffusion are comparing it to the libraries listed below
- ☆17May 13, 2025Updated 10 months ago
- [ICME 2025] DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation☆24Mar 25, 2025Updated 11 months ago
- Fine-tune your Florence-2 model easily☆21Jul 27, 2024Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Using image captions with LLM for zero-shot VQA☆18Mar 14, 2024Updated 2 years ago
- Implementation of Self-supervised-Online-Adversarial-Purification☆13Aug 2, 2021Updated 4 years ago
- ☆13Jul 10, 2024Updated last year
- RosenPy is a complex-valued neural network library, written in Python; Incorporates CVNNs such as CV-FFNN (complex-valued feedforward neu…☆14Sep 17, 2024Updated last year
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation☆11Mar 7, 2026Updated 2 weeks ago
- [ICCV 2025] Diffusion Curriculum (DisCL)☆18Sep 26, 2025Updated 5 months ago
- The official implementation of Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion [AAAI'2…☆16Feb 2, 2026Updated last month
- Code for implementing a conditional DDPM trained on CIFAR10☆13Jan 15, 2024Updated 2 years ago
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- B.Tech project on a demodulation technique for OTFS (Orthogonal Time Frequency Space) under imperfect Channel State Information and lower SNR (dB…☆11May 12, 2024Updated last year
- AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation☆16Aug 3, 2025Updated 7 months ago
- ☆41Oct 29, 2025Updated 4 months ago
- [CVPR 2025] TexTalker: Towards High-fidelity 3D Talking Avatar with Personalized Dynamic Texture☆33Jan 12, 2026Updated 2 months ago
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models☆15Sep 10, 2025Updated 6 months ago
- [ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression☆77Jul 30, 2025Updated 7 months ago
- FFNet: MetaMixer-based Efficient Convolutional Mixer Design☆32Mar 11, 2025Updated last year
- DSSLIC: Deep Semantic Segmentation-based Layered Image Compression☆10Dec 10, 2018Updated 7 years ago
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- Latent Diffusion Model-Enabled Low-Latency Semantic Communication in the Presence of Semantic Ambiguities and Wireless Channel Noises☆18Nov 19, 2024Updated last year
- A CNN-based patch-masked autoencoder design☆18Jun 6, 2024Updated last year
- Implementation of FA-VAE: Frequency Augmented VAE☆13May 5, 2023Updated 2 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 11 months ago
- Implementation of O-OFDMNet, a deep learning-based optical OFDM system☆12Dec 29, 2021Updated 4 years ago
- Digital-twin-enabled 6G: Depth Map Estimation in mmWave systems☆14May 24, 2023Updated 2 years ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated last year
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year
- The extended code of layered conceptual image compression. Journal submitted.☆15Aug 29, 2022Updated 3 years ago
- ☆81Oct 18, 2025Updated 5 months ago
- Train vector quantized CLIP models using pytorch lightning☆20Jul 14, 2024Updated last year
- ∞-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆19Feb 14, 2025Updated last year
- A Semantic Communication System Based on Robust Knowledge Distillation☆19Apr 14, 2025Updated 11 months ago
- This repository contains the code for the paper - "Aligning Text, Images, and 3D Structure Token-by-Token" (CVPR 2026)☆44Jun 11, 2025Updated 9 months ago
- ☆25Updated this week
- [CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models☆51Jun 12, 2025Updated 9 months ago
- [CVPR'25] MergeVQ: A Unified Framework for Visual Generation and Representation with Token Merging and Quantization☆47Jul 22, 2025Updated 7 months ago