official implementation of "CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusion model"
☆19Sep 5, 2024Updated last year
Alternatives and similar repositories for CLIPVQDiffusion
Users that are interested in CLIPVQDiffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17May 13, 2025Updated 11 months ago
- ☆12Jan 30, 2024Updated 2 years ago
- pytorch implementation of "Efficiently Reconstructing Dynamic Scenes One 🎯 D4RT at a Time"☆54Jan 27, 2026Updated 3 months ago
- [ICME 2025] DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation☆24Mar 25, 2025Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Jul 10, 2024Updated last year
- ☆22Jul 3, 2025Updated 9 months ago
- Using image captions with LLM for zero-shot VQA☆19Mar 14, 2024Updated 2 years ago
- ☆12Jul 19, 2022Updated 3 years ago
- RosenPy is a complex-valued neural network library, written in Python; Incorporates CVNNs such as CV-FFNN (complex-valued feedforward neu…☆14Sep 17, 2024Updated last year
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation☆13Mar 7, 2026Updated last month
- [ICCV 2025] Diffusion Curriculum (DisCL)☆18Sep 26, 2025Updated 7 months ago
- The official implementation of Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion [AAAI'2…☆16Feb 2, 2026Updated 2 months ago
- Code for implemeting a conditional DDPM trained on CIFAR10☆14Jan 15, 2024Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- [TCSVT'22]Joint Graph Attention and Asymmetric Convolutional Neural Network for Deep Image Compression☆13Nov 14, 2022Updated 3 years ago
- B.Tech Project On demodulation technique of OTFS(Orthogonal Time Frequency Space) at imperfect Channel State Information and lower SNR(dB…☆12May 12, 2024Updated last year
- AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation☆17Aug 3, 2025Updated 8 months ago
- [ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression☆80Jul 30, 2025Updated 9 months ago
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆18Aug 12, 2025Updated 8 months ago
- ☆20Mar 14, 2022Updated 4 years ago
- FFNet: MetaMixer-based Efficient Convolutional Mixer Design☆34Mar 11, 2025Updated last year
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models☆16Sep 10, 2025Updated 7 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆41Jun 6, 2024Updated last year
- [TPAMI'2023]Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling☆11Jan 3, 2023Updated 3 years ago
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- Latent Diffusion Model-Enabled Low-Latency Semantic Communication in the Presence of Semantic Ambiguities and Wireless Channel Noises☆18Nov 19, 2024Updated last year
- Design a patches masked autoencoder by CNN☆19Jun 6, 2024Updated last year
- ☆45Oct 29, 2025Updated 6 months ago
- 切割手寫 png 打包 ttf☆14Aug 4, 2024Updated last year
- Implementation of O-OFDMNet, a deep learning-based optical OFDM system☆12Dec 29, 2021Updated 4 years ago
- Digital-twin-enabled 6G: Depth Map Estimation in mmWave systems☆14May 24, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated last year
- The extented code of layered conceptual image compression. Journal submitted.☆15Aug 29, 2022Updated 3 years ago
- ☆83Oct 18, 2025Updated 6 months ago
- TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆38Dec 17, 2021Updated 4 years ago
- Train vector quantized CLIP models using pytorch lightning☆20Jul 14, 2024Updated last year
- \infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆21Feb 14, 2025Updated last year
- A Semantic Communication System Based on Robust Knowledge Distillation☆20Apr 14, 2025Updated last year