[NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation
☆22Dec 17, 2024Updated last year
Alternatives and similar repositories for vector_quantization
Users that are interested in vector_quantization are comparing it to the libraries listed below.
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models☆15Sep 10, 2025Updated 7 months ago
- The extended code of layered conceptual image compression. Journal submitted.☆15Aug 29, 2022Updated 3 years ago
- ☆19Dec 20, 2025Updated 3 months ago
- ☆25Jan 9, 2026Updated 3 months ago
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 6 months ago
- Extreme Image Compression using Fine-tuned VQGAN Models (DCC 2024)☆24Jan 14, 2025Updated last year
- ☆25Jun 5, 2025Updated 10 months ago
- ☆18May 14, 2025Updated 10 months ago
- ☆13Aug 7, 2025Updated 8 months ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆29Aug 19, 2025Updated 7 months ago
- Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024.☆31Apr 19, 2024Updated last year
- [ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression☆78Jul 30, 2025Updated 8 months ago
- 👄🇧🇷 Forced phonetic alignment in Brazilian Portuguese☆13Jul 18, 2025Updated 8 months ago
- PyTorch Implementation of [AudioLCM]: an efficient and high-quality text-to-audio generation with latent consistency model.☆13Jun 15, 2024Updated last year
- ☆13Feb 2, 2023Updated 3 years ago
- Linear Attention for Efficient Bidirectional Sequence Modeling☆16May 13, 2025Updated 10 months ago
- [CVPR'25] MergeVQ: A Unified Framework for Visual Generation and Representation with Token Merging and Quantization☆47Jul 22, 2025Updated 8 months ago
- Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023☆11Sep 21, 2023Updated 2 years ago
- This repository represents a basic implementation of the paper "Riemannian Geometry of Deep Generative Models", along with the results on…☆12Oct 23, 2019Updated 6 years ago
- This program converts .fits files to .jpg (FITS to JPEG).☆13Jun 4, 2018Updated 7 years ago
- ☆13Apr 23, 2025Updated 11 months ago
- ☆29Feb 15, 2026Updated last month
- Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch☆14Updated this week
- We introduce a new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆22Jan 11, 2026Updated 2 months ago
- Code repository for the paper "MrT5: Dynamic Token Merging for Efficient Byte-level Language Models."☆56Sep 25, 2025Updated 6 months ago
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- ☆20Aug 14, 2025Updated 7 months ago
- ☆13Dec 12, 2023Updated 2 years ago
- Implementation of "Learning Deep Generative Models"☆12Jun 4, 2019Updated 6 years ago
- ArXiv daily paper push assistant: automatically fetches the latest AI papers from ArXiv, analyzes them in depth with DeepSeek, and pushes them to Feishu.☆46Feb 5, 2026Updated 2 months ago
- Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation"☆14Mar 1, 2023Updated 3 years ago
- ☆13Dec 17, 2024Updated last year
- Neural image compression models optimized for Mask R-CNN from paper "Boosting Neural Image Compression for Machines Using Latent Space Ma…☆10Aug 16, 2022Updated 3 years ago
- This is the code repo of our Pattern Recognition journal paper on IPR protection of Image Captioning Models☆11Aug 29, 2023Updated 2 years ago
- ☆20Dec 15, 2025Updated 3 months ago
- [KDD 2026 ADS Track] Pytorch implementation of the paper "Hi-Guard: Towards Trustworthy Multimodal Moderation via Policy-Aligned Reasonin…☆21Jan 13, 2026Updated 2 months ago
- Latent Diffusion Model-Enabled Low-Latency Semantic Communication in the Presence of Semantic Ambiguities and Wireless Channel Noises☆18Nov 19, 2024Updated last year
- Official implementation of "CLIP-VQDiffusion: Language Free Training of Text To Image generation using CLIP and vector quantized diffusi…☆19Sep 5, 2024Updated last year
- Design a patches masked autoencoder by CNN☆19Jun 6, 2024Updated last year