[NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation
☆22Dec 17, 2024Updated last year
Alternatives and similar repositories for vector_quantization
Users that are interested in vector_quantization are comparing it to the libraries listed below
Sorting:
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models☆15Sep 10, 2025Updated 6 months ago
- ☆19Dec 20, 2025Updated 3 months ago
- Code repository for study ''Evaluating the representational power of pre-trained DNA language models for regulatory genomics"☆24Jun 26, 2024Updated last year
- ☆25Jun 5, 2025Updated 9 months ago
- ☆18May 14, 2025Updated 10 months ago
- This is the official implementation for the paper "Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-…☆29Feb 8, 2026Updated last month
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- ☆13Aug 7, 2025Updated 7 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated last month
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆29Aug 19, 2025Updated 7 months ago
- [ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression☆77Jul 30, 2025Updated 7 months ago
- Official repository of the "Transformer Fusion with Optimal Transport" paper, published as a conference paper at ICLR 2024.☆31Apr 19, 2024Updated last year
- 👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro☆13Jul 18, 2025Updated 8 months ago
- Linear Attention for Efficient Bidirectional Sequence Modeling☆16May 13, 2025Updated 10 months ago
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning"☆11Oct 11, 2024Updated last year
- Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023☆11Sep 21, 2023Updated 2 years ago
- This repository represents a basic implementation of the paper "Riemannian Geometry of Deep Generative Models", along with the results on…☆12Oct 23, 2019Updated 6 years ago
- ☆28Feb 15, 2026Updated last month
- This program converts .fits file to .jpg. Fits to jpeg.☆13Jun 4, 2018Updated 7 years ago
- [TCSVT'22]Joint Graph Attention and Asymmetric Convolutional Neural Network for Deep Image Compression☆13Nov 14, 2022Updated 3 years ago
- B.Tech Project On demodulation technique of OTFS(Orthogonal Time Frequency Space) at imperfect Channel State Information and lower SNR(dB…☆11May 12, 2024Updated last year
- [ICML 2025] Fast and Low-Cost Genomic Foundation Models via Outlier Removal.☆18Jun 19, 2025Updated 9 months ago
- Code repository for the paper "MrT5: Dynamic Token Merging for Efficient Byte-level Language Models."☆56Sep 25, 2025Updated 5 months ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆21Jan 11, 2026Updated 2 months ago
- ☆27Updated this week
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- Library to extract embeddings for DNA sequences using BioFM genomics foundation model☆19Aug 13, 2025Updated 7 months ago
- Code for "A Principled Framework for Multi-View Contrastive Learning"☆20Jul 10, 2025Updated 8 months ago
- ☆20Aug 14, 2025Updated 7 months ago
- DSSLIC: Deep Semantic Segmentation-based Layered Image Compression☆10Dec 10, 2018Updated 7 years ago
- ☆13Dec 12, 2023Updated 2 years ago
- ArXiv 每日论文推送助手 自动抓取 ArXiv 最新 AI 论文,使用 DeepSeek 进行深度分析,并推送到飞书。☆37Feb 5, 2026Updated last month
- [TPAMI'2023]Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling☆11Jan 3, 2023Updated 3 years ago
- Neural image compression models optimized for Mask R-CNN from paper "Boosting Neural Image Compression for Machines Using Latent Space Ma…☆10Aug 16, 2022Updated 3 years ago
- This is the code repo of our Pattern Recognition journal on IPR protection of Image Captioning Models☆11Aug 29, 2023Updated 2 years ago
- ☆12Dec 17, 2024Updated last year
- [KDD 2026 ADS Track] Pytorch implementation of the paper "Hi-Guard: Towards Trustworthy Multimodal Moderation via Policy-Aligned Reasonin…☆21Jan 13, 2026Updated 2 months ago
- Latent Diffusion Model-Enabled Low-Latency Semantic Communication in the Presence of Semantic Ambiguities and Wireless Channel Noises☆18Nov 19, 2024Updated last year