haoliuhl / language-quantized-autoencoders
Language Quantized AutoEncoders
☆104 Updated 2 years ago
Alternatives and similar repositories for language-quantized-autoencoders:
Users who are interested in language-quantized-autoencoders are comparing it to the repositories listed below.
- https://arxiv.org/abs/2209.15162 ☆49 Updated 2 years ago
- Patching open-vocabulary models by interpolating weights ☆91 Updated last year
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch ☆88 Updated last year
- ☆117 Updated 2 years ago
- ☆95 Updated last year
- ☆51 Updated last year
- Toolkit for Elevater Benchmark ☆70 Updated last year
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023. ☆32 Updated last year
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models ☆44 Updated 10 months ago
- Command-line tool for downloading and extending the RedCaps dataset. ☆46 Updated last year
- PyTorch code for "Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention" (WACV 2023) ☆33 Updated 2 years ago
- ☆54 Updated 2 years ago
- Code release for "MERLOT Reserve: Neural Script Knowledge through Vision and Language and Sound" ☆140 Updated 2 years ago
- ☆128 Updated 2 years ago
- Compress conventional Vision-Language Pre-training data ☆49 Updated last year
- ☆58 Updated last year
- ☆25 Updated 7 months ago
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight ☆37 Updated last year
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning" ☆76 Updated last year
- [ICLR 2023] Official implementation of Transnormer in our ICLR 2023 paper - Toeplitz Neural Network for Sequence Modeling ☆79 Updated last year
- Visual Language Transformer Interpreter - An interactive visualization tool for interpreting vision-language transformers ☆92 Updated last year
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs" ☆73 Updated 5 months ago
- Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023 ☆161 Updated last year
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024) ☆63 Updated 9 months ago
- [TMLR] Public code repo for paper "A Single Transformer for Scalable Vision-Language Modeling" ☆133 Updated 5 months ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging" ☆38 Updated last year
- Source code for the paper "Prefix Language Models are Unified Modal Learners" ☆43 Updated 2 years ago
- Matryoshka Multimodal Models ☆101 Updated 3 months ago
- ☆48 Updated last year
- Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data". ☆78 Updated 2 years ago