kyegomez / qformerLinks
Implementation of Qformer from BLIP2 in Zeta Lego blocks.
☆45Updated last year
Alternatives and similar repositories for qformer
Users that are interested in qformer are comparing it to the libraries listed below
Sorting:
- Keras implement of Finite Scalar Quantization☆85Updated 2 years ago
- [ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization☆103Updated 5 months ago
- Implementation of a multimodal diffusion transformer in Pytorch☆106Updated last year
- Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise"☆59Updated 2 years ago
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆25Updated 9 months ago
- The official GitHub page for the survey paper "Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey". And this paper is unde…☆72Updated 3 months ago
- ☆199Updated last year
- ☆138Updated last year
- Implementation of the proposed MaskBit from Bytedance AI☆82Updated last year
- A project for tri-modal LLM benchmarking and instruction tuning.☆50Updated 7 months ago
- A repository for DenseSSMs☆89Updated last year
- [ICLR 2023] Official implementation of Transnormer in our ICLR 2023 paper - Toeplitz Neural Network for Sequence Modeling☆80Updated last year
- UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model☆22Updated last year
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆62Updated last year
- Code for paper "Patch-Level Training for Large Language Models"☆93Updated last week
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆37Updated last year
- Implementation of Agent Attention in Pytorch☆92Updated last year
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆37Updated last year
- This is a simple torch implementation of the high performance Multi-Query Attention☆15Updated 2 years ago
- [ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning☆133Updated 2 weeks ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Updated last year
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"☆181Updated last year
- [CVPR 2025 Highlight] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for C…☆270Updated 10 months ago
- [ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear at…☆104Updated last year
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆25Updated last week
- (Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from …☆182Updated last year
- PyTorch implementation of StableMask (ICML'24)☆14Updated last year
- ☆90Updated last year
- Implementation of Infini-Transformer in Pytorch☆113Updated 10 months ago
- [EMNLP 2022] Official implementation of Transnormer in our EMNLP 2022 paper - The Devil in Linear Transformer☆63Updated 2 years ago