dkopi / Bitune
Implementation of Bitune: Bidirectional Instruction-Tuning
☆15Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for Bitune
- ☆31Updated 2 months ago
- Implementation of Infini-Transformer in Pytorch☆104Updated last month
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆43Updated last month
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)☆14Updated 9 months ago
- Language Quantized AutoEncoders☆94Updated last year
- ☆46Updated last month
- This repo is based on https://github.com/jiaweizzhao/GaLore☆18Updated last month
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆36Updated last year
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆29Updated 3 weeks ago
- ☆20Updated this week
- Official implementation of "BERTs are Generative In-Context Learners"☆19Updated 4 months ago
- ☆25Updated 11 months ago
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆62Updated 2 months ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆48Updated 2 months ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆53Updated 5 months ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆43Updated 5 months ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆49Updated last year
- Language models scale reliably with over-training and on downstream tasks☆94Updated 7 months ago
- Official code for the paper: "Metadata Archaeology"☆18Updated last year
- Holistic evaluation of multimodal foundation models☆41Updated 2 months ago
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆88Updated 10 months ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆18Updated 11 months ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆49Updated last month
- ☆61Updated 2 months ago
- ☆50Updated last week
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers"☆40Updated last month
- Code for "Merging Text Transformers from Different Initializations"☆19Updated 3 months ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated 11 months ago
- HGRN2: Gated Linear RNNs with State Expansion☆48Updated 2 months ago