kyegomez / TTL
Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"
☆24Updated last week
Alternatives and similar repositories for TTL:
Users that are interested in TTL are comparing it to the libraries listed below
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆20Updated 3 weeks ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆49Updated 3 weeks ago
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆26Updated 3 weeks ago
- ☆41Updated last year
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆17Updated 3 months ago
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"☆18Updated 3 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆32Updated 10 months ago
- Implementation of ViTaR: ViTAR: Vision Transformer with Any Resolution in PyTorch☆30Updated 3 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated 10 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated 7 months ago
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Updated 3 months ago
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆31Updated 3 months ago
- Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with…☆25Updated 3 weeks ago
- Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆81Updated last week
- HGRN2: Gated Linear RNNs with State Expansion☆52Updated 6 months ago
- Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context☆25Updated 6 months ago
- ☆46Updated 10 months ago
- Collect papers about Mamba (a selective state space model).☆14Updated 6 months ago
- ☆22Updated 4 months ago
- A Triton Kernel for incorporating Bi-Directionality in Mamba2☆60Updated 2 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 6 months ago
- The official repo of continuous speculative decoding☆24Updated 3 months ago
- Explorations into improving ViTArc with Slot Attention☆37Updated 4 months ago
- This is the official repo for ByteVideoLLM/Dynamic-VLM☆19Updated 2 months ago
- ☆38Updated 3 months ago
- ☆41Updated last month
- Second Generation of the MAMBA Software☆28Updated 4 months ago
- A repository for DenseSSMs☆86Updated 10 months ago
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆20Updated last week
- Implementation of Qformer from BLIP2 in Zeta Lego blocks.☆35Updated 3 months ago