kyegomez / MMCALinks
The open source community's implementation of the all-new Multi-Modal Causal Attention from "DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention"
☆11Updated last year
Alternatives and similar repositories for MMCA
Users that are interested in MMCA are comparing it to the libraries listed below
Sorting:
- ☆13Updated 2 years ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆13Updated 6 months ago
- A Data Source for Reasoning Embodied Agents☆19Updated last year
- Transformers + Mambas + LSTMS All in One Model☆9Updated this week
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆24Updated this week
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆20Updated 2 months ago
- Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…☆11Updated last year
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Updated 6 months ago
- Explorations into improving ViTArc with Slot Attention☆41Updated 7 months ago
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆19Updated last year
- Official implementation of ECCV24 paper: POA☆24Updated 9 months ago
- ☆13Updated last year
- Implementation of Metaformer, but in an autoregressive manner☆25Updated 2 years ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆34Updated last year
- Tools for content datamining and NLP at scale☆43Updated 11 months ago
- Visual RAG using less than 300 lines of code.☆27Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆37Updated last year
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated 8 months ago
- A minimal re-implementation of orthogonal fine-tuning (OFT) for LLMs. Based on nanoGPT and minLoRA.☆12Updated last year
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆25Updated last month
- Explorations into adversarial losses on top of autoregressive loss for language modeling☆36Updated 3 months ago
- ☆23Updated last year
- Directed masked autoencoders☆14Updated 2 years ago
- Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p…☆13Updated this week
- Implementation of a holodeck, written in Pytorch☆18Updated last year
- HGRN2: Gated Linear RNNs with State Expansion☆54Updated 9 months ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated 6 months ago
- The open source implementation of the model from "Scaling Vision Transformers to 22 Billion Parameters"☆28Updated 2 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆35Updated 11 months ago