ariG23498 / mmdpLinks
☆19Updated last week
Alternatives and similar repositories for mmdp
Users that are interested in mmdp are comparing it to the libraries listed below
Sorting:
- Notebooks to demonstrate TimmWrapper☆16Updated 5 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 11 months ago
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.☆81Updated 2 months ago
- ☆63Updated 9 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 8 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆40Updated 9 months ago
- ☆58Updated last year
- MatFormer repo☆47Updated 7 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆55Updated last year
- ☆15Updated 11 months ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆49Updated 3 months ago
- Implementations of attention with the softpick function, naive and FlashAttention-2☆80Updated 2 months ago
- Implementation of Infini-Transformer in Pytorch☆111Updated 6 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆94Updated 6 months ago
- Easily run PyTorch on multiple GPUs & machines☆46Updated 2 weeks ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 7 months ago
- Collection of autoregressive model implementation☆85Updated 2 months ago
- This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"☆108Updated 2 weeks ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆65Updated 9 months ago
- Utilities for Training Very Large Models☆58Updated 9 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- ☆73Updated 2 months ago
- Load any clip model with a standardized interface☆21Updated last year
- Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch☆89Updated last year
- Explorations into adversarial losses on top of autoregressive loss for language modeling☆37Updated 4 months ago
- ☆50Updated last year
- ☆49Updated 11 months ago
- Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurI…☆90Updated last year
- ☆52Updated last week
- ☆48Updated 10 months ago