ariG23498 / mmdpLinks
☆34Updated 6 months ago
Alternatives and similar repositories for mmdp
Users that are interested in mmdp are comparing it to the libraries listed below
Sorting:
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆23Updated last year
- ☆59Updated last year
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆97Updated last year
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆160Updated last year
- Load any clip model with a standardized interface☆22Updated 2 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆31Updated last year
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆62Updated last year
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆54Updated 11 months ago
- A dashboard for exploring timm learning rate schedulers☆19Updated last year
- ☆65Updated 2 years ago
- Timm model explorer☆42Updated last year
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆84Updated 5 months ago
- MEXMA: Token-level objectives improve sentence representations☆42Updated last year
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆72Updated last year
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated last year
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆18Updated last year
- ☆87Updated 2 years ago
- ☆48Updated last year
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆103Updated 2 years ago
- ☆52Updated last year
- ☆91Updated last year
- DPO, but faster 🚀☆46Updated last year
- M4 experiment logbook☆58Updated 2 years ago
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"☆102Updated last year
- Official implementation of the ICML 2024 paper RoSA (Robust Adaptation)☆44Updated last year
- Easily run PyTorch on multiple GPUs & machines☆57Updated last week
- ☆80Updated last year
- ☆191Updated last year
- MatFormer repo☆68Updated last year
- LL3M: Large Language and Multi-Modal Model in Jax☆73Updated last year