mesolitica / multimodal-LLMLinks
Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.
☆18Updated last year
Alternatives and similar repositories for multimodal-LLM
Users that are interested in multimodal-LLM are comparing it to the libraries listed below
Sorting:
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆60Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Updated 2 years ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Updated last year
- Simple GRPO scripts and configurations.