mesolitica / multimodal-LLM
Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.
☆17Updated last year
Alternatives and similar repositories for multimodal-LLM:
Users that are interested in multimodal-LLM are comparing it to the libraries listed below
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 4 months ago
- Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths☆35Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- ☆48Updated 5 months ago
- ☆32Updated last year
- ☆40Updated 2 months ago
- ☆33Updated 9 months ago
- Code for NeurIPS LLM Efficiency Challenge☆57Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆59Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆42Updated last year
- Merge LLM that are split in to parts☆26Updated last year
- Simple GRPO scripts and configurations.☆58Updated 2 months ago
- ☆24Updated last year
- Implementation of the Mamba SSM with hf_integration.☆56Updated 7 months ago
- Tools for merging pretrained large language models.☆19Updated 10 months ago
- Open TTS models, built for streaming on the edge☆39Updated 3 weeks ago
- Visual RAG using less than 300 lines of code.☆27Updated last year
- Finetune any model on HF in less than 30 seconds☆58Updated last week
- Universal text classifier for generative models☆23Updated 8 months ago
- MEXMA: Token-level objectives improve sentence representations☆40Updated 3 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 8 months ago
- ☆44Updated 2 months ago
- Audio tokenization, in the fastest way possible!☆50Updated 7 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 5 months ago
- ☆20Updated 10 months ago
- Goldfish: Monolingual language models for 350 languages.☆16Updated 7 months ago
- ☆62Updated 8 months ago