mesolitica / multimodal-LLMLinks
Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.
☆18Updated last year
Alternatives and similar repositories for multimodal-LLM
Users that are interested in multimodal-LLM are comparing it to the libraries listed below
Sorting:
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆35Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- Library to facilitate pruning of LLMs based on context☆32Updated last year
- ☆51Updated 9 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆50Updated last year
- ☆51Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆69Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆65Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated last year
- ☆55Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆44Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- Entailment self-training☆25Updated 2 years ago
- Multi-Domain Expert Learning☆66Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- Simple GRPO scripts and configurations.☆59Updated 9 months ago
- Universal text classifier for generative models☆25Updated last year
- ☆29Updated 3 months ago
- ☆48Updated last year
- Collection of autoregressive model implementation☆86Updated 6 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆35Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 2 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆27Updated 11 months ago
- ☆39Updated last year
- ☆49Updated 2 years ago
- 🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)☆25Updated 2 years ago
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever☆49Updated last week
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆61Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 9 months ago