mistralai / mistral-finetune
☆3,012 Updated 11 months ago
Alternatives and similar repositories for mistral-finetune
Users who are interested in mistral-finetune are comparing it to the libraries listed below.
- ☆1,967 Updated last week
- Official inference library for pre-processing of Mistral models ☆788 Updated last week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,658 Updated 3 months ago
- PyTorch native post-training library ☆5,458 Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆2,867 Updated last week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆3,091 Updated last week
- Tools for merging pretrained large language models. ☆6,258 Updated 3 weeks ago
- Training LLMs with QLoRA + FSDP ☆1,527 Updated 10 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆3,405 Updated 3 months ago
- Robust recipes to align language models with human and AI preferences ☆5,343 Updated last month
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,580 Updated 2 weeks ago
- Things you can do with the token embeddings of an LLM ☆1,447 Updated 5 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality ☆4,255 Updated last year
- Collection of notebook guides created by the Brev.dev team! ☆1,791 Updated 2 weeks ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆5,407 Updated 6 months ago
- Agentic components of the Llama Stack APIs ☆4,272 Updated last month
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆1,866 Updated this week
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜 ☆1,586 Updated 3 weeks ago
- Go ahead and axolotl questions ☆10,365 Updated this week
- Stanford NLP Python library for Representation Finetuning (ReFT) ☆1,511 Updated 7 months ago
- ☆1,071 Updated 11 months ago
- Easy token price estimates for 400+ LLMs. TokenOps. ☆1,789 Updated this week
- Curated list of datasets and tools for post-training. ☆3,671 Updated last month
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤 ☆1,055 Updated 7 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a… ☆1,564 Updated 8 months ago
- Everything about the SmolLM and SmolVLM family of models ☆3,190 Updated 3 weeks ago
- A benchmark to evaluate language models on questions I've previously asked them to solve. ☆1,030 Updated 4 months ago
- The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML. ☆3,547 Updated last week
- ☆679 Updated 4 months ago
- Minimalistic large language model 3D-parallelism training ☆2,180 Updated last week