mistralai / mistral-finetune
☆2,852Updated 5 months ago
Alternatives and similar repositories for mistral-finetune:
Users that are interested in mistral-finetune are comparing it to the libraries listed below
- PyTorch native post-training library☆4,856Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,266Updated last week
- ☆1,480Updated this week
- Tools for merging pretrained large language models.☆5,260Updated last week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,362Updated last week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,230Updated last week
- Curated list of datasets and tools for post-training.☆2,698Updated 3 weeks ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,448Updated this week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆4,879Updated 3 weeks ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,635Updated 6 months ago
- Go ahead and axolotl questions☆8,648Updated this week
- ☆806Updated 5 months ago
- Training LLMs with QLoRA + FSDP☆1,451Updated 3 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆2,959Updated this week
- A PyTorch native library for large model training☆3,326Updated this week
- nanoGPT style version of Llama 3.1☆1,316Updated 6 months ago
- Stanford NLP Python library for Representation Finetuning (ReFT)☆1,418Updated 2 weeks ago
- Knowledge Agents and Management in the Cloud☆3,707Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,396Updated this week
- Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.☆2,843Updated this week
- Agentic components of the Llama Stack APIs☆4,140Updated this week
- Robust recipes to align language models with human and AI preferences☆5,001Updated 3 months ago
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆973Updated 2 weeks ago
- Enforce the output format (JSON Schema, Regex etc) of a language model☆1,704Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,295Updated last week
- Fast State-of-the-Art Static Embeddings☆1,060Updated this week
- structured outputs for llms☆9,428Updated this week
- Deploy your agentic worfklows to production☆1,964Updated this week
- Optimizing inference proxy for LLMs☆2,040Updated this week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.☆2,085Updated 3 weeks ago