mistralai / mistral-finetune
☆2,951 Updated 8 months ago
Alternatives and similar repositories for mistral-finetune
Users that are interested in mistral-finetune are comparing it to the libraries listed below
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆2,712 Updated this week
- PyTorch native post-training library ☆5,217 Updated this week
- Tools for merging pretrained large language models. ☆5,754 Updated last week
- ☆1,735 Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,473 Updated 2 weeks ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆2,985 Updated last week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality ☆3,968 Updated 9 months ago
- Go ahead and axolotl questions ☆9,470 Updated this week
- Stanford NLP Python library for Representation Finetuning (ReFT) ☆1,476 Updated 3 months ago
- ☆719 Updated last week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,390 Updated this week
- Training LLMs with QLoRA + FSDP ☆1,478 Updated 6 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜 ☆1,461 Updated 3 weeks ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆1,563 Updated last week
- Evaluate your LLM's response with Prometheus and GPT4 💯 ☆945 Updated last month
- Curated list of datasets and tools for post-training. ☆3,080 Updated 4 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆8,504 Updated last year
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆3,051 Updated last week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆5,111 Updated 2 months ago
- Chat language model that can use tools and interpret the results ☆1,553 Updated 3 weeks ago
- Superfast AI decision making and intelligent processing of multi-modal data. ☆2,604 Updated 3 weeks ago
- Harness LLMs with Multi-Agent Programming ☆3,349 Updated this week
- AllenAI's post-training codebase ☆2,986 Updated this week
- Robust recipes to align language models with human and AI preferences ☆5,196 Updated last month
- Everything about the SmolLM2 and SmolVLM family of models ☆2,442 Updated 2 months ago
- Optimizing inference proxy for LLMs ☆2,427 Updated this week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,420 Updated last week
- AdalFlow: The library to build & auto-optimize LLM applications. ☆3,073 Updated 2 months ago
- Convert Compute And Books Into Instruct-Tuning Datasets! Makes: QA, RP, Classifiers. ☆1,460 Updated 3 months ago
- Python bindings for the Transformer models implemented in C/C++ using GGML library. ☆1,864 Updated last year