mistralai / mistral-commonLinks
Official inference library for pre-processing of Mistral models
☆840Updated last week
Alternatives and similar repositories for mistral-common
Users that are interested in mistral-common are comparing it to the libraries listed below
Sorting:
- ☆446Updated last year
- Train Models Contrastively in Pytorch☆771Updated 9 months ago
- ☆866Updated 2 years ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆686Updated last year
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆936Updated last month
- ☆1,173Updated 3 weeks ago
- ☆695Updated 8 months ago
- Automatically evaluate your LLMs in Google Colab☆679Updated last year
- Gemma 2 optimized for your local machine.☆378Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100s☆723Updated 2 years ago
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,040Updated 8 months ago
- Python client library for Mistral AI platform☆685Updated this week
- ☆3,062Updated last month
- ☆269Updated 6 months ago
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.☆500Updated last year
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤☆1,087Updated 11 months ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆733Updated last year
- Official implementation of Half-Quadratic Quantization (HQQ)☆905Updated 3 weeks ago
- ☆557Updated last year
- ☆584Updated last year
- A bagel, with everything.☆325Updated last year
- Guide for fine-tuning Llama/Mistral/CodeLlama models and more☆644Updated 2 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆423Updated last week
- Evaluate your LLM's response with Prometheus and GPT4 💯☆1,029Updated 8 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,812Updated this week
- The repository for the code of the UltraFastBERT paper☆519Updated last year
- Inference code for Persimmon-8B☆412Updated 2 years ago
- Minimalistic large language model 3D-parallelism training☆2,407Updated last month
- ☆475Updated 2 years ago
- ☆470Updated 2 years ago