mistralai / mistral-common
Official inference library for pre-processing of Mistral models
☆830 Updated last week
Alternatives and similar repositories for mistral-common
Users who are interested in mistral-common compare it to the libraries listed below
- ☆446 Updated last year
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling ☆930 Updated last month
- ☆864 Updated 2 years ago
- Fine-tune mistral-7B on 3090s, a100s, h100s ☆722 Updated 2 years ago
- Train Models Contrastively in Pytorch ☆765 Updated 8 months ago
- Gemma 2 optimized for your local machine. ☆378 Updated last year
- A benchmark to evaluate language models on questions I've previously asked them to solve. ☆1,036 Updated 7 months ago
- Automatically evaluate your LLMs in Google Colab ☆675 Updated last year
- ☆693 Updated 7 months ago
- ☆1,152 Updated last year
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch. ☆686 Updated last year
- ☆583 Updated last year
- Python client library for Mistral AI platform ☆682 Updated this week
- Training LLMs with QLoRA + FSDP ☆1,534 Updated last year
- ☆474 Updated last year
- A repository for research on medium sized language models. ☆524 Updated 6 months ago
- The repository for the code of the UltraFastBERT paper ☆520 Updated last year
- ☆267 Updated 5 months ago
- ☆3,055 Updated last month
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆2,202 Updated last week
- A bagel, with everything. ☆325 Updated last year
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆733 Updated last year
- Data and tools for generating and inspecting OLMo pre-training data. ☆1,363 Updated last month
- ☆558 Updated last year
- Inference code for Persimmon-8B ☆412 Updated 2 years ago
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤 ☆1,082 Updated 10 months ago
- A benchmark for emotional intelligence in large language models ☆392 Updated last year
- ☆558 Updated last year
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,780 Updated this week
- Official implementation of Half-Quadratic Quantization (HQQ) ☆902 Updated this week