mistralai / mistral-inference

Official inference library for Mistral models

☆10,516

Alternatives and similar repositories for mistral-inference

Users who are interested in mistral-inference are comparing it to the libraries listed below.

artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,719 Updated last year
huggingface / text-generation-inference
Large Language Model Text Generation Inference
☆10,605 Updated last month
jzhang38 / TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆8,780 Updated last year
imoneoi / openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
☆5,436 Updated last year
meta-llama / llama3
The official Meta Llama 3 GitHub site
☆29,055 Updated 9 months ago
mit-han-lab / streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆7,094 Updated last year
bitsandbytes-foundation / bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
☆7,687 Updated last week
abetlen / llama-cpp-python
Python bindings for llama.cpp
☆9,678 Updated 2 months ago
Dao-AILab / flash-attention
Fast and memory-efficient exact attention
☆20,151 Updated this week
tatsu-lab / stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆30,190 Updated last year
nlpxucan / WizardLM
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
☆9,459 Updated 4 months ago
openlm-research / open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
☆7,523 Updated 2 years ago
tloen / alpaca-lora
Instruct-tune LLaMA on consumer hardware
☆18,977 Updated last year
meta-llama / llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…
☆17,979 Updated this week
karpathy / llama2.c
Inference Llama 2 in one file of pure C
☆18,872 Updated last year
arcee-ai / mergekit
Tools for merging pretrained large language models.
☆6,394 Updated last month
NVIDIA / TensorRT-LLM
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat…
☆11,955 Updated this week
huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆19,900 Updated this week
Stability-AI / StableLM
StableLM: Stability AI Language Models
☆15,797 Updated last year
openai / transformer-debugger
☆4,101 Updated last year
huggingface / trl
Train transformer language models with reinforcement learning.
☆16,012 Updated this week
axolotl-ai-cloud / axolotl
Go ahead and axolotl questions
☆10,673 Updated this week
ShishirPatil / gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
☆12,500 Updated last week
meta-pytorch / gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
☆6,135 Updated 2 months ago
togethercomputer / RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
☆4,834 Updated 10 months ago
EleutherAI / lm-evaluation-harness
A framework for few-shot evaluation of language models.
☆10,433 Updated last week
Lightning-AI / lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…
☆6,079 Updated 3 months ago
mlc-ai / mlc-llm
Universal LLM Deployment Engine with ML Compilation
☆21,527 Updated this week
lm-sys / FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆39,185 Updated 4 months ago
meta-pytorch / torchtune
PyTorch native post-training library
☆5,547 Updated last week