bigscience-workshop / petalsLinks
πΈ Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
β9,858Updated last year
Alternatives and similar repositories for petals
Users that are interested in petals are comparing it to the libraries listed below
Sorting:
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, oβ¦β9,113Updated this week
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMathβ9,472Updated 6 months ago
- OpenLLaMA, a permissively licensed open source reproduction of Meta AIβs LLaMA 7B trained on the RedPajama datasetβ7,527Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMsβ10,786Updated last year
- Instruct-tune LLaMA on consumer hardwareβ18,986Updated last year
- Locally run an Instruction-Tuned Chat-Style LLMβ10,194Updated 2 years ago
- A collection of libraries to optimise AI model performancesβ8,363Updated last year
- Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-sβ¦β2,664Updated 3 weeks ago
- A guidance language for controlling large language models.β21,008Updated last week
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adβ¦β6,091Updated 5 months ago
- Large Language Model Text Generation Inferenceβ10,709Updated last week
- A Bulletproof Way to Generate Structured JSON from Language Modelsβ4,859Updated last year
- Universal LLM Deployment Engine with ML Compilationβ21,750Updated last week
- Python bindings for llama.cppβ9,833Updated 4 months ago
- A language for constraint-guided and efficient LLM programming.β4,101Updated 7 months ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)β12,626Updated last week
- Running large language models on a single GPU for throughput-oriented scenarios.β9,383Updated last year
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.β12,011Updated this week
- StableLM: Stability AI Language Modelsβ15,787Updated last year
- Tensor library for machine learningβ13,743Updated last week
- Numbers every LLM developer should knowβ4,277Updated last year
- A fast inference library for running LLMs locally on modern consumer-class GPUsβ4,394Updated 2 weeks ago
- Simple UI for LLM Model Finetuningβ2,063Updated 2 years ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.β2,905Updated 2 years ago
- The RedPajama-Data repository contains code for preparing large datasets for training large language models.β4,905Updated last year
- LLM as a Chatbot Serviceβ3,338Updated 2 years ago
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinksβ7,160Updated last year
- Home of StarCoder: fine-tuning & inference!β7,515Updated last year
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamicalβ¦β37,492Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.β39,320Updated 6 months ago