bigscience-workshop / petalsLinks
πΈ Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
β9,733Updated 10 months ago
Alternatives and similar repositories for petals
Users that are interested in petals are comparing it to the libraries listed below
Sorting:
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMathβ9,436Updated last month
- OpenLLaMA, a permissively licensed open source reproduction of Meta AIβs LLaMA 7B trained on the RedPajama datasetβ7,514Updated 2 years ago
- Running large language models on a single GPU for throughput-oriented scenarios.β9,352Updated 9 months ago
- QLoRA: Efficient Finetuning of Quantized LLMsβ10,583Updated last year
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adβ¦β6,081Updated last month
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.β11,616Updated last week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)β12,288Updated this week
- Locally run an Instruction-Tuned Chat-Style LLMβ10,226Updated 2 years ago
- A guidance language for controlling large language models.β20,533Updated this week
- Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-sourβ¦β2,663Updated 10 months ago
- LLM as a Chatbot Serviceβ3,327Updated last year
- Instruct-tune LLaMA on consumer hardwareβ18,932Updated last year
- A list of totally open alternatives to ChatGPTβ4,672Updated 2 years ago
- A language for constraint-guided and efficient LLM programming.β4,018Updated 2 months ago
- Home of StarCoder: fine-tuning & inference!β7,435Updated last year
- Large Language Model Text Generation Inferenceβ10,367Updated last week
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinksβ6,949Updated last year
- Universal LLM Deployment Engine with ML Compilationβ21,039Updated this week
- Numbers every LLM developer should knowβ4,248Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.β2,891Updated last year
- Resource list for generating JSON using LLMs via function calling, tools, CFG. Libraries, Models, Notebooks, etc.β2,132Updated 5 months ago
- A collection of libraries to optimise AI model performancesβ8,377Updated last year
- SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability vβ¦β8,427Updated this week
- StableLM: Stability AI Language Modelsβ15,824Updated last year
- Tensor library for machine learningβ12,883Updated this week
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)β¦β13,864Updated this week
- The RedPajama-Data repository contains code for preparing large datasets for training large language models.β4,781Updated 7 months ago
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamicalβ¦β37,425Updated 11 months ago
- The simplest way to run LLaMA on your local machineβ13,073Updated last year
- Structured Outputsβ12,188Updated this week