bigscience-workshop / petalsLinks
πΈ Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
β9,885Updated last year
Alternatives and similar repositories for petals
Users that are interested in petals are comparing it to the libraries listed below
Sorting:
- QLoRA: Efficient Finetuning of Quantized LLMsβ10,830Updated last year
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)β12,705Updated 2 weeks ago
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMathβ9,475Updated 7 months ago
- The RedPajama-Data repository contains code for preparing large datasets for training large language models.β4,921Updated last year
- Running large language models on a single GPU for throughput-oriented scenarios.β9,380Updated last year
- OpenLLaMA, a permissively licensed open source reproduction of Meta AIβs LLaMA 7B trained on the RedPajama datasetβ7,530Updated 2 years ago
- Instruct-tune LLaMA on consumer hardwareβ18,979Updated last year
- Universal LLM Deployment Engine with ML Compilationβ21,981Updated this week
- Locally run an Instruction-Tuned Chat-Style LLMβ10,187Updated 2 years ago
- Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-sβ¦β2,664Updated last week
- H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/β4,783Updated last week
- Tensor library for machine learningβ13,907Updated this week
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adβ¦β6,093Updated 7 months ago
- Large Language Model Text Generation Inferenceβ10,749Updated 3 weeks ago
- LLM as a Chatbot Serviceβ3,332Updated 2 years ago
- Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLMβ7,876Updated 3 months ago
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 20+ clouds, oβ¦β9,378Updated last week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.β12,080Updated last week
- A language for constraint-guided and efficient LLM programming.β4,139Updated 8 months ago
- A guidance language for controlling large language models.β21,225Updated last week
- Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Dβ¦β12,004Updated 3 months ago
- StableLM: Stability AI Language Modelsβ15,771Updated last year
- The simplest way to run LLaMA on your local machineβ12,997Updated last year
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed librariesβ7,374Updated last month
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamicalβ¦β37,466Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.β39,392Updated 8 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.β2,907Updated 2 years ago
- A collection of libraries to optimise AI model performancesβ8,354Updated last year
- Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.β3,713Updated last year
- OpenChat: Advancing Open-source Language Models with Imperfect Dataβ5,471Updated last year