Manage scalable open LLM inference endpoints in Slurm clusters
☆287Jul 11, 2024Updated last year
Alternatives and similar repositories for llm-swarm
Users that are interested in llm-swarm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆567Nov 20, 2024Updated last year
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆2,396Apr 17, 2026Updated 2 weeks ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆3,033Apr 20, 2026Updated 2 weeks ago
- Easily embed, cluster and semantically label text datasets☆603Mar 28, 2024Updated 2 years ago
- Minimalistic large language model 3D-parallelism training☆2,674Apr 7, 2026Updated 3 weeks ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,199Apr 27, 2026Updated last week
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 7 months ago
- Tools for merging pretrained large language models.☆7,052Mar 15, 2026Updated last month
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆268Apr 23, 2024Updated 2 years ago
- Robust recipes to align language models with human and AI preferences☆5,593Apr 8, 2026Updated 3 weeks ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆93Apr 15, 2026Updated 3 weeks ago
- A pytorch quantization backend for optimum☆1,038Apr 2, 2026Updated last month
- Automatically derive Python dunder methods for your Rust code☆25Apr 7, 2026Updated 3 weeks ago
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆57Oct 26, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆110Jul 15, 2025Updated 9 months ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆161Apr 3, 2024Updated 2 years ago
- Scaling Data-Constrained Language Models☆343Jun 28, 2025Updated 10 months ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …☆852Mar 17, 2025Updated last year
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated 2 years ago
- DataComp for Language Models☆1,439Sep 9, 2025Updated 7 months ago
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,909Jan 21, 2024Updated 2 years ago
- Data and tools for generating and inspecting OLMo pre-training data.☆1,492Nov 5, 2025Updated 6 months ago
- ☆1,135Jan 10, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A bagel, with everything.☆326Apr 11, 2024Updated 2 years ago
- Large Language Model Text Generation Inference☆10,848Mar 21, 2026Updated last month
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…☆3,376Apr 28, 2026Updated last week
- AllenAI's post-training codebase☆3,708Updated this week
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆593Dec 9, 2024Updated last year
- Easy and Efficient Quantization for Transformers☆206Mar 25, 2026Updated last month
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆600Aug 12, 2025Updated 8 months ago
- A repository for research on medium sized language models.☆537Jun 6, 2025Updated 11 months ago
- Scalable data pre processing and curation toolkit for LLMs☆1,556Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- ☆22Apr 17, 2026Updated 2 weeks ago
- Train transformer language models with reinforcement learning.☆18,282Updated this week
- 🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…☆336Apr 3, 2026Updated last month
- Experiments with generating opensource language model assistants☆97May 14, 2023Updated 2 years ago
- All-in-one text de-duplication☆756Mar 9, 2026Updated last month
- Go ahead and axolotl questions☆11,842Updated this week