Manage scalable open LLM inference endpoints in Slurm clusters
☆287 · Jul 11, 2024 · Updated last year
Alternatives and similar repositories for llm-swarm
Users that are interested in llm-swarm are comparing it to the libraries listed below.
- ☆569 · Nov 20, 2024 · Updated last year
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆2,426 · Updated this week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆3,066 · May 6, 2026 · Updated 2 weeks ago
- Easily embed, cluster and semantically label text datasets ☆605 · Mar 28, 2024 · Updated 2 years ago
- Minimalistic large language model 3D-parallelism training ☆2,698 · Apr 7, 2026 · Updated last month
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆3,229 · May 18, 2026 · Updated last week
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆32 · Sep 19, 2025 · Updated 8 months ago
- Tools for merging pretrained large language models. ☆7,100 · May 6, 2026 · Updated 3 weeks ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models ☆267 · Apr 23, 2024 · Updated 2 years ago
- Robust recipes to align language models with human and AI preferences ☆5,605 · Apr 8, 2026 · Updated last month
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ☆93 · Apr 15, 2026 · Updated last month
- A pytorch quantization backend for optimum ☆1,040 · Apr 2, 2026 · Updated last month
- Automatically derive Python dunder methods for your Rust code ☆26 · Apr 7, 2026 · Updated last month
- Scripts to convert datasets from various sources to Hugging Face Datasets. ☆57 · Oct 26, 2022 · Updated 3 years ago
- ☆110 · Jul 15, 2025 · Updated 10 months ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets. ☆161 · Apr 3, 2024 · Updated 2 years ago
- Scaling Data-Constrained Language Models ☆342 · Jun 28, 2025 · Updated 10 months ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data … ☆861 · Mar 17, 2025 · Updated last year
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/… ☆29 · Apr 17, 2024 · Updated 2 years ago
- DataComp for Language Models ☆1,445 · Sep 9, 2025 · Updated 8 months ago
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters ☆1,913 · Jan 21, 2024 · Updated 2 years ago
- Data and tools for generating and inspecting OLMo pre-training data. ☆1,499 · Nov 5, 2025 · Updated 6 months ago
- ☆1,144 · Jan 10, 2026 · Updated 4 months ago
- A bagel, with everything. ☆326 · Apr 11, 2024 · Updated 2 years ago
- Large Language Model Text Generation Inference ☆10,856 · Mar 21, 2026 · Updated 2 months ago
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization… ☆3,398 · May 18, 2026 · Updated last week
- AllenAI's post-training codebase ☆3,729 · Updated this week
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024] ☆596 · Dec 9, 2024 · Updated last year
- Easy and Efficient Quantization for Transformers ☆205 · Mar 25, 2026 · Updated 2 months ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆599 · May 13, 2026 · Updated last week
- Scalable data pre processing and curation toolkit for LLMs ☆1,576 · Updated this week
- A repository for research on medium sized language models. ☆535 · Jun 6, 2025 · Updated 11 months ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given… ☆15 · Oct 16, 2023 · Updated 2 years ago
- ☆23 · Apr 17, 2026 · Updated last month
- Train transformer language models with reinforcement learning. ☆18,411 · May 19, 2026 · Updated last week
- 🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O… ☆336 · Apr 3, 2026 · Updated last month
- Experiments with generating opensource language model assistants ☆97 · May 14, 2023 · Updated 3 years ago
- All-in-one text de-duplication ☆759 · Mar 9, 2026 · Updated 2 months ago
- Go ahead and axolotl questions ☆11,938 · May 19, 2026 · Updated last week