Manage scalable open LLM inference endpoints in Slurm clusters
☆284Jul 11, 2024Updated last year
Alternatives and similar repositories for llm-swarm
Users that are interested in llm-swarm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆567Nov 20, 2024Updated last year
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆2,374Apr 7, 2026Updated last week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,983Updated this week
- Easily embed, cluster and semantically label text datasets☆601Mar 28, 2024Updated 2 years ago
- Minimalistic large language model 3D-parallelism training☆2,644Apr 7, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆3,158Apr 6, 2026Updated last week
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 6 months ago
- Tools for merging pretrained large language models.☆6,973Mar 15, 2026Updated last month
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆266Apr 23, 2024Updated last year
- Robust recipes to align language models with human and AI preferences☆5,558Apr 8, 2026Updated last week
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆93Apr 7, 2026Updated last week
- A pytorch quantization backend for optimum☆1,036Apr 2, 2026Updated 2 weeks ago
- Automatically derive Python dunder methods for your Rust code☆25Apr 7, 2026Updated last week
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆57Oct 26, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆109Jul 15, 2025Updated 9 months ago
- Scaling Data-Constrained Language Models☆343Jun 28, 2025Updated 9 months ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆161Apr 3, 2024Updated 2 years ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …☆843Mar 17, 2025Updated last year
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated 2 years ago
- DataComp for Language Models☆1,436Sep 9, 2025Updated 7 months ago
- Data and tools for generating and inspecting OLMo pre-training data.☆1,476Nov 5, 2025Updated 5 months ago
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,907Jan 21, 2024Updated 2 years ago
- ☆1,126Jan 10, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A bagel, with everything.☆326Apr 11, 2024Updated 2 years ago
- Large Language Model Text Generation Inference☆10,830Mar 21, 2026Updated 3 weeks ago
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…☆3,354Apr 2, 2026Updated 2 weeks ago
- AllenAI's post-training codebase☆3,683Updated this week
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆592Dec 9, 2024Updated last year
- Easy and Efficient Quantization for Transformers☆206Mar 25, 2026Updated 3 weeks ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆600Aug 12, 2025Updated 8 months ago
- A repository for research on medium sized language models.☆537Jun 6, 2025Updated 10 months ago
- Scalable data pre processing and curation toolkit for LLMs☆1,520Apr 9, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- Train transformer language models with reinforcement learning.☆18,054Updated this week
- ☆22Apr 7, 2026Updated last week
- 🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…☆334Apr 3, 2026Updated last week
- All-in-one text de-duplication☆750Mar 9, 2026Updated last month
- Experiments with generating opensource language model assistants☆97May 14, 2023Updated 2 years ago
- Efficient few-shot learning with Sentence Transformers☆2,710Apr 2, 2026Updated 2 weeks ago