huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆236 · Updated 4 months ago
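The tagline above describes launching and managing inference servers across Slurm nodes. As a purely illustrative sketch of that pattern (this is not llm-swarm's actual API; the function name, flags, and the use of text-generation-inference's launcher are assumptions), one can generate a minimal Slurm batch script that starts one inference server per allocated node:

```python
def make_sbatch_script(job_name: str, nodes: int, model_id: str, port: int = 8080) -> str:
    """Build a minimal Slurm batch script that launches one inference
    server per node. Illustrative only: llm-swarm's real templates and
    options differ; the server command shown is text-generation-inference's
    launcher, used here as a stand-in."""
    lines = [
        "#!/bin/bash",
        f"#SBATCH --job-name={job_name}",
        f"#SBATCH --nodes={nodes}",
        "#SBATCH --gpus-per-node=1",
        "#SBATCH --ntasks-per-node=1",
        # srun runs one launcher per allocated node; each binds the given port.
        f"srun text-generation-launcher --model-id {model_id} --port {port}",
    ]
    return "\n".join(lines)

print(make_sbatch_script("llm-swarm-demo", 2, "meta-llama/Llama-2-7b-hf"))
```

A manager process would then submit this script with `sbatch`, wait for the endpoints to come up, and load-balance requests across them, which is the bookkeeping a tool like llm-swarm automates.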
Related projects
Alternatives and complementary repositories for llm-swarm
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Models ☆180 · Updated 3 weeks ago
- awesome synthetic (text) datasets ☆242 · Updated 3 weeks ago
- Multipack distributed sampler for fast padding-free training of LLMs ☆178 · Updated 3 months ago
- experiments with inference on llama ☆105 · Updated 5 months ago
- An Open Source Toolkit For LLM Distillation ☆356 · Updated 2 months ago
- The official evaluation suite and dynamic data release for MixEval ☆224 · Updated last week
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extreme Lengths (ICLR 2024) ☆199 · Updated 6 months ago
- Scaling Data-Constrained Language Models ☆321 · Updated last month
- code for training & evaluating Contextual Document Embedding models ☆117 · Updated this week
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models ☆196 · Updated 6 months ago
- BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach ☆152 · Updated last week
- Benchmarking LLMs with Challenging Tasks from Real Users ☆195 · Updated 2 weeks ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ☆293 · Updated 11 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients ☆173 · Updated 4 months ago
- A bagel, with everything. ☆312 · Updated 7 months ago
- Website for hosting the Open Foundation Models Cheat Sheet ☆257 · Updated 4 months ago
- A pipeline to improve skills of large language models ☆191 · Updated this week
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models" ☆177 · Updated last month
- A pipeline for LLM knowledge distillation ☆78 · Updated 3 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B, completely for free ☆221 · Updated 2 weeks ago
- Lightweight demos for finetuning LLMs, powered by 🤗 transformers and open-source datasets ☆64 · Updated last month
- Repo for "Rho-1: Token-level Data Selection & Selective Pretraining of LLMs" ☆307 · Updated 7 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models ☆246 · Updated 2 weeks ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding" (ACL 2024) ☆229 · Updated 3 weeks ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆232 · Updated 5 months ago