huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆257 Updated 10 months ago
Alternatives and similar repositories for llm-swarm
Users that are interested in llm-swarm are comparing it to the libraries listed below
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ☆302 Updated last year
- ☆517 Updated 6 months ago
- Multipack distributed sampler for fast padding-free training of LLMs ☆188 Updated 9 months ago
- The official evaluation suite and dynamic data release for MixEval. ☆241 Updated 6 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆220 Updated 7 months ago
- ☆121 Updated last month
- experiments with inference on llama ☆104 Updated 11 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extreme Lengths (ICLR 2024) ☆202 Updated last year
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆201 Updated 3 weeks ago
- Website for hosting the Open Foundation Models Cheat Sheet. ☆267 Updated 3 weeks ago
- Experiments on speculative sampling with Llama models ☆126 Updated last year
- DSIR large-scale data selection framework for language model training ☆249 Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆254 Updated last year
- A bagel, with everything. ☆320 Updated last year
- Benchmarking LLMs with Challenging Tasks from Real Users ☆223 Updated 6 months ago
- Scaling Data-Constrained Language Models ☆334 Updated 8 months ago
- PyTorch building blocks for the OLMo ecosystem ☆222 Updated this week
- Implementation of the paper "Data Engineering for Scaling Language Models to 128K Context" ☆461 Updated last year
- code for training & evaluating Contextual Document Embedding models ☆191 Updated 2 weeks ago
- Let's build better datasets, together! ☆258 Updated 5 months ago
- awesome synthetic (text) datasets ☆281 Updated 7 months ago
- Pre-training code for Amber 7B LLM ☆166 Updated last year
- Evaluating LLMs with fewer examples ☆156 Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆199 Updated 10 months ago
- This project studies the performance and robustness of language models and task-adaptation methods. ☆150 Updated last year
- LOFT: A 1 Million+ Token Long-Context Benchmark ☆198 Updated last month
- BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach. ☆202 Updated 3 weeks ago
- Code repository for the c-BTM paper ☆106 Updated last year
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research ☆200 Updated this week
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆77 Updated 7 months ago