foundation-model-stack / fm-training-estimator
Estimate resources needed to train LLMs
☆13 · Updated 2 weeks ago
Alternatives and similar repositories for fm-training-estimator
Users who are interested in fm-training-estimator are comparing it to the libraries listed below.
- Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP. ☆54 · Updated this week
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data ☆44 · Updated this week
- An extendible framework for executing benchmarks and computational experiments at scale ☆34 · Updated this week
- ☆13 · Updated 2 months ago
- Create and deploy virtual-experiments - co-processing computational workflows ☆10 · Updated 5 months ago
- Python library for Synthetic Data Generation ☆51 · Updated 3 weeks ago
- llm-d benchmark scripts and tooling ☆39 · Updated this week
- ☆51 · Updated 4 months ago
- A tool to detect infrastructure issues on cloud native AI systems ☆52 · Updated 3 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆62 · Updated 3 months ago
- Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models. ☆13 · Updated this week
- Community maintained hardware plugin for vLLM on Spyre ☆38 · Updated this week
- Cloud Native Benchmarking of Foundation Models ☆44 · Updated 4 months ago
- Prometheus collector and exporter for Slurm cluster metrics. A Slinky project. ☆14 · Updated last month
- Auto-tuning for vllm. Getting the best performance out of your LLM deployment (vllm+guidellm+optuna) ☆25 · Updated last week
- ☆23 · Updated 3 years ago
- Module, Model, and Tensor Serialization/Deserialization ☆279 · Updated 4 months ago
- Helm charts for llm-d ☆50 · Updated 5 months ago
- ☆273 · Updated this week
- ☆42 · Updated last week
- Simplifying the definition and execution, scaling and deployment of pipelines on the cloud. ☆235 · Updated 2 years ago
- Health checks for Azure N- and H-series VMs. ☆55 · Updated last week
- Run Slurm as a Kubernetes scheduler. A Slinky project. ☆53 · Updated this week
- Python library for Evaluation ☆16 · Updated last week
- A top-like tool for monitoring GPUs in a cluster ☆85 · Updated last year
- A starter kit for evaluating benchmarks on the 🤗 Hub ☆15 · Updated last year
- Benchmark suite for LLMs from Fireworks.ai ☆84 · Updated last month
- IBM Spectrum LSF - IBM Cloud ☆15 · Updated last year
- MAD (Model Automation and Dashboarding) ☆30 · Updated this week
- MLPerf™ logging library ☆37 · Updated this week