foundation-model-stack / fm-training-estimatorLinks
Estimate resources needed to train LLMs
β13Updated 5 months ago
Alternatives and similar repositories for fm-training-estimator
Users that are interested in fm-training-estimator are comparing it to the libraries listed below
Sorting:
- π Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.β47Updated this week
- llm-d benchmark scripts and toolingβ21Updated this week
- Create and deploy virtual-experiments - co-processing computational workflowsβ10Updated 3 weeks ago
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Dataβ42Updated this week
- β47Updated last week
- Python library for Synthetic Data Generationβ42Updated this week
- β12Updated this week
- Cloud Native Benchmarking of Foundation Modelsβ39Updated last week
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ61Updated 3 months ago
- Bridge operator repoβ21Updated 3 months ago
- β232Updated this week
- A tool to detect infrastructure issues on cloud native AI systemsβ44Updated 2 weeks ago
- π Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.β11Updated last month
- β23Updated 3 years ago
- Python library for Evaluationβ15Updated this week
- Simplifying the definition and execution, scaling and deployment of pipelines on the cloud.β233Updated last year
- β38Updated this week
- Machine Learning Agility (MLAgility) benchmark and benchmarking toolsβ39Updated last week
- How to build an ACP compliant agent that uses MCP as well!β11Updated 3 months ago
- A top-like tool for monitoring GPUs in a clusterβ85Updated last year
- LM engine is a library for pretraining/finetuning LLMsβ61Updated this week
- Benchmark suite for LLMs from Fireworks.aiβ76Updated last week
- Example ML projects that use the Determined library.β32Updated 10 months ago
- Community maintained hardware plugin for vLLM on Spyreβ30Updated last week
- Large Language Model Text Generation Inference on Habana Gaudiβ34Updated 4 months ago
- Helm charts for llm-dβ51Updated 2 weeks ago
- This repository contains the results and code for the MLPerfβ’ Training v4.0 benchmark.β12Updated last year
- GitHub bot to assist with the taxonomy contribution workflowβ17Updated 9 months ago
- NVIDIA NCCL Tests for Distributed Trainingβ102Updated 2 weeks ago
- This is the open source version of HPL-MXP. The code performance has been verified on Frontierβ17Updated last month