foundation-model-stack / fm-training-estimatorLinks
Estimate resources needed to train LLMs
β13Updated 4 months ago
Alternatives and similar repositories for fm-training-estimator
Users that are interested in fm-training-estimator are comparing it to the libraries listed below
Sorting:
- π Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.β47Updated this week
- π Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.β11Updated last week
- Helm charts for llm-dβ43Updated this week
- Python library for Synthetic Data Generationβ42Updated this week
- llm-d benchmark scripts and toolingβ17Updated last week
- β12Updated last month
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Dataβ43Updated this week
- β43Updated 3 months ago
- InstaSlice Operator facilitates slicing of accelerators using stable APIsβ38Updated this week
- Python library for Evaluationβ15Updated this week
- β37Updated this week
- GitHub bot to assist with the taxonomy contribution workflowβ16Updated 7 months ago
- Cloud Native Benchmarking of Foundation Modelsβ38Updated 2 weeks ago
- Test Orchestrator for Performance and Scalability of AI pLatformsβ15Updated this week
- Artifacts for the Distributed Workloads stack as part of ODHβ31Updated this week
- An intuitive, easy-to-use python interface for batch resource requesting, access, job submission, and observation. Simplifying the develoβ¦β29Updated last week
- TrustyAI Explainability Toolkitβ42Updated last month
- Core repository for an AI-powered OCP assistant serviceβ51Updated this week
- Repository to demo GPU Sharing with Time Slicing, MPS, MIG and othersβ42Updated 8 months ago
- β19Updated this week
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ60Updated last month
- Model Server for Keplerβ27Updated this week
- Python bindings for TrustyAI's explainability libraryβ16Updated 2 months ago
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharingβ28Updated 6 months ago
- Simplified model deployment on llm-dβ24Updated 3 weeks ago
- Caikit is an AI toolkit that enables users to manage models through a set of developer friendly APIs.β106Updated this week
- Resources, demos, recipes,... to work with LLMs on OpenShift with OpenShift AI or Open Data Hub.β127Updated this week
- This project makes running the InstructLab large language model (LLM) fine-tuning process easy and flexible on OpenShiftβ26Updated this week
- Tutorials and demos related to move2kubeβ12Updated 3 months ago
- Community maintained hardware plugin for vLLM on Spyreβ26Updated this week