π Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
β57Apr 22, 2026Updated last month
Alternatives and similar repositories for fms-hf-tuning
Users that are interested in fms-hf-tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Estimate resources needed to train LLMsβ14Feb 10, 2026Updated 3 months ago
- π Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.β14Jan 30, 2026Updated 3 months ago
- β25Sep 9, 2024Updated last year
- π Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.β222May 20, 2026Updated last week
- π Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flashβ¦β286Nov 24, 2025Updated 6 months ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A tool to post-process json trace files for IBM-AIU performance analysis. It enhances the traces with additional statistics extracted froβ¦β12May 13, 2026Updated 2 weeks ago
- Maximal Update Parametrization (ΞΌP) with Flax & Optax.β16Dec 27, 2023Updated 2 years ago
- A Command Line Tool to create shareable development workspaces instantly on different Linux distributions irrespective of host operating β¦β11Dec 22, 2019Updated 6 years ago
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ64Sep 18, 2025Updated 8 months ago
- Open source project for data preparation for GenAI applicationsβ932May 15, 2026Updated 2 weeks ago
- InstaSlice Operator facilitates slicing of accelerators using stable APIsβ51May 21, 2026Updated last week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharingβ30Nov 27, 2024Updated last year
- Example for EJB remoting in Wildflyβ21Feb 11, 2021Updated 5 years ago
- An end to end ML project. Using MLflow for experiment tracking and model registry. Prefect for workflow orchestration. S3 for artifacts sβ¦β12Sep 11, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Desktop as a Containerβ12Aug 2, 2022Updated 3 years ago
- [EMNLP 2025 Findings] Familiarity-aware Evidence Compression for Retrieval Augmented Generationβ15Aug 20, 2025Updated 9 months ago
- Streaming data workshop with Infinispan, Vert.x and OpenShiftβ12Apr 29, 2018Updated 8 years ago
- This repo contains the follow-along student instructions for the lab. https://rhoai-mlops.github.io/lab-instructions/β15Apr 9, 2026Updated last month
- π¦ An Android based contact tracing app which enables people to self-isolate if they have been in close proximity to someone tested positβ¦β13Jan 9, 2023Updated 3 years ago
- test images with not appropriate labels in MNIST datasetβ10Mar 3, 2018Updated 8 years ago
- CraftML is a restful web service for easy pipeline creation without code.β13Apr 18, 2021Updated 5 years ago
- Basic load generation for apm-server built on heyβ16Aug 8, 2024Updated last year
- Train, tune, and infer Bamba modelβ137May 15, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [Findings of ACL 2023] Communication Efficient Federated Learning for Multilingual Machine Translation with Adapterβ12Sep 4, 2023Updated 2 years ago
- Community maintained hardware plugin for vLLM on Spyreβ52May 21, 2026Updated last week
- β16Jan 21, 2025Updated last year
- Compile WASM binaries to Javascript code.β31Feb 6, 2024Updated 2 years ago
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.β11Mar 1, 2024Updated 2 years ago
- Expanded KR-BERT by adding more training dataβ13Apr 23, 2021Updated 5 years ago
- β18Nov 11, 2025Updated 6 months ago
- GoldFinch and other hybrid transformer componentsβ13Dec 9, 2025Updated 5 months ago
- A mini book for java unit testingβ13Dec 12, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A performance testing and analysis automation frameworkβ15May 21, 2026Updated last week
- Code for the paper "Pretrained Models for Multilingual Federated Learning" at NAACL 2022β11Aug 9, 2022Updated 3 years ago
- Traffic Light recognition using FasterRCNN in Pytorchβ11Jul 23, 2023Updated 2 years ago
- ζΈ εε€§ε¦ε¦ηε₯εΊ·εεΊθ‘ζ ε΅ζ₯εζ―ζ₯θͺε¨ζδΊ€β14Jan 30, 2021Updated 5 years ago
- Training PyTorch Faster-RCNN on custom datasetβ14Jun 2, 2021Updated 4 years ago
- A variational autoencoder for text processing using 1D convolutions and the FastText word embeddingsβ12Dec 11, 2022Updated 3 years ago
- A cpp client for the Etsy StatsD server.β14Nov 24, 2025Updated 6 months ago