🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
☆57 · Mar 30, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for fms-hf-tuning
Users that are interested in fms-hf-tuning are comparing it to the libraries listed below.
- Estimate resources needed to train LLMs ☆14 · Feb 10, 2026 · Updated 2 months ago
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models. ☆14 · Jan 30, 2026 · Updated 2 months ago
- Synthetic Data Generation for Foundation Models ☆21 · Nov 10, 2025 · Updated 5 months ago
- Tutorials and demos related to move2kube ☆13 · Mar 6, 2025 · Updated last year
- A collection of apps ideal for migration demos ☆16 · Oct 14, 2025 · Updated 6 months ago
- Client samples for IBM Cloud SQL Query service ☆12 · Mar 4, 2024 · Updated 2 years ago
- ☆11 · May 4, 2022 · Updated 3 years ago
- Source code for Activated LoRA ☆25 · Nov 22, 2025 · Updated 4 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆63 · Sep 18, 2025 · Updated 6 months ago
- ☆43 · Updated this week
- A GPT-powered AI auto scraper for websites. AI Web Scraping made easy. ☆14 · Jun 26, 2023 · Updated 2 years ago
- this is based on the paper Chain-of-Retrieval Augmented Generation ☆14 · Mar 29, 2025 · Updated last year
- In-depth code associated with my Medium blog post, "How to Load PyTorch Models 340 Times Faster with Ray" ☆28 · Sep 2, 2022 · Updated 3 years ago
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing ☆30 · Nov 27, 2024 · Updated last year
- Simply `curl https://installer.to/name | bash` to install any tool ☆16 · Dec 10, 2022 · Updated 3 years ago
- Desktop as a Container ☆12 · Aug 2, 2022 · Updated 3 years ago
- [EMNLP 2025 Findings] Familiarity-aware Evidence Compression for Retrieval Augmented Generation ☆15 · Aug 20, 2025 · Updated 7 months ago
- Observability Volume Management ☆41 · Mar 19, 2025 · Updated last year
- Toonification of real face images using PyTorch, Stylegan2 and Image-to-Image translation ☆13 · Jun 14, 2022 · Updated 3 years ago
- This repo contains the follow-along student instructions for the lab. https://rhoai-mlops.github.io/lab-instructions/ ☆15 · Mar 13, 2026 · Updated last month
- ☆12 · Dec 14, 2024 · Updated last year
- ☆19 · Mar 23, 2025 · Updated last year
- test images with not appropriate labels in MNIST dataset ☆10 · Mar 3, 2018 · Updated 8 years ago
- CraftML is a restful web service for easy pipeline creation without code. ☆13 · Apr 18, 2021 · Updated 4 years ago
- GoldFinch and other hybrid transformer components ☆12 · Dec 9, 2025 · Updated 4 months ago
- Basic load generation for apm-server built on hey ☆16 · Aug 8, 2024 · Updated last year
- [Findings of ACL 2023] Communication Efficient Federated Learning for Multilingual Machine Translation with Adapter ☆12 · Sep 4, 2023 · Updated 2 years ago
- CoCoFL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization ☆13 · Aug 3, 2024 · Updated last year
- Experiment with NVIDIA Triton and Whisper ☆15 · Apr 29, 2024 · Updated last year
- ☆15 · Jan 21, 2025 · Updated last year
- [ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation ☆10 · May 19, 2025 · Updated 10 months ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs. ☆19 · Jun 27, 2024 · Updated last year
- ☆17 · Dec 11, 2024 · Updated last year
- Eden Flux LoRA trainer and full-finetuning ☆23 · Mar 21, 2025 · Updated last year
- Expanded KR-BERT by adding more training data ☆13 · Apr 23, 2021 · Updated 4 years ago
- ☆18 · Nov 11, 2025 · Updated 5 months ago
- ☆26 · Dec 8, 2025 · Updated 4 months ago
- Mathematical Analysis (and functional analysis) ☆14 · Feb 1, 2022 · Updated 4 years ago
- For converting LLM datasets from one format into another. ☆22 · Nov 12, 2025 · Updated 5 months ago