Efficiently fine-tune any LLM from Hugging Face using distributed training (multiple GPUs) and DeepSpeed. Uses Ray AIR to orchestrate training across multiple AWS GPU instances.
☆60 · Jun 20, 2023 · Updated 2 years ago
Alternatives and similar repositories for LLM-distributed-finetune
Users who are interested in LLM-distributed-finetune are comparing it to the libraries listed below.
- Ray - A curated list of resources: https://github.com/ray-project/ray ☆80 · Oct 21, 2025 · Updated 5 months ago
- ☆25 · Jan 2, 2023 · Updated 3 years ago
- RayLLM - LLMs on Ray (Archived). Read README for more info. ☆1,267 · Mar 13, 2025 · Updated last year
- Training Recipes for Agentic Reinforcement Learning in LLMs: A Survey ☆24 · Jan 30, 2026 · Updated 2 months ago
- Distributed XGBoost on Ray ☆154 · Jun 25, 2024 · Updated last year
- Some microbenchmarks and design docs before commencement ☆11 · Feb 1, 2021 · Updated 5 years ago
- ☆11 · Aug 2, 2022 · Updated 3 years ago
- LightGBM on Ray ☆51 · Feb 4, 2024 · Updated 2 years ago
- Do we need rebalancing strategies? A theoretical and empirical study around SMOTE and its variants (Sakho, Malherbe and Scornet; 2024) ☆11 · Sep 2, 2025 · Updated 7 months ago
- ☆11 · Apr 5, 2021 · Updated 5 years ago
- Saliency calculation module for Chainer ☆12 · May 28, 2019 · Updated 6 years ago
- ☆44 · Sep 6, 2021 · Updated 4 years ago
- ☆16 · Apr 3, 2024 · Updated 2 years ago
- SQLGPT is an advanced SQL query generator powered by natural language processing. Seamlessly transforming plain English queries into comp… ☆10 · Oct 24, 2023 · Updated 2 years ago
- Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization ☆22 · Mar 12, 2025 · Updated last year
- Reproduction of the libsmctrl paper, with an added Python interface that can be called flexibly from Python to allocate compute resources ☆12 · May 21, 2024 · Updated last year
- Multi-agent simulator in Jax for research and teaching in AI & ALife ☆31 · Updated this week
- Linear-chain LSTM-CRFs and Convolutional CRFs in PyTorch. ☆22 · Aug 11, 2017 · Updated 8 years ago
- Tutorial notebooks for SciFM24 ☆11 · Apr 2, 2024 · Updated 2 years ago
- [ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts? ☆16 · Aug 8, 2023 · Updated 2 years ago
- Batch Multi-Fidelity Bayesian Optimization with Deep Auto-Regressive Networks ☆12 · Nov 3, 2021 · Updated 4 years ago
- ☆14 · Mar 2, 2023 · Updated 3 years ago
- A fast implementation of BM25 ☆10 · Sep 15, 2022 · Updated 3 years ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models ☆33 · Apr 1, 2025 · Updated last year
- Example of applying CUDA graphs to LLaMA-v2 ☆11 · Aug 25, 2023 · Updated 2 years ago
- Pygloo provides Python bindings for Gloo. ☆22 · Jul 7, 2025 · Updated 9 months ago
- Open-source Claude Code agent multiplexer: run dozens of parallel AI coding agents unattended via tmux ☆125 · Updated this week
- Fast GPU based tensor core reductions ☆13 · Jan 13, 2023 · Updated 3 years ago
- This repository contains the results and code for the MLPerf™ Inference v4.0 benchmark. ☆11 · Jul 24, 2025 · Updated 8 months ago
- ☆11 · May 4, 2022 · Updated 3 years ago
- Transformers at any scale ☆42 · Jan 18, 2024 · Updated 2 years ago
- ☆15 · Nov 24, 2018 · Updated 7 years ago
- Source Code for <Target-Side Data Augmentation for Sequence Generation> ☆12 · Oct 6, 2021 · Updated 4 years ago
- SeqGAN implementation with Tensorflow ☆18 · Jan 14, 2018 · Updated 8 years ago
- Flyte Backend Plugins contributed by the Flyte community. ☆29 · Oct 9, 2023 · Updated 2 years ago
- Notebooks for the O'Reilly book "Learning Ray" ☆351 · Apr 25, 2024 · Updated last year
- MPC Server for PySpark inspired by the LakeSail ☆18 · Feb 26, 2026 · Updated last month
- Set up PyTorch on Android ☆12 · Mar 2, 2020 · Updated 6 years ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations ☆33 · Jun 1, 2023 · Updated 2 years ago