Tune efficiently any LLM model from HuggingFace using distributed training (multiple GPU) and DeepSpeed. Uses Ray AIR to orchestrate the training on multiple AWS GPU instances
☆60Jun 20, 2023Updated 2 years ago
Alternatives and similar repositories for LLM-distributed-finetune
Users that are interested in LLM-distributed-finetune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Ray - A curated list of resources: https://github.com/ray-project/ray☆82Oct 21, 2025Updated 7 months ago
- ☆25Jan 2, 2023Updated 3 years ago
- Tracking Ray Enhancement Proposals☆69May 19, 2026Updated last week
- RayLLM - LLMs on Ray (Archived). Read README for more info.☆1,267Mar 13, 2025Updated last year
- Distributed XGBoost on Ray☆154Jun 25, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago
- ☆11Aug 2, 2022Updated 3 years ago
- Run CWTools on your Clausewitz mod PDXScript code in parallel to your builds thanks to GitHub Actions.☆16Mar 10, 2023Updated 3 years ago
- ☆11Mar 16, 2026Updated 2 months ago
- ☆11Apr 5, 2021Updated 5 years ago
- PostgreSQL BM25S extension☆137May 14, 2026Updated 2 weeks ago
- ☆16Apr 3, 2024Updated 2 years ago
- Dependency Parsing as Sequence Labeling with BERT☆13Nov 1, 2020Updated 5 years ago
- SQLGPT is an advanced SQL query generator powered by natural language processing. Seamlessly transforming plain English queries into comp…☆10Oct 24, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization☆22Mar 12, 2025Updated last year
- ☆19Mar 29, 2022Updated 4 years ago
- ☆20Oct 15, 2023Updated 2 years ago
- a fast implementation of BM25☆10Sep 15, 2022Updated 3 years ago
- ☆12Apr 30, 2024Updated 2 years ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆34Apr 1, 2025Updated last year
- Example of applying CUDA graphs to LLaMA-v2☆11Aug 25, 2023Updated 2 years ago
- Pygloo provides Python bindings for Gloo.☆22Jul 7, 2025Updated 10 months ago
- Python Bash emulation for agents, a port of vercel-labs/just-bash☆50Feb 19, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆48Mar 29, 2024Updated 2 years ago
- Fast GPU based tensor core reductions☆13Jan 13, 2023Updated 3 years ago
- Standalone Flash Attention v2 kernel without libtorch dependency☆113Sep 10, 2024Updated last year
- Practice for Machine Learning in Production course☆14Jun 7, 2025Updated 11 months ago
- ☆15Nov 24, 2018Updated 7 years ago
- SeqGAN implementation with Tensorflow☆18Jan 14, 2018Updated 8 years ago
- MPC Server for PySpark inpired by the LakeSail☆18Feb 26, 2026Updated 3 months ago
- socket program to send data with encryption☆13Jun 1, 2021Updated 4 years ago
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆17Aug 4, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- LLCL-MIPS is a superscalar MIPS processor, which supports MIPS Release 1 instructions and is capable of booting linux kernel. (第五届龙芯杯特等奖作…☆40Jan 26, 2022Updated 4 years ago
- Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks"☆26Nov 16, 2023Updated 2 years ago
- Federated Learning - PyTorch☆15Jun 27, 2021Updated 4 years ago
- ⚡ Guidance, samples, and tools for HPC workloads on AKS clusters with RDMA and InfiniBand support, including GPUDirect RDMA.☆23May 19, 2026Updated last week
- Integration between Lance and Ray for distributed data processing☆27May 21, 2026Updated last week
- For advanced physics-driven combined with neural network enhancement force field.☆18Mar 9, 2026Updated 2 months ago
- GEMM by WMMA (tensor core)☆15Jul 31, 2022Updated 3 years ago