Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings
☆117Jul 27, 2025Updated 8 months ago
Alternatives and similar repositories for pipelining-sft
Users that are interested in pipelining-sft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆382Mar 23, 2026Updated last week
- ☆92Jul 5, 2024Updated last year
- Python package for compressing floating-point PyTorch tensors☆13Jul 22, 2024Updated last year
- Train your own SOTA deductive reasoning model☆108Mar 6, 2025Updated last year
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆54Feb 23, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆17Apr 7, 2025Updated 11 months ago
- ☆117Jan 4, 2026Updated 2 months ago
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- ☆14Feb 12, 2024Updated 2 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆28Mar 1, 2025Updated last year
- ☆23Oct 17, 2024Updated last year
- Scalable toolkit for efficient model reinforcement☆1,447Mar 22, 2026Updated last week
- noise reduction☆17Jul 3, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.☆114Jun 4, 2025Updated 9 months ago
- Train speculative decoding models effortlessly and port them smoothly to SGLang serving.☆736Mar 21, 2026Updated last week
- ☆34Nov 26, 2025Updated 4 months ago
- ☆34Nov 11, 2025Updated 4 months ago
- TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25☆90Jun 16, 2025Updated 9 months ago
- Software relating to relational empirical risk minimization☆16Jun 12, 2021Updated 4 years ago
- ☆43Jan 27, 2026Updated 2 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆79Apr 2, 2024Updated last year
- Combining SOAP and MUON☆19Feb 11, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm…☆102Aug 25, 2025Updated 7 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆102Jul 19, 2025Updated 8 months ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆334Apr 24, 2025Updated 11 months ago
- Codebase for running (conditional) probing experiments☆21Nov 13, 2022Updated 3 years ago
- Example of how to use R in Jupyter notebooks and make compatible with Binder☆17Feb 25, 2019Updated 7 years ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆598Aug 12, 2025Updated 7 months ago
- Learning Formal Mathematics from Intrinsic Motivation☆37Jul 10, 2025Updated 8 months ago
- Gantry provides an API that streamlines running experiments in Beaker☆33Mar 11, 2026Updated 2 weeks ago
- ☆18Mar 13, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆23Jun 18, 2024Updated last year
- utilities for batched llm calls with retries☆49Mar 17, 2026Updated last week
- ☆93Oct 30, 2025Updated 5 months ago
- Utility to use eleven lab's streaming to in the command line☆11Aug 8, 2023Updated 2 years ago
- Open-Ended Speaking Style Modeling via Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training☆70Feb 7, 2026Updated last month
- A vim plugin used to auto insert code comment header block☆16Apr 4, 2013Updated 12 years ago
- A framework for majority vote classifiers allowing for computation of PAC Bayesian risk bounds.☆13Feb 9, 2023Updated 3 years ago