Simple and efficient DeepSeek V3 SFT using pipeline parallelism and expert parallelism, with both FP8 and BF16 training
☆118Jul 27, 2025Updated 11 months ago
Alternatives and similar repositories for pipelining-sft
Users that are interested in pipelining-sft are comparing it to the libraries listed below.
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆42Apr 4, 2025Updated last year
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆425Updated this week
- ☆93Jul 5, 2024Updated last year
- Python package for compressing floating-point PyTorch tensors☆13Jul 22, 2024Updated last year
- Lego for GRPO☆30May 27, 2025Updated last year
- Train your own SOTA deductive reasoning model☆112Mar 6, 2025Updated last year
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆57Feb 23, 2026Updated 4 months ago
- Featurized Density Ratio Estimation☆20Jul 11, 2021Updated 4 years ago
- Pipeline parallelism for the minimalist☆39Aug 6, 2025Updated 10 months ago
- ☆17Apr 7, 2025Updated last year
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 6 years ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆32Mar 1, 2025Updated last year
- ☆23Oct 17, 2024Updated last year
- ☆15May 26, 2026Updated last month
- Scalable toolkit for efficient model reinforcement☆1,762Updated this week
- ☆152May 13, 2026Updated last month
- Website with current metrics on the fastest AI models.☆43Nov 13, 2024Updated last year
- ☆35Nov 11, 2025Updated 7 months ago
- TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25☆97Jun 16, 2025Updated last year
- ☆37Nov 26, 2025Updated 7 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆78Apr 2, 2024Updated 2 years ago
- Train speculative decoding models effortlessly and port them smoothly to SGLang serving.☆937Updated this week
- Combining SOAP and MUON☆22Feb 11, 2025Updated last year
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm…☆106Aug 25, 2025Updated 10 months ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆336Apr 24, 2025Updated last year
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆105Jul 19, 2025Updated 11 months ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆604May 13, 2026Updated last month
- 🦀 A Rust implementation of a RoBERTa classification model for the SNLI dataset☆13Sep 13, 2021Updated 4 years ago
- ☆18Jun 23, 2026Updated last week
- ☆24Jun 18, 2024Updated 2 years ago
- Accelerating MoE with IO and Tile-aware Optimizations☆720Updated this week
- Gantry provides an API that streamlines running experiments in Beaker☆33Apr 8, 2026Updated 2 months ago
- Automated Theorem Prover inspired by Aletheia. Claude Code for mathematicians.☆76Apr 20, 2026Updated 2 months ago
- Spectral Sphere Optimizer☆119Mar 23, 2026Updated 3 months ago
- Official PyTorch implementation for "TensorLens: End-to-End Transformer Analysis via High-Order Attention Tensors" [ACL 2026]☆47Apr 14, 2026Updated 2 months ago
- Utility to use ElevenLabs streaming from the command line☆11Aug 8, 2023Updated 2 years ago
- utilities for batched llm calls with retries☆51Jun 12, 2026Updated 2 weeks ago
- An all-in-one place for information and creation of BD class droids from Behold-Urwar Droid Concepts☆13Aug 5, 2025Updated 10 months ago