Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings
☆117Jul 27, 2025Updated 9 months ago
Alternatives and similar repositories for pipelining-sft
Users that are interested in pipelining-sft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆42Apr 4, 2025Updated last year
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates.☆403Updated this week
- ☆93Jul 5, 2024Updated last year
- Python package for compressing floating-point PyTorch tensors☆13Jul 22, 2024Updated last year
- Lego for GRPO☆30May 27, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Train your own SOTA deductive reasoning model☆110Mar 6, 2025Updated last year
- ☆67Apr 18, 2026Updated last week
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆56Feb 23, 2026Updated 2 months ago
- Featurized Density Ratio Estimation☆19Jul 11, 2021Updated 4 years ago
- Pipeline parallelism for the minimalist☆39Aug 6, 2025Updated 8 months ago
- ☆17Apr 7, 2025Updated last year
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- ☆128Apr 11, 2026Updated 2 weeks ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆29Mar 1, 2025Updated last year
- ☆23Oct 17, 2024Updated last year
- ☆14Apr 8, 2026Updated 3 weeks ago
- Source code for EMNLP findings paper "Open-Vocabulary Argument Role Prediction for Event Extraction"☆19Nov 5, 2022Updated 3 years ago
- [NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis☆163Nov 6, 2025Updated 5 months ago
- Scalable toolkit for efficient model reinforcement☆1,568Updated this week
- A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.☆114Jun 4, 2025Updated 10 months ago
- Website with current metrics on the fastest AI models.☆43Nov 13, 2024Updated last year
- ☆35Nov 11, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25☆90Jun 16, 2025Updated 10 months ago
- Software relating to relational empirical risk minimization☆16Jun 12, 2021Updated 4 years ago
- ☆35Nov 26, 2025Updated 5 months ago
- Train speculative decoding models effortlessly and port them smoothly to SGLang serving.☆801Apr 2, 2026Updated 3 weeks ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆78Apr 2, 2024Updated 2 years ago
- Combining SOAP and MUON☆20Feb 11, 2025Updated last year
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm…☆103Aug 25, 2025Updated 8 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆104Jul 19, 2025Updated 9 months ago
- Codebase for running (conditional) probing experiments☆21Nov 13, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆600Aug 12, 2025Updated 8 months ago
- Accelerating MoE with IO and Tile-aware Optimizations☆661Apr 22, 2026Updated last week
- ☆18Updated this week
- ☆24Jun 18, 2024Updated last year
- Official PyTorch implementation for "TensorLens: End-to-End Transformer Analysis via High-Order Attention Tensors" [ACL 2026]☆46Apr 14, 2026Updated 2 weeks ago
- Gantry provides an API that streamlines running experiments in Beaker☆33Apr 8, 2026Updated 3 weeks ago
- ☆94Oct 30, 2025Updated 6 months ago