character-ai / pipelining-sft
Simple and efficient DeepSeek V3 SFT using pipeline parallelism and expert parallelism, with both FP8 and BF16 training
☆114 · Jul 27, 2025 · Updated 6 months ago
Alternatives and similar repositories for pipelining-sft
Users that are interested in pipelining-sft are comparing it to the libraries listed below
- coded with and corrected by Google Anti-Gravity ☆13 · Nov 23, 2025 · Updated 2 months ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates. ☆362 · Updated this week
- ☆39 · Jan 27, 2026 · Updated 2 weeks ago
- Python package for compressing floating-point PyTorch tensors ☆13 · Jul 22, 2024 · Updated last year
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. ☆41 · Apr 4, 2025 · Updated 10 months ago
- Code for "Consistent Estimators for Learning to Defer to an Expert" (ICML 2020) ☆15 · Jan 28, 2023 · Updated 3 years ago
- A FREE comprehensive step-by-step 8-bit ATmega328P C and Assembler tutorial covering Embedded Software Development to Reverse Engineering… ☆11 · Nov 26, 2025 · Updated 2 months ago
- CIKM 2022: Evaluating Interpolation and Extrapolation Performance of Neural Retrieval Models ☆11 · Aug 4, 2022 · Updated 3 years ago
- Software relating to relational empirical risk minimization ☆17 · Jun 12, 2021 · Updated 4 years ago
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs". ☆49 · Jan 26, 2026 · Updated 2 weeks ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models. ☆102 · Jul 19, 2025 · Updated 6 months ago
- A framework for majority vote classifiers allowing for computation of PAC Bayesian risk bounds. ☆14 · Feb 9, 2023 · Updated 3 years ago
- creditmodel, 模型,评分卡,scorecard, vintage, automatic modeling ☆11 · Aug 10, 2024 · Updated last year
- ☆14 · Jun 25, 2025 · Updated 7 months ago
- ☆40 · Dec 6, 2025 · Updated 2 months ago
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models" ☆43 · Oct 1, 2024 · Updated last year
- Fluid Language Model Benchmarking ☆26 · Sep 16, 2025 · Updated 4 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers ☆27 · Mar 1, 2025 · Updated 11 months ago
- This website contains the python code accompanying the book "Mathematical Foundations of Deep Learning Models and Algorithms" by Konstant… ☆42 · Nov 24, 2025 · Updated 2 months ago
- Source code for EMNLP findings paper "Open-Vocabulary Argument Role Prediction for Event Extraction" ☆19 · Nov 5, 2022 · Updated 3 years ago
- Example of how to use R in Jupyter notebooks and make compatible with Binder ☆17 · Feb 25, 2019 · Updated 6 years ago
- ☆17 · Updated this week
- EmbeddedLLM: API server for Embedded Device Deployment. Currently support CUDA/OpenVINO/IpexLLM/DirectML/CPU ☆50 · Oct 6, 2024 · Updated last year
- Manages vllm-nccl dependency ☆17 · Jun 3, 2024 · Updated last year
- Simple and efficient pytorch-native transformer training and inference (batched) ☆79 · Apr 2, 2024 · Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆20 · Jun 3, 2024 · Updated last year
- Scalable toolkit for efficient model reinforcement ☆1,307 · Updated this week
- Codebase for running (conditional) probing experiments ☆22 · Nov 13, 2022 · Updated 3 years ago
- Rust crate for some audio utilities ☆27 · Mar 8, 2025 · Updated 11 months ago
- Inference-time scaling for LLMs-as-a-judge. ☆328 · Nov 5, 2025 · Updated 3 months ago
- Pipeline parallelism for the minimalist ☆40 · Aug 6, 2025 · Updated 6 months ago
- RWKV models and examples powered by candle. ☆24 · Jan 19, 2026 · Updated 3 weeks ago
- TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25 ☆88 · Jun 16, 2025 · Updated 7 months ago
- Featurized Density Ratio Estimation ☆20 · Jul 11, 2021 · Updated 4 years ago
- ☆92 · Jul 5, 2024 · Updated last year
- One-stop shop for running and fine-tuning transformer-based language models for retrieval ☆63 · Jan 2, 2026 · Updated last month
- Train your own SOTA deductive reasoning model ☆107 · Mar 6, 2025 · Updated 11 months ago
- [NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis ☆148 · Nov 6, 2025 · Updated 3 months ago
- PyTorch building blocks for the OLMo ecosystem ☆785 · Updated this week