SalesforceAIResearch/PretrainRL-pipeline

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SalesforceAIResearch/PretrainRL-pipeline)

SalesforceAIResearch / PretrainRL-pipeline

An automated data pipeline scaling RL to pretraining levels

☆76

Alternatives and similar repositories for PretrainRL-pipeline

Users that are interested in PretrainRL-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

gydpku / Data_Synthesis_RL
View on GitHub
☆123May 26, 2025Updated last year
stellalisy / PrefPalette
View on GitHub
☆21Apr 3, 2026Updated 3 months ago
Ignoramus0817 / SynthQuestions
View on GitHub
☆19Jul 30, 2025Updated 11 months ago
actava-ai / Cura
View on GitHub
actAVA Cura: Specialized Model for Agentic Healthcare
☆24Jul 20, 2026Updated last week
allenai / olmix
View on GitHub
☆41May 26, 2026Updated 2 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ant-research / long-context-modeling
View on GitHub
Research work aimed at addressing the problem of modeling infinite-length context
☆50Dec 18, 2025Updated 7 months ago
GAIR-NLP / LIMI
View on GitHub
LIMI: Less is More for Agency
☆162Oct 14, 2025Updated 9 months ago
BM-K / KoDiffCSE
View on GitHub
Difference-based Contrastive Learning for Korean Sentence Embeddings
☆23Mar 11, 2026Updated 4 months ago
J-Seo / KoCommonGEN-V2
View on GitHub
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
☆25Aug 24, 2024Updated last year
CoopReason / TESSY
View on GitHub
A Teacher–Student Cooperation Framework to Synthesize Student-Consistent SFT Data
☆35May 1, 2026Updated 2 months ago
sail-sg / SkyLadder
View on GitHub
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆43Dec 29, 2025Updated 7 months ago
seal-rg / streaming
View on GitHub
Code for the paper Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs
☆65Jun 23, 2026Updated last month
collinear-ai / spider
View on GitHub
Streamline on-policy/off-policy distillation workflows in a few lines of code
☆109Updated this week
Aloriosa / srmt
View on GitHub
The original Shared Recurrent Memory Transformer implementation
☆36Jul 11, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
ahans30 / goldfish-loss
View on GitHub
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆98Nov 17, 2024Updated last year
tilde-research / nitrobrew-release
View on GitHub
Fused KL divergence from hidden states for knowledge distillation
☆20Apr 28, 2026Updated 3 months ago
bethgelab / delta-belief-rl
View on GitHub
Official implementation of the ΔBelief-RL method.
☆31Feb 28, 2026Updated 5 months ago
DSBA-Lab / CodeLab
View on GitHub
DSBA code study
☆30Nov 7, 2023Updated 2 years ago
menik1126 / UNComp
View on GitHub
[EMNLP 2025🔥] UNComp: Can Matrix Entropy Uncover Sparsity? -- A Compressor Design from an Uncertainty-Aware Perspective
☆20Jan 7, 2026Updated 6 months ago
recursal / GoldFinch-paper
View on GitHub
GoldFinch and other hybrid transformer components
☆46Jul 20, 2024Updated 2 years ago
TIGER-AI-Lab / MAmmoTH2
View on GitHub
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆146Oct 27, 2024Updated last year
yichengchen24 / DataChef
View on GitHub
☆25Feb 12, 2026Updated 5 months ago
multimodal-art-projection / CodeCriticBench
View on GitHub
☆16Nov 1, 2025Updated 8 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
kyutai-labs / ARC-Encoder
View on GitHub
☆30Jan 5, 2026Updated 6 months ago
HazyResearch / scaling-verification
View on GitHub
☆26Sep 4, 2025Updated 10 months ago
TIGER-AI-Lab / General-Reasoner
View on GitHub
General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]
☆229Nov 27, 2025Updated 8 months ago
mathllm / MathCoder2
View on GitHub
☆71Oct 16, 2024Updated last year
Aviously / diff-interpretation-tuning
View on GitHub
Code for Learning to Interpret Weight Differences in Language Models (Goel et al. 2025)
☆20Jan 4, 2026Updated 6 months ago
SalesforceAIResearch / CoDA
View on GitHub
Salesforce AI Research's open diffusion language model
☆65Jun 2, 2026Updated last month
CodeCreator / WebOrganizer
View on GitHub
Organize the Web: Constructing Domains Enhances Pre-Training Data Curation
☆83May 2, 2025Updated last year
violetxi / ExpRL
View on GitHub
☆22Jun 16, 2026Updated last month
LLM360 / k2v2_train
View on GitHub
Training codebase for K2-V2
☆22Dec 17, 2025Updated 7 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
actava-ai / chi-bench
View on GitHub
Χ-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?
☆54Updated this week
neulab / data-agora
View on GitHub
[ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"
☆41Dec 13, 2024Updated last year
SalesforceAIResearch / swecomm
View on GitHub
☆28Jun 2, 2026Updated last month
WisdomShell / RewardAnything
View on GitHub
RewardAnything: Generalizable Principle-Following Reward Models
☆44Jun 11, 2025Updated last year
apple / ml-reversal-blessing
View on GitHub
☆17Jul 31, 2025Updated 11 months ago
Seeing-Fast-and-Slow / Seeing-Fast-and-Slow
View on GitHub
☆16May 28, 2026Updated 2 months ago
NVlabs / Tool-N1
View on GitHub
☆231Jun 2, 2025Updated last year