collinear-ai/spider

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/collinear-ai/spider)

collinear-ai / spider

Streamline on-policy/off-policy distillation workflows in a few lines of code

☆107

Alternatives and similar repositories for spider

Users that are interested in spider are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

collinear-ai / simlab
View on GitHub
SimLab is the data layer for creating simulations to QA, evaluate, hillclimb, and refine agents.
☆23Updated this week
collinear-ai / tau-trait
View on GitHub
TraitBasis applied to TauBench
☆18Nov 11, 2025Updated 8 months ago
michaelbzhu / lora-without-regret
View on GitHub
☆47Oct 23, 2025Updated 9 months ago
OpenRewardAI / openreward-cookbook
View on GitHub
Training and evaluating with OpenReward
☆33Apr 28, 2026Updated 2 months ago
RiddleHe / nanochat
View on GitHub
The best ChatGPT that $100 can buy.
☆54Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
brendanhogan / completion_tree_view
View on GitHub
☆15Apr 26, 2025Updated last year
HazyResearch / scaling-verification
View on GitHub
☆26Sep 4, 2025Updated 10 months ago
SalesforceAIResearch / PretrainRL-pipeline
View on GitHub
An automated data pipeline scaling RL to pretraining levels
☆76Jun 2, 2026Updated last month
hallerite / ludic
View on GitHub
Ludic – an LLM-RL library for the era of experience
☆67Jan 9, 2026Updated 6 months ago
kubernetes-bad / reward-composer
View on GitHub
Lego for GRPO
☆30May 27, 2025Updated last year
brendanhogan / picoDeepResearch
View on GitHub
☆69May 23, 2025Updated last year
open-tinker / OpenTinker
View on GitHub
OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆676Mar 21, 2026Updated 4 months ago
tyler-romero / microR1
View on GitHub
Simple repository for training small reasoning models
☆51Feb 17, 2026Updated 5 months ago
Zhiyuan-Zeng / RLVE
View on GitHub
[ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
☆226Apr 30, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
NovaSky-AI / SkyRL
View on GitHub
SkyRL: A Modular Full-stack RL Library for LLMs
☆2,093Updated this week
bethgelab / delta-belief-rl
View on GitHub
Official implementation of the ΔBelief-RL method.
☆31Feb 28, 2026Updated 4 months ago
Aloriosa / srmt
View on GitHub
The original Shared Recurrent Memory Transformer implementation
☆36Jul 11, 2025Updated last year
EvanZhuang / knowledge_flow
View on GitHub
Official Implementation of Knowledge Flow Prompting
☆35Oct 20, 2025Updated 9 months ago
Ziems / arbor
View on GitHub
A framework for optimizing DSPy programs with RL
☆340Jan 12, 2026Updated 6 months ago
seeM / Jim
View on GitHub
Jim is a simple, beautiful Jupyter notebook editor for macOS
☆35May 31, 2023Updated 3 years ago
thepowerfuldeez / sample_efficient_gpt
View on GitHub
Training framework with a goal to explore the frontier of sample efficiency of small language models
☆101Jan 25, 2026Updated 6 months ago
PrimeIntellect-ai / prime-rl
View on GitHub
Agentic RL Training at Scale
☆1,724Updated this week
13point5 / swe-grep-oss
View on GitHub
An RL environment similar to Cognition's SWE-Grep
☆16Mar 10, 2026Updated 4 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
tilde-research / nitrobrew-release
View on GitHub
Fused KL divergence from hidden states for knowledge distillation
☆19Apr 28, 2026Updated 2 months ago
deepfates / textile
View on GitHub
tuimorphic choose-your-own-adventure story game
☆21Updated this week
SalesforceAIResearch / LaTRO
View on GitHub
☆127Jun 2, 2026Updated last month
sail-sg / oat
View on GitHub
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆666Jan 29, 2026Updated 5 months ago
sdan / vmux-examples
View on GitHub
Example scripts for vmux - run any command in the cloud
☆42Apr 17, 2026Updated 3 months ago
xjdr-alt / muzero_sketch
View on GitHub
☆40Jul 26, 2024Updated 2 years ago
sanyalsunny111 / Looped-GPT
View on GitHub
Minimal and highly hackable implementation of Looped Transformers with GPT
☆25Mar 8, 2026Updated 4 months ago
PrimeIntellect-ai / verifiers
View on GitHub
Our library for RL environments + evals
☆4,400Updated this week
eyalbd2 / RL-based-Language-Modeling
View on GitHub
☆13Jan 27, 2019Updated 7 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
brendanhogan / DeepSeekRL-Extended
View on GitHub
Exploring Applications of GRPO
☆252Aug 25, 2025Updated 11 months ago
NousResearch / atropos
View on GitHub
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …
☆1,340Jul 4, 2026Updated 3 weeks ago
complex-reasoning / RPG
View on GitHub
[ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)
☆76Jun 29, 2026Updated 3 weeks ago
PrimeIntellect-ai / lab-cookbook
View on GitHub
Lab Cookbook
☆37Updated this week
JD-P / RetroInstruct
View on GitHub
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆34Oct 8, 2025Updated 9 months ago
RiddleHe / llm-interp
View on GitHub
A collection of lightweight interpretability scripts to understand how LLMs think
☆90Mar 18, 2026Updated 4 months ago
FrontierCS / FrontierSmith
View on GitHub
FrontierSmith, a new system that uses AI to synthesize open-ended coding problems at scale
☆48May 30, 2026Updated last month