OpenPipe/deductive-reasoning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenPipe/deductive-reasoning)

OpenPipe / deductive-reasoning

Train your own SOTA deductive reasoning model

☆111

Alternatives and similar repositories for deductive-reasoning

Users that are interested in deductive-reasoning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenPipe / rl-experiments
View on GitHub
OpenPipe Reinforcement Learning Experiments
☆34Mar 14, 2025Updated last year
bradhilton / temporal-clue
View on GitHub
Clue inspired puzzles for testing LLM deduction abilities
☆47Mar 19, 2026Updated 4 months ago
kubernetes-bad / reward-composer
View on GitHub
Lego for GRPO
☆30May 27, 2025Updated last year
axolotl-ai-cloud / grpo_code
View on GitHub
A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.
☆41Apr 4, 2025Updated last year
brendanhogan / completion_tree_view
View on GitHub
☆15Apr 26, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
haizelabs / annotate
View on GitHub
Skill to annotate and create ai judges from agent logs
☆17Oct 28, 2025Updated 8 months ago
eligotts / legos
View on GitHub
☆24Jan 22, 2026Updated 5 months ago
PrimeIntellect-ai / lab-cookbook
View on GitHub
Lab Cookbook
☆37Updated this week
cpldcpu / llmbenchmark
View on GitHub
Various LLM Benchmarks
☆26Feb 20, 2026Updated 5 months ago
swarnaHub / System-1.x
View on GitHub
PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models
☆25Jul 22, 2024Updated last year
sheryc / resonance_rope
View on GitHub
[ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.
☆24Mar 5, 2024Updated 2 years ago
JacksonCakes / vision-r1
View on GitHub
☆13Mar 23, 2025Updated last year
huggingface / trl-tuto
View on GitHub
☆52Feb 20, 2026Updated 5 months ago
LAION-AI / AIW
View on GitHub
Alice in Wonderland code base for experiments and raw experiments data
☆129Feb 4, 2026Updated 5 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
XinmingTu / auto-discovery
View on GitHub
☆31Mar 13, 2026Updated 4 months ago
open-thought / reasoning-gym
View on GitHub
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
☆1,463Apr 17, 2026Updated 3 months ago
character-ai / pipelining-sft
View on GitHub
Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings
☆118Jul 27, 2025Updated 11 months ago
willccbb / localchat
View on GitHub
☆13Apr 16, 2025Updated last year
open-thought / reasoning-gym-eval
View on GitHub
Collection of LLM completions for reasoning-gym task datasets
☆31Jul 4, 2025Updated last year
rodrigobaron / anthill
View on GitHub
☆24Jan 22, 2025Updated last year
glaive-ai / reflection_70b_training
View on GitHub
☆17Feb 12, 2025Updated last year
facebookresearch / moodist
View on GitHub
moodist
☆27Apr 23, 2026Updated 2 months ago
PrimeIntellect-ai / verifiers
View on GitHub
Our library for RL environments + evals
☆4,389Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
anilsharmay / full-stack-local-deep-research-agent
View on GitHub
Deploying full-stack on-prem deep research agent that can be run entirely on a local machine for $0!
☆34Nov 8, 2025Updated 8 months ago
firstbatchxyz / function-calling-eval
View on GitHub
The DPAB-α Benchmark
☆32Jan 15, 2025Updated last year
PrimeIntellect-ai / prime-rl
View on GitHub
Agentic RL Training at Scale
☆1,698Updated this week
huggingface / yourbench
View on GitHub
🤗 Benchmark Large Language Models Reliably On Your Data
☆451Apr 2, 2026Updated 3 months ago
satrams / rent-rl
View on GitHub
RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.
☆42Oct 31, 2025Updated 8 months ago
QuesmaOrg / otel-bench
View on GitHub
OpenTelemetry Benchmark - can AI trace your failed login?
☆20Updated this week
BY571 / DistRL-LLM
View on GitHub
Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization
☆22Mar 12, 2025Updated last year
meta-pytorch / torchtune
View on GitHub
PyTorch native post-training library
☆5,784Updated this week
GAIR-NLP / AIME-Preview
View on GitHub
☆84Mar 11, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Ammar-Alnagar / Ammar-Alnagar
View on GitHub
☆13Apr 10, 2026Updated 3 months ago
brendanhogan / picoDeepResearch
View on GitHub
☆69May 23, 2025Updated last year
linkedin / ControlLLM
View on GitHub
Control LLM
☆23Apr 6, 2025Updated last year
hao-ai-lab / Consistency_LLM
View on GitHub
[ICML 2024] CLLMs: Consistency Large Language Models
☆416Nov 16, 2024Updated last year
brendanhogan / DeepSeekRL-Extended
View on GitHub
Exploring Applications of GRPO
☆252Aug 25, 2025Updated 10 months ago
zenforic / csm-multi
View on GitHub
Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…
☆26Mar 28, 2025Updated last year
Percent-BFD / neurips_submission
View on GitHub
☆17Nov 23, 2023Updated 2 years ago