Train your own SOTA deductive reasoning model
☆107Mar 6, 2025Updated 11 months ago
Alternatives and similar repositories for deductive-reasoning
Users that are interested in deductive-reasoning are comparing it to the libraries listed below
Sorting:
- OpenPipe Reinforcement Learning Experiments☆32Mar 14, 2025Updated 11 months ago
- Clue inspired puzzles for testing LLM deduction abilities☆45Mar 24, 2025Updated 11 months ago
- Lego for GRPO☆30May 27, 2025Updated 9 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆41Apr 4, 2025Updated 10 months ago
- ☆15Apr 26, 2025Updated 10 months ago
- A RL env with procedurally generated symbolic reasoning data☆33Feb 19, 2026Updated last week
- Training tiny models to prove hard theorems☆29Feb 15, 2026Updated last week
- Deploying full-stack on-prem deep research agent that can be run entirely on a local machine for $0!☆30Nov 8, 2025Updated 3 months ago
- [ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.☆24Mar 5, 2024Updated last year
- Collection of LLM completions for reasoning-gym task datasets☆30Jul 4, 2025Updated 7 months ago
- Alice in Wonderland code base for experiments and raw experiments data☆130Feb 4, 2026Updated 3 weeks ago
- Async RL Training at Scale☆1,096Updated this week
- unsloth-5090-multiple☆60May 21, 2025Updated 9 months ago
- Your personal ArXiv Feed☆23Dec 18, 2024Updated last year
- ☆16Feb 22, 2025Updated last year
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models☆25Jul 22, 2024Updated last year
- ☆24Jan 22, 2025Updated last year
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated 11 months ago
- Code for the paper "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024"☆13Feb 14, 2025Updated last year
- Code for the AACL 2022 Paper "This Patient Looks Like That Patient: Prototypical Networks for Interpretable Diagnosis Prediction from Cli…☆12Nov 18, 2022Updated 3 years ago
- [ICML 2024] CLLMs: Consistency Large Language Models☆412Nov 16, 2024Updated last year
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,215Aug 27, 2025Updated 6 months ago
- ☆12Jun 21, 2022Updated 3 years ago
- ☆12Mar 23, 2025Updated 11 months ago
- Moondream MCP Server in Python☆44Jul 2, 2025Updated 7 months ago
- The original Shared Recurrent Memory Transformer implementation☆33Jul 11, 2025Updated 7 months ago
- ☆49Sep 8, 2025Updated 5 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆221Nov 27, 2025Updated 3 months ago
- [ICLR 2026] Tina: Tiny Reasoning Models via LoRA☆320Sep 23, 2025Updated 5 months ago
- ScrollNet for Continual Learning☆11Sep 11, 2023Updated 2 years ago
- Evaluation of LLMs on latest math competitions☆222Feb 20, 2026Updated last week
- 🤗 Benchmark Large Language Models Reliably On Your Data☆429Dec 30, 2025Updated 2 months ago
- code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D…☆12Sep 16, 2022Updated 3 years ago
- ☆14Apr 16, 2025Updated 10 months ago
- Process Reward Models That Think☆79Nov 29, 2025Updated 3 months ago
- ☆18Aug 19, 2025Updated 6 months ago
- ☆15Dec 3, 2024Updated last year
- ☆13Jan 27, 2019Updated 7 years ago
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated 11 months ago