Training tiny models to prove hard theorems
☆29 Feb 15, 2026 Updated last week
Alternatives and similar repositories for QED-Nano
Users who are interested in QED-Nano are comparing it to the libraries listed below.
- ☆23 Jan 6, 2026 Updated last month
- Deploying full-stack on-prem deep research agent that can be run entirely on a local machine for $0! ☆30 Nov 8, 2025 Updated 3 months ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning" ☆37 Oct 1, 2025 Updated 4 months ago
- Visualize any repo or codebase into diagram or animation ☆20 Oct 14, 2024 Updated last year
- ☆19 Oct 2, 2023 Updated 2 years ago
- Official Implementation of the paper "Jointly Reinforcing Diversity and Quality in Language Model Generations" ☆57 Dec 26, 2025 Updated last month
- Official Implementation for NorMuon paper ☆56 Feb 9, 2026 Updated 2 weeks ago
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?" ☆64 Dec 10, 2025 Updated 2 months ago
- ☆54 Nov 12, 2025 Updated 3 months ago
- ☆161 Dec 18, 2025 Updated 2 months ago
- Ludic – an LLM-RL library for the era of experience ☆60 Jan 9, 2026 Updated last month
- This is the official repo for the paper "AMO-Bench: Large Language Models Still Struggle in High School Math Competitions". ☆62 Feb 6, 2026 Updated 2 weeks ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL). ☆48 Oct 16, 2025 Updated 4 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. ☆41 Apr 4, 2025 Updated 10 months ago
- ☆76 Feb 18, 2026 Updated last week
- [NeurIPS 2025] RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning ☆52 Oct 23, 2025 Updated 4 months ago
- Official Repository of Native Parallel Reasoner ☆100 Feb 5, 2026 Updated 3 weeks ago
- A framework for building provenance-based intrusion detection systems with neural networks ☆71 Feb 19, 2026 Updated last week
- Primus-SaFE (Stability and Fault Endurance) ☆50 Updated this week
- Text to audio with Tik-Tok Voices ☆13 Apr 6, 2023 Updated 2 years ago
- A powerful MCP testing tool with multi-provider LLM support (Ollama, OpenAI, Claude, Gemini). Test, debug, and develop MCP servers with a… ☆18 Jan 7, 2026 Updated last month
- Source code and dataset for the paper 'Saamayik: A Benchmark and Dataset for English-Sanskrit Translation' ☆15 Oct 11, 2025 Updated 4 months ago
- Repository of IPBench ☆19 Jan 4, 2026 Updated last month
- ☆76 Jan 15, 2026 Updated last month
- "A Survey on Agent-as-a-Judge" ☆91 Jan 12, 2026 Updated last month
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting ☆35 Mar 19, 2024 Updated last year
- ☆29 Jan 15, 2026 Updated last month
- GBM implementation on Legate ☆14 Jan 28, 2026 Updated 3 weeks ago
- ☆11 Jul 17, 2023 Updated 2 years ago
- ☆13 Jan 14, 2026 Updated last month
- Reproduction in Paddle of the paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information" (ACL 2021) ☆10 Nov 15, 2021 Updated 4 years ago
- Simple and powerful extension for searching web and viewing website content. ☆11 Apr 11, 2025 Updated 10 months ago
- Home server set up ☆13 Oct 5, 2025 Updated 4 months ago
- ☆34 Sep 22, 2025 Updated 5 months ago
- ☆10 Sep 29, 2024 Updated last year
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization ☆81 Dec 25, 2025 Updated 2 months ago
- Clue inspired puzzles for testing LLM deduction abilities ☆45 Mar 24, 2025 Updated 11 months ago
- ☆18 Dec 9, 2025 Updated 2 months ago
- The source code and the data for ACL 2022 paper "Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Dat… ☆14 Apr 21, 2023 Updated 2 years ago