facebookresearch / polymath
AI Agent leveraging symbolic reasoning and other auxiliary tools to boost its capabilities on various logic and reasoning benchmarks. This project aims to develop a robust and flexible AI system that can tackle complex problems in areas such as decision-making, mathematics, and programming.
☆39 Updated 5 months ago
Alternatives and similar repositories for polymath
Users who are interested in polymath are comparing it to the repositories listed below
- This is the official repository for all the code of TheoremLlama ☆47 Updated 6 months ago
- ☆42 Updated last year
- ☆83 Updated last year
- LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management ☆75 Updated last year
- Large language models designed for formal theorem proving through tool-integrated reasoning. ☆31 Updated 5 months ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM. ☆69 Updated 7 months ago
- ☆408 Updated last month
- ☆148 Updated this week
- A benchmark for evaluating LLMs on open-ended CS problems. Exploring the Next Frontier of Computer Science. ☆103 Updated this week
- ☆191 Updated 2 weeks ago
- ☆76 Updated last month
- AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is to write code that solves each… ☆86 Updated last week
- ☆225 Updated 10 months ago
- LLMs + Lean, on your laptop or in the cloud ☆199 Updated 3 months ago
- BigOBench assesses the capacity of Large Language Models (LLMs) to comprehend time-space computational complexity of input or generated c… ☆40 Updated 9 months ago
- ☆21 Updated 6 months ago
- Official Repository of Native Parallel Reasoner ☆100 Updated 2 weeks ago
- Multi-Granularity LLM Debugger [ICSE2026] ☆95 Updated 7 months ago
- LIMI: Less is More for Agency ☆160 Updated 3 months ago
- ☆42 Updated last month
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆88 Updated 10 months ago
- Evaluation of LLMs on latest math competitions ☆214 Updated last month
- Automatic solver for plane geometry problems. ☆85 Updated 5 months ago
- ☆106 Updated last year
- ☆32 Updated last week
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents ☆132 Updated last year
- ☆12 Updated 9 months ago
- The original Shared Recurrent Memory Transformer implementation ☆33 Updated 6 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning" ☆102 Updated 5 months ago
- ☆29 Updated 2 months ago