facebookresearch / polymath
AI Agent leveraging symbolic reasoning and other auxiliary tools to boost its capabilities on various logic and reasoning benchmarks. This project aims to develop a robust and flexible AI system that can tackle complex problems in areas such as decision-making, mathematics, and programming.
☆31Updated 3 weeks ago
Alternatives and similar repositories for polymath
Users who are interested in polymath are comparing it to the libraries listed below.
- LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management☆70Updated 7 months ago
- ☆41Updated 11 months ago
- This is the official repository for all the code of TheoremLlama☆44Updated last month
- ☆70Updated last year
- ☆286Updated last month
- The original Shared Recurrent Memory Transformer implementation☆30Updated last month
- BigOBench assesses the capacity of Large Language Models (LLMs) to comprehend time-space computational complexity of input or generated c…☆37Updated 4 months ago
- ☆21Updated last month
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆77Updated 5 months ago
- Train, tune, and infer Bamba model☆131Updated 3 months ago
- Repository to create traveling waves that integrate spatial information through time☆55Updated last month
- ☆41Updated last year
- ☆56Updated 2 months ago
- Resa: Transparent Reasoning Models via SAEs☆41Updated 3 weeks ago
- ☆10Updated 4 months ago
- A multimodal agent that can interact with its own PC in a multimodal manner.☆31Updated this week
- ☆26Updated 2 months ago
- Multi-Granularity LLM Debugger☆90Updated 2 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆41Updated last year
- On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆40Updated 2 months ago
- CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that ena…☆39Updated 4 months ago
- [NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study☆53Updated 9 months ago
- For ACL25 paper "WAFFLE: Multi-Modal Model for Automated Front-End Development" - by Shanchao Liang and Nan Jiang and Shangshu Qian and L…☆11Updated 3 months ago
- UQ: Assessing Language Models on Unsolved Questions☆23Updated 2 weeks ago
- Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆21Updated 3 months ago
- ☆19Updated 6 months ago
- ☆35Updated 3 months ago
- ☆39Updated 2 months ago
- Alternative way to calculate self-attention☆18Updated last year
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆96Updated last month