deepseek-ai / DeepSeek-Prover-V1.5

☆220

Related projects ⓘ

Alternatives and complementary repositories for DeepSeek-Prover-V1.5

MARIO-Math-Reasoning / Super_MARIO
☆239Updated 3 weeks ago
THUDM / ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
☆295Updated 3 weeks ago
project-numina / aimo-progress-prize
☆304Updated 3 months ago
waterhorse1 / LLM_Tree_Search
(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training
☆215Updated 5 months ago
OpenBMB / OlympiadBench
[ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…
☆92Updated 3 months ago
tongyx361 / Awesome-LLM4Math
Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…
☆81Updated 3 months ago
yangky11 / miniF2F-lean4
☆33Updated last week
lean-dojo / ReProver
Retrieval-Augmented Theorem Provers for Lean
☆225Updated 2 months ago
hkust-nlp / llm-compression-intelligence
Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]
☆127Updated last month
wiio12 / LEGO-Prover
Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries
☆53Updated 8 months ago
vwxyzjn / summarize_from_feedback_details
☆112Updated 3 months ago
zhaoyu-li / DL4TP
[COLM 2024] A Survey on Deep Learning for Theorem Proving
☆131Updated 2 months ago
QwenLM / AutoIF
☆212Updated 3 months ago
YuxiXie / MCTS-DPO
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
☆187Updated 3 months ago
OFA-Sys / gsm8k-ScRel
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
☆216Updated last month
GAIR-NLP / ReAlign
Reformatted Alignment
☆112Updated last month
KbsdJames / Omni-MATH
The official repository of the Omni-MATH benchmark.
☆45Updated last week
albertqjiang / MMA
The official repository for the paper Multilingual Mathematical Autoformalization
☆32Updated 5 months ago
keirp / OpenWebMath
☆116Updated 6 months ago
OpenBMB / Eurus
☆283Updated last month
open-compass / MathBench
[ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset
☆84Updated 3 months ago
openai / safety-rbr-code-and-data
Code and example data for the paper: Rule Based Rewards for Language Model Safety
☆153Updated 3 months ago
SkyworkAI / Skywork-MoE
Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
☆127Updated 4 months ago
SimpleBerry / LLaMA-O1
Large Reasoning Models
☆457Updated this week
ZubinGou / math-evaluation-harness
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
☆94Updated 6 months ago
ryoungj / ObsScaling
[NeurIPS'24 Spotlight] Observational Scaling Laws
☆42Updated last month
THUDM / WebRL
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
☆147Updated this week
Eleanor-H / MUSTARD
Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data
☆36Updated 5 months ago
expz / quiet-star
Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)
☆39Updated 3 months ago
BrendanGraham14 / mcts-llm
☆67Updated 4 months ago