☆631 · Feb 25, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for superhuman
Users that are interested in superhuman are comparing it to the libraries listed below.
- ☆12 · Mar 31, 2024 · Updated last year
- Technical report of Kimina-Prover Preview. ☆364 · Jul 10, 2025 · Updated 8 months ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones ☆64 · Jan 26, 2026 · Updated last month
- ☆13 · Aug 29, 2025 · Updated 6 months ago
- A framework for few-shot evaluation of autoregressive language models. ☆26 · Dec 21, 2023 · Updated 2 years ago
- ☆15 · Apr 26, 2025 · Updated 10 months ago
- Generic interface for hooking up to any Interactive Theorem Prover (ITP) and collecting data for training ML models for AI in formal theo… ☆18 · Feb 19, 2026 · Updated last month
- LLMs + Lean, on your laptop or in the cloud ☆203 · Oct 10, 2025 · Updated 5 months ago
- The (open-source part of) code to reproduce "BPPSA: Scaling Back-propagation by Parallel Scan Algorithm". ☆13 · Jun 7, 2021 · Updated 4 years ago
- Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries ☆67 · Feb 29, 2024 · Updated 2 years ago
- Retrieval-Augmented Theorem Provers for Lean ☆318 · Jan 30, 2025 · Updated last year
- Brain Interpreter and Visualizer Online. ☆10 · Sep 1, 2016 · Updated 9 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 · Jul 12, 2023 · Updated 2 years ago
- ☆14 · Apr 16, 2025 · Updated 11 months ago
- Official Implementation for the paper "VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models" ☆22 · Aug 14, 2025 · Updated 7 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning. ☆25 · Oct 7, 2025 · Updated 5 months ago
- A Lean4 script for robustly verifying submitted proofs of theorems and implementations of functions ☆42 · Mar 10, 2026 · Updated 2 weeks ago
- NeqLIPS: a powerful Olympiad-level inequality prover ☆40 · Sep 7, 2025 · Updated 6 months ago
- Verified efficient algorithms in Lean4. ☆37 · Jan 3, 2026 · Updated 2 months ago
- ☆62 · Jan 20, 2026 · Updated 2 months ago
- [COLM 2024] A Survey on Deep Learning for Theorem Proving ☆219 · May 28, 2025 · Updated 9 months ago
- LeanEuclid is a benchmark for autoformalization in the domain of Euclidean geometry, targeting the proof assistant Lean. ☆125 · Nov 25, 2025 · Updated 3 months ago
- Waffer-thin FlaskGPT on Vercel. ☆12 · Jun 1, 2023 · Updated 2 years ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning ☆39 · Jan 16, 2026 · Updated 2 months ago
- A framework bridging cognitive science and LLM reasoning research to diagnose and improve how large language models reason, based on anal… ☆37 · Nov 26, 2025 · Updated 3 months ago
- ☆16 · Jul 29, 2024 · Updated last year
- SGX-based encrypted deduplication prototype ☆14 · May 14, 2021 · Updated 4 years ago
- ☆59 · Dec 1, 2025 · Updated 3 months ago
- Code for L0-ARM: Network Sparsification via Stochastic Binary Optimization ☆15 · Oct 25, 2019 · Updated 6 years ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024) ☆12 · Oct 31, 2024 · Updated last year
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation ☆12 · May 24, 2021 · Updated 4 years ago
- (ACL 2025 Main) Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification - Offici… ☆20 · Dec 26, 2025 · Updated 2 months ago
- Repository of <FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models> ☆77 · Jan 8, 2026 · Updated 2 months ago
- ☆14 · Mar 27, 2024 · Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆28 · Mar 9, 2026 · Updated 2 weeks ago
- LeanHammer is an automated reasoning tool for Lean that brings together multiple proof search and reconstruction techniques and combines … ☆84 · Mar 17, 2026 · Updated last week
- ☆19 · Jan 24, 2025 · Updated last year
- ☆33 · Jul 9, 2025 · Updated 8 months ago
- A node library to interact with the GitHub issues API ☆25 · Jan 27, 2026 · Updated last month