The official repository of ALE-Bench
☆177Apr 10, 2026Updated this week
Alternatives and similar repositories for ALE-Bench
Users that are interested in ALE-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ThetaEvolve: Test-time Learning on Open Problems, enabling RL training on AlphaEvolve/OpenEvolve and emphasizing scaling test-time comput…☆141Feb 27, 2026Updated last month
- AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is write code that solves each…☆95Mar 12, 2026Updated last month
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 5 months ago
- An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace☆18Oct 21, 2024Updated last year
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆134Jan 31, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆27Oct 14, 2025Updated 6 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆533Feb 5, 2026Updated 2 months ago
- ☆13Mar 26, 2019Updated 7 years ago
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- ☆14Dec 28, 2022Updated 3 years ago
- MangaLMM – Try the official demo below☆38Nov 9, 2025Updated 5 months ago
- A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Langu…☆87Dec 12, 2025Updated 4 months ago
- An Automatic Theorem Prover for Hilbert System, generating nearly-minimal proofs.☆14Jan 21, 2025Updated last year
- Repo for "AlphaResearch: Accelerating New Algorithm Discovery with Language Models"☆54Nov 12, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The official repo for our paper: LegalAgentBench: Evaluating LLM Agents in Legal Domainl☆43Dec 30, 2024Updated last year
- ☆12Jun 13, 2023Updated 2 years ago
- [COLM 2025] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees☆31Jul 11, 2025Updated 9 months ago
- ☆17Aug 5, 2025Updated 8 months ago
- documentation used in my projects☆19Updated this week
- Evaluating LLMs with fewer examples☆173Apr 12, 2024Updated 2 years ago
- Implementation of GuP [Arai+ SIGMOD'23]☆10Jan 10, 2024Updated 2 years ago
- Recipes to train the self-rewarding reasoning LLMs.☆231Mar 2, 2025Updated last year
- Python3 implementation of the paper [Large-scale optimal transport map estimation using projection pursuit]☆15Feb 24, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents☆1,986Aug 13, 2025Updated 8 months ago
- virtual node analysis on ogb benchmark dataset☆14Mar 9, 2023Updated 3 years ago
- This is the implementation of word aligner using Hidden Markov Model☆10Jun 24, 2019Updated 6 years ago
- An ergonomic, opinionated memory interface for AI agents☆39Dec 18, 2025Updated 3 months ago
- ☆27Mar 21, 2024Updated 2 years ago
- A benchmark dataset for evaluating LLM's SVG editing capabilities☆37Oct 17, 2024Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆31Oct 9, 2025Updated 6 months ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- ☆24Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 2 months ago
- The rule-based evaluation subset and code implementation of Omni-MATH☆27Dec 23, 2024Updated last year
- Randomized Linear Algebra in Python☆13Mar 21, 2017Updated 9 years ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆243Updated this week
- A lightweight computational physics framework, based on the organization of turboWAVE. Implements a "Simulation, PhysicsModule, ComputeTo…☆11Apr 1, 2026Updated 2 weeks ago
- ☆10Nov 6, 2024Updated last year
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 2 months ago