Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"
☆64Apr 4, 2025Updated last year
Alternatives and similar repositories for why-think-step-by-step
Users that are interested in why-think-step-by-step are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repo for "AlphaResearch: Accelerating New Algorithm Discovery with Language Models"☆54Nov 12, 2025Updated 5 months ago
- ☆225Mar 26, 2025Updated last year
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…☆38Apr 8, 2023Updated 3 years ago
- ☆88Jul 30, 2024Updated last year
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…☆19Nov 19, 2024Updated last year
- Extract structured data from PDFs, Word docs and images. Embeddable directly into your application, regardless of the stack.☆23May 7, 2025Updated 11 months ago
- Learning Formal Mathematics from Intrinsic Motivation☆37Jul 10, 2025Updated 9 months ago
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization.☆12Mar 25, 2026Updated 3 weeks ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆240Feb 24, 2025Updated last year
- ☆42Jun 11, 2025Updated 10 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆53Oct 4, 2024Updated last year
- Code and dataset for the paper "IsarStep: a Benchmark for High-level Mathematical Reasoning"☆12Mar 15, 2021Updated 5 years ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆189May 25, 2025Updated 10 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆22Jul 16, 2023Updated 2 years ago
- An automated approach to the Collatz conjecture☆12Oct 7, 2023Updated 2 years ago
- ☆16Jul 10, 2023Updated 2 years ago
- Perceptive Learning for Legged Robots in IsaacLab. | LocoTouch: Learning Dynamic Quadrupedal Transport with Tactile Sensing (CoRL'25)☆57Sep 18, 2025Updated 7 months ago
- ☆18Nov 21, 2020Updated 5 years ago
- Sparse and discrete interpretability tool for neural networks☆64Feb 12, 2024Updated 2 years ago
- ☆84Aug 31, 2023Updated 2 years ago
- Fork of Flame repo for training of some new stuff in development☆19Updated this week
- ☆12Oct 10, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Handling Big Data with Knowledge Graph: A Detailed Guide☆30May 11, 2025Updated 11 months ago
- ☆31Apr 24, 2023Updated 2 years ago
- Python bindings for NVIDIA CUDA APIs.☆13Mar 2, 2024Updated 2 years ago
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆48Jun 2, 2025Updated 10 months ago
- ☆12Mar 31, 2024Updated 2 years ago
- ☆12Feb 6, 2021Updated 5 years ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆34Oct 8, 2025Updated 6 months ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- Exploring CoT-Decoding from Google DeepMind's paper, "Chain-of-Thought Reasoning Without Prompting".☆13Feb 22, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Analyze AI agent trajectories: extract actions, summarize, embed, and visualize.☆109Updated this week
- Repository for the paper Stream of Search: Learning to Search in Language☆153Feb 3, 2025Updated last year
- reinforcement learning algorithms from the book by Sutton and Barto☆17Feb 27, 2021Updated 5 years ago
- Code for L4DC 2022 paper: Joint Synthesis of Safety Certificate and Safe Control Policy Using Constrained Reinforcement Learning.☆15Jul 31, 2023Updated 2 years ago
- (NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"☆50Jun 3, 2025Updated 10 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Oct 3, 2024Updated last year
- Pytorch Datasets for Easy-To-Hard☆29Jan 9, 2025Updated last year