Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures
☆33Jan 29, 2026Updated 3 months ago
Alternatives and similar repositories for Awesome-LRM-Mechanisms
Users who are interested in Awesome-LRM-Mechanisms are comparing it to the libraries listed below.
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.☆21Mar 31, 2025Updated last year
- Exploring whether LLMs perform case-based or rule-based reasoning☆31Mar 2, 2024Updated 2 years ago
- ☆36Oct 22, 2025Updated 6 months ago
- LeanDojo-v2 is an end-to-end framework for training, evaluating, and deploying AI-assisted theorem provers for Lean 4.☆82Apr 26, 2026Updated 3 weeks ago
- Whole cell model of E. coli implemented with Vivarium☆29Updated this week
- The official codes of Rethinking Knowledge Graph Evaluation Under the Open-World Assumption (NeurIPS 2022)☆24Sep 20, 2022Updated 3 years ago
- Official codebase for our paper "Do Language Models Use Their Depth Efficiently?"☆29Jun 25, 2025Updated 10 months ago
- ☆19Apr 26, 2026Updated 3 weeks ago
- A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Langu…☆88Dec 12, 2025Updated 5 months ago
- PyTorch implementation of Gaussian word embeddings☆19Apr 7, 2018Updated 8 years ago
- A library for subgraph GNN based on pyg☆39Nov 28, 2024Updated last year
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated 2 years ago
- Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"☆20Nov 24, 2023Updated 2 years ago
- [VLDB'2025] LEAP: LLM-powered End-to-end Automatic Library for Processing Social Science Queries on Unstructured Data☆20Nov 3, 2025Updated 6 months ago
- An ironically named R package to automatically fetch data from the UK Parliament API.☆31Oct 8, 2021Updated 4 years ago
- Write your own operating system with Rust!☆30Mar 23, 2026Updated last month
- AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents☆55Jan 28, 2025Updated last year
- Benchmarking Social Intelligence of Language Agents through Interactive Scenarios☆13Jan 4, 2025Updated last year
- ☆42Jun 11, 2025Updated 11 months ago
- Whole Cell Model of E. coli☆40May 11, 2026Updated last week
- ☆14Oct 19, 2025Updated 7 months ago
- Utilities for Python development and debugging.☆25Dec 1, 2021Updated 4 years ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- ☆11Apr 12, 2024Updated 2 years ago
- All-in-One Safety Evaluation Framework☆50Apr 21, 2026Updated 3 weeks ago
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forget-retain Pareto Optimality☆21Oct 22, 2025Updated 6 months ago
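- JoinAI is an open-source repository focused on building algorithm engineering skills, including organized notes on engineering and mathematical principles☆11Apr 20, 2025Updated last year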
- The official codebase for "Experiential Reinforcement Learning" - https://arxiv.org/pdf/2602.13949v1☆68May 8, 2026Updated last week
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.☆29May 9, 2026Updated last week
- [ICLR 2026] A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.☆188Jul 6, 2025Updated 10 months ago
- The Implementation for the Paper "Time-Stamped Language Model: Teaching Language Models to Understand the Flow of Events"☆11May 6, 2021Updated 5 years ago
- Automated bibliography verification and LaTeX quality auditing for papers.☆117May 6, 2026Updated 2 weeks ago
- A supervised fine-tuning method for controllable reasoning length in large language models☆11May 8, 2025Updated last year
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"☆22May 15, 2025Updated last year
- Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning☆36Oct 26, 2025Updated 6 months ago
- [ICLR 2026] Official code for "EdiVal-Agent: Automated, object-centric evaluation for multi-turn instruction-based image editing"☆27Mar 1, 2026Updated 2 months ago
- Fantastic Data Engineering for Large Language Models☆93Dec 29, 2024Updated last year
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆40Jul 13, 2024Updated last year
- EMNLP 2022 Demo "SynKB: Semantic Search for Chemical Synthesis Procedures"☆17Oct 31, 2022Updated 3 years ago