Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures
☆33Jan 29, 2026Updated 3 months ago
Alternatives and similar repositories for Awesome-LRM-Mechanisms
Users that are interested in Awesome-LRM-Mechanisms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.☆20Mar 31, 2025Updated last year
- exploring whether LLMs perform case-based or rule-based reasoning☆31Mar 2, 2024Updated 2 years ago
- ☆35Oct 22, 2025Updated 6 months ago
- LeanDojo-v2 is an end-to-end framework for training, evaluating, and deploying AI-assisted theorem provers for Lean 4.☆74Updated this week
- Whole cell model of E. coli implemented with Vivarium☆29Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The official codes of Rethinking Knowledge Graph Evaluation Under the Open-World Assumption (NeurIPS 2022)☆23Sep 20, 2022Updated 3 years ago
- Official codebase for our paper "Do Language Models Use Their Depth Efficiently?"☆29Jun 25, 2025Updated 10 months ago
- ☆19Sep 16, 2025Updated 7 months ago
- A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Langu…☆88Dec 12, 2025Updated 4 months ago
- PyTorch implementation of Gaussian word embeddings☆19Apr 7, 2018Updated 8 years ago
- A library for subgraph GNN based on pyg☆39Nov 28, 2024Updated last year
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"☆19Nov 24, 2023Updated 2 years ago
- [VLDB'2025] LEAP: LLM-powered End-to-end Automatic Library for Processing Social Science Queries on Unstructured Data☆20Nov 3, 2025Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An ironically named R package to automatically fetch data from the UK Parliament API.☆31Oct 8, 2021Updated 4 years ago
- Write your own operating system with Rust!☆30Mar 23, 2026Updated last month
- AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents☆54Jan 28, 2025Updated last year
- ☆42Jun 11, 2025Updated 10 months ago
- ☆14Oct 19, 2025Updated 6 months ago
- Whole Cell Model of E. coli☆40Updated this week
- Utilities for Python developing and debugging.☆25Dec 1, 2021Updated 4 years ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- ☆11Apr 12, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forge-retain Pareto Optimality☆21Oct 22, 2025Updated 6 months ago
- All-in-One Safety Evaluation Framwork☆48Apr 21, 2026Updated last week
- Automated bibliography verification and LaTeX quality auditing for papers.☆86Jan 22, 2026Updated 3 months ago
- JoinAI是一个开源仓库,专注于算法工程能力的培养,包括工程和数学原理的整理☆11Apr 20, 2025Updated last year
- The official codebase for "Experiential Reinforcement Learning" - https://arxiv.org/pdf/2602.13949v1☆68Apr 8, 2026Updated 3 weeks ago
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.☆27Apr 1, 2026Updated 3 weeks ago
- [ICLR 2026] A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.☆186Jul 6, 2025Updated 9 months ago
- The Implementation for the Paper "Time-Stamped Language Model: Teaching Language Models toUnderstand The Flow of Events"☆11May 6, 2021Updated 4 years ago
- A supervised fine-tuning method for controllable reasoning length in large language models (一种通过有监督微调实现大语言模型思考长度可控的方法)☆11May 8, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"☆22May 15, 2025Updated 11 months ago
- Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning☆36Oct 26, 2025Updated 6 months ago
- SkillOrchestra: Learning to Route Agents via Skill Transfer☆59Mar 25, 2026Updated last month
- [ICLR 2026] Official code for [EdiVal-Agent Automated, object-centric evaluation for multi-turn instruction-based image editing]☆27Mar 1, 2026Updated last month
- Fantastic Data Engineering for Large Language Models☆93Dec 29, 2024Updated last year
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆40Jul 13, 2024Updated last year
- EMNLP 2022 Demo "SynKB: Semantic Search for Chemical Synthesis Procedures"☆17Oct 31, 2022Updated 3 years ago