Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
☆15Nov 4, 2023Updated 2 years ago
Alternatives and similar repositories for MechanisticProbe
Users that are interested in MechanisticProbe are comparing it to the libraries listed below.
- Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.☆16Aug 14, 2023Updated 2 years ago
- Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis"☆20Jun 12, 2025Updated 11 months ago
- Official project for the paper "Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers"☆31Jan 13, 2024Updated 2 years ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆57Oct 28, 2024Updated last year
- Redwood Research's transformer interpretability tools☆15Apr 15, 2022Updated 4 years ago
- Baseline models for the paper: "Modeling Naive Psychology of Characters in Simple Commonsense Stories" by Hannah Rashkin, Antoine Bosselu…☆16Feb 23, 2021Updated 5 years ago
- A codebase for ACL 2023 paper: Mitigating Label Biases for In-context Learning☆10Aug 4, 2023Updated 2 years ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆45Jun 30, 2024Updated last year
- Multi-camera calibration (intrinsics, extrinsics, and bundle adjustment)☆14Nov 2, 2025Updated 6 months ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆39Feb 27, 2024Updated 2 years ago
- AdaICL: Which Examples to Annotate of In-Context Learning? Towards Effective and Efficient Selection☆19Oct 30, 2023Updated 2 years ago
- ⚠️ ARCHIVED - All development moved to https://github.com/itbench-hub/ITBench/tree/main/scenarios☆15Feb 24, 2026Updated 2 months ago
- Clean, extensible implementation of MACAW [ICML 2021]☆12Dec 7, 2021Updated 4 years ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆157Jul 12, 2024Updated last year
- [ASE 2025] CoSIL: Software Issue Localization via LLM-Driven Code Repository Graph Searching☆20Apr 20, 2026Updated last month
- CVPR2021: Detecting Human-Object Interaction via Fabricated Compositional Learning☆16Jul 7, 2021Updated 4 years ago
- Train your own GPT2!☆14Apr 11, 2023Updated 3 years ago
- Codebase for Global Neural CCG Parsing with Optimality Guarantees☆25Apr 27, 2017Updated 9 years ago
- Code for L4DC 2022 paper: Joint Synthesis of Safety Certificate and Safe Control Policy Using Constrained Reinforcement Learning.☆15Jul 31, 2023Updated 2 years ago
- Code for experiments on transformers using Markovian data.☆22Nov 22, 2024Updated last year
- A Gymnasium-based Environment of the Abstraction and Reasoning Corpus (ARC)☆71Aug 30, 2024Updated last year
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Feb 11, 2025Updated last year
- [EMNLP'24] Evaluating LLM performance and sensitivity when there is a "task-switch". Code for "LLM Task Interference: An Initial Study on…☆15Oct 27, 2024Updated last year
- Source code of “Reinforcement Learning with Token-level Feedback for Controllable Text Generation” (NAACL 2024)☆17Dec 8, 2024Updated last year
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention☆66Aug 30, 2025Updated 8 months ago
- This is the official implementation of Multi-Agent PPO.☆147Jan 17, 2023Updated 3 years ago
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆18Feb 21, 2025Updated last year
- Establishing new state-of-the-art results for Bokeh Rendering on the EBB! Dataset.☆16Aug 25, 2023Updated 2 years ago
- Extension of Neural Radiance Fields (Mildenhall et al., 2020) to perform 3D style transfer. Implementation in PyTorch Lightning.☆14Oct 18, 2021Updated 4 years ago
- Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges☆28May 14, 2025Updated last year
- Implements the Messenger environment and EMMA model.☆25Jun 14, 2023Updated 2 years ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated last year