alestolfo/lm-arithmetic

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alestolfo/lm-arithmetic)

alestolfo / lm-arithmetic

Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis"

☆20

Alternatives and similar repositories for lm-arithmetic

Users that are interested in lm-arithmetic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yifan-h / MechanisticProbe
View on GitHub
Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
☆15Nov 4, 2023Updated 2 years ago
apartresearch / Integer_Addition
View on GitHub
✱ Understanding the underlying learning dynamics of simple tasks in Transformer networks
☆19Aug 16, 2024Updated last year
Muennighoff / FLAN
View on GitHub
Provides a minimal implementation to extract FLAN datasets for further processing
☆11Feb 1, 2023Updated 3 years ago
FutureForMe / MADKE
View on GitHub
☆14Jan 6, 2025Updated last year
EIT-NLP / AccuracyParadox-RLHF
View on GitHub
[EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua…
☆12Nov 11, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
liyongqi67 / GCoQA
View on GitHub
☆18Jun 24, 2025Updated last year
UVa-NLP / VMASK
View on GitHub
Code for the paper "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers"
☆18Dec 15, 2020Updated 5 years ago
zepingyu0512 / arithmetic-mechanism
View on GitHub
code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis
☆12Nov 17, 2024Updated last year
VIM-Bench / VIM_TOOL
View on GitHub
☆12Jun 12, 2024Updated 2 years ago
WanliYoung / Collapse-in-Model-Editing
View on GitHub
Code and data repository for two papers (ACL & EMNLP 2024) on the topic of collapse in model editing.
☆10Dec 20, 2024Updated last year
technion-cs-nlp / llm-arithmetic-heuristics
View on GitHub
☆27Jun 9, 2026Updated last month
katiekang1998 / reasoning_generalization
View on GitHub
☆33Jan 7, 2025Updated last year
dattalab-6-cam / multicam-calibration
View on GitHub
Multi-camera calibration (intrinsics, extrinsics, and bundle adjustment)
☆14Nov 2, 2025Updated 8 months ago
LLM-MI-Research / Actionable-MI
View on GitHub
☆15Jan 20, 2026Updated 6 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
NJUNLP / AdaR
View on GitHub
☆15Dec 8, 2025Updated 7 months ago
iwangjian / Midi-Tuning
View on GitHub
[ACL 2024] Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue
☆26Oct 18, 2025Updated 9 months ago
Hritikbansal / jpo
View on GitHub
☆13Jul 2, 2025Updated last year
wjhou / Recap
View on GitHub
[EMNLP 2023 Findings] RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning
☆28Jun 12, 2025Updated last year
mjy1111 / PEAK
View on GitHub
The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models
☆16May 4, 2024Updated 2 years ago
Nix07 / finetuning
View on GitHub
This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…
☆32Oct 27, 2025Updated 8 months ago
liyongqi67 / MINDER
View on GitHub
☆71Jun 24, 2025Updated last year
jina-ai / mteb-de
View on GitHub
MTEB: Massive Text Embedding Benchmark
☆11Jan 29, 2024Updated 2 years ago
gauss5930 / AlpaGasus2-QLoRA
View on GitHub
This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!
☆15Nov 22, 2023Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
chungkwong / mathocr-myscript-android
View on GitHub
Offline handwritten mathematical expression recognition via stroke extraction for Android
☆14Sep 7, 2020Updated 5 years ago
Brandeis-cosi101a-hwe / HME_recognition
View on GitHub
Handwritten Math Expressions Recognition
☆13Sep 8, 2017Updated 8 years ago
xiaojunxu / learning-to-watermark-llm
View on GitHub
☆22Mar 19, 2024Updated 2 years ago
JoHof / IntegratedGradientsTutorial
View on GitHub
Very concise example of integrated gradients (a method to reveal areas of attention in input images)
☆10Jun 17, 2019Updated 7 years ago
YiCheng98 / Cooper
View on GitHub
This repository provides the data and the codes used in the AAAI'24 paper, COOPER: Coordinating Specialized Agents towards a Complex Dial…
☆28Mar 1, 2024Updated 2 years ago
Thomasyyj / LongBio-Benchmark
View on GitHub
A controlled benchmark on evaluating and studying the dynamics of Long Context Language Models
☆26Oct 17, 2025Updated 9 months ago
YangCao28 / nano-SGLang
View on GitHub
Nano SGLang
☆17Jul 21, 2025Updated last year
MoroccoAI / Moroccans-top-AI-confs
View on GitHub
Curated list of Moroccans publishing in the most prestigious AI conferences
☆11Jul 6, 2026Updated 2 weeks ago
iwangjian / TopDial
View on GitHub
[EMNLP 2023] Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation
☆31Oct 18, 2025Updated 9 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ezeli / Transformer_model
View on GitHub
A pytorch implementation of Attention Is All You Need (Transformer) for image captioning.
☆12Nov 15, 2021Updated 4 years ago
CAMeL-Lab / Gumar-Ngrams
View on GitHub
The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.
☆12Feb 5, 2020Updated 6 years ago
dvruette / concept-guidance
View on GitHub
Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…
☆21Feb 23, 2024Updated 2 years ago
thu-coai / Stylized-Story-Generation-with-Style-Guided-Planning
View on GitHub
Codes for paper "Stylized Story Generation with Style-Guided Planning"
☆12May 9, 2021Updated 5 years ago
icip-cas / ReasoningLens
View on GitHub
ReasoningLens: a user-friendly toolkit to visualize, understand, and debug model reasoning chains.
☆25Jul 7, 2026Updated 2 weeks ago
eecrazy / CausalBank
View on GitHub
CausalBank dataset from our IJCAI 2020 paper "Guided Generation of Cause and Effect" https://www.ijcai.org/Proceedings/2020/502
☆51Jan 22, 2021Updated 5 years ago
XiaoyuXu-Vincent / step-saliency
View on GitHub
Official code for paper "Reasoning Fails Where Step Flow Breaks" (ACL 2026)
☆18Apr 19, 2026Updated 3 months ago