zepingyu0512 / arithmetic-mechanismView external linksLinks
code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis
☆12Nov 17, 2024Updated last year
Alternatives and similar repositories for arithmetic-mechanism
Users that are interested in arithmetic-mechanism are comparing it to the libraries listed below
Sorting:
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…☆13Nov 17, 2024Updated last year
- ☆17Apr 26, 2024Updated last year
- Official implementation of "MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model". Our co…☆25Dec 20, 2024Updated last year
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆50Nov 17, 2024Updated last year
- ☆25Apr 26, 2024Updated last year
- Official implementation of “Watch Your Step: A Fine-Grained Evaluation Framework for Multi-hop Knowledge Editing in Large Language Models…☆46Nov 25, 2025Updated 2 months ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 10 months ago
- ☆16Apr 26, 2024Updated last year
- ☆17Apr 26, 2024Updated last year
- ☆16Apr 26, 2024Updated last year
- ☆20Aug 26, 2024Updated last year
- ☆18Apr 26, 2024Updated last year
- ☆19Jan 3, 2025Updated last year
- ☆91Dec 23, 2024Updated last year
- ☆20Jul 15, 2024Updated last year
- [ACL 2024] Unveiling Linguistic Regions in Large Language Models☆33Jun 9, 2024Updated last year
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling☆36Jul 12, 2024Updated last year
- Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context☆41Aug 16, 2024Updated last year
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- About Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning"☆13Oct 7, 2023Updated 2 years ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications.☆15Dec 20, 2021Updated 4 years ago
- ☆16Feb 22, 2025Updated 11 months ago
- ☆14Mar 21, 2024Updated last year
- ☆11Jan 11, 2022Updated 4 years ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- code for polite☆11Feb 28, 2024Updated last year
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆48Oct 20, 2025Updated 3 months ago
- Conditional DDPM for characterizing radio sources from dirty images. (autumn 2023)☆11Nov 30, 2023Updated 2 years ago
- ☆13Jun 22, 2025Updated 7 months ago
- ☆39Jan 16, 2026Updated last month
- ReLAx - Reinforcement Learning Applications Library☆15Feb 19, 2023Updated 2 years ago
- [ICML 2023] Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills☆12Jul 15, 2023Updated 2 years ago
- unifloc on python☆15Nov 14, 2020Updated 5 years ago
- ☆13May 3, 2024Updated last year
- ☆10Feb 12, 2024Updated 2 years ago
- Code for the paper "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024"☆13Feb 14, 2025Updated last year
- Code repository for our work on Quantum Pi☆10Jun 4, 2024Updated last year
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆13May 28, 2025Updated 8 months ago