apartresearch / Integer_AdditionView external linksLinks
✱ Understanding the underlying learning dynamics of simple tasks in Transformer networks
☆18Aug 16, 2024Updated last year
Alternatives and similar repositories for Integer_Addition
Users that are interested in Integer_Addition are comparing it to the libraries listed below
Sorting:
- Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis"☆19Jun 12, 2025Updated 8 months ago
- A YAML editor for the modern Kirby games☆13Aug 23, 2025Updated 5 months ago
- AlgZoo: uninterpreted models with fewer than 1,500 parameters☆40Jan 19, 2026Updated 3 weeks ago
- [WWW 25] USPTO-LLM: A Large Language Model-Assisted Information-enriched Chemical Reaction Dataset☆15Dec 12, 2024Updated last year
- ☆11Nov 8, 2022Updated 3 years ago
- A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Langu…☆85Dec 12, 2025Updated 2 months ago
- Code for LLM_Catastrophic_Forgetting via SAM.☆11Jun 7, 2024Updated last year
- code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis☆12Nov 17, 2024Updated last year
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆13Jan 9, 2024Updated 2 years ago
- Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper.☆16Nov 21, 2025Updated 2 months ago
- Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue (TOIS)☆13Oct 18, 2025Updated 3 months ago
- ☆12Nov 5, 2024Updated last year
- [EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua…☆13Nov 11, 2024Updated last year
- Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023☆16Sep 27, 2023Updated 2 years ago
- ☆14Apr 21, 2023Updated 2 years ago
- A curated list of Heterophilous Graph Self-Supervised Learning papers.☆15Apr 6, 2023Updated 2 years ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- Fine grained Empathy Direction Detection☆15Dec 11, 2020Updated 5 years ago
- [ICLR'25 Spotlight] Revisiting Random Walks for Learning on Graphs (RWNN), in PyTorch☆17Mar 4, 2025Updated 11 months ago
- 2D Vector-Quantized Auto-Encoder for compression of Whole-Slide Images in Histopathology☆16Jul 18, 2024Updated last year
- ☆12May 6, 2024Updated last year
- ☆12Mar 25, 2019Updated 6 years ago
- PyTorch implementation of StableMask (ICML'24)☆15Jun 27, 2024Updated last year
- ☆22Updated this week
- huggingface ChineseBert Tokenizer☆16Apr 16, 2022Updated 3 years ago
- Code and Data release for "Improving Multilingual Translation by Representation and Gradient Regularization" (Yang et al. EMNLP 2021), an…☆13Aug 12, 2024Updated last year
- Knowledge transfer from high-resource to low-resource programming languages for Code LLMs☆16Aug 12, 2025Updated 6 months ago
- A benchmark for mechanistic discovery of circuits in Transformers☆16Dec 15, 2024Updated last year
- Forcing Diffuse Distributions out of Language Models☆18Sep 10, 2024Updated last year
- Hands-on construction of a complete neural network☆14Aug 20, 2023Updated 2 years ago
- code scripts for blog posts I published☆12Jun 17, 2020Updated 5 years ago
- millisecond repeat image filter with Photo gallery 毫秒级图片库去重, 去重效率与图片库数量无关,0.1~10ms☆17Dec 20, 2019Updated 6 years ago
- Engine for collecting, uploading, and downloading model activations☆26Apr 2, 2025Updated 10 months ago
- This repository releases the code and data for utterance rewriting in open-domain dialogues.☆18Feb 24, 2023Updated 2 years ago
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- ☆13Dec 14, 2021Updated 4 years ago
- (Unofficial) Implementation of ICLR 2021 paper "Gradient Vaccine: Investigating and Improving Multi-task Optimization in Massively Multil…☆14Sep 14, 2022Updated 3 years ago