eth-sri / language-model-arithmetic
Controlled Text Generation via Language Model Arithmetic
☆215Updated 5 months ago
Alternatives and similar repositories for language-model-arithmetic:
Users that are interested in language-model-arithmetic are comparing it to the libraries listed below
- ☆305Updated 8 months ago
- Mass-editing thousands of facts into a transformer memory (ICLR 2023)☆464Updated last year
- Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024☆120Updated last week
- This is a repository for "PMET: Precise Model Editing in a Transformer"☆49Updated last year
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆164Updated 2 weeks ago
- ☆130Updated last year
- [IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection☆86Updated 10 months ago
- visualizing attention for LLM users☆193Updated 2 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆151Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆224Updated 2 weeks ago
- [ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM☆185Updated last month
- Codebase for reproducing the experiments of the semantic uncertainty paper (paragraph-length experiments).☆53Updated 10 months ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆157Updated 9 months ago
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts☆293Updated 5 months ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆126Updated last week
- Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts…☆187Updated 7 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆183Updated 3 months ago
- ☆141Updated 10 months ago
- TART: A plug-and-play Transformer module for task-agnostic reasoning☆195Updated last year
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆463Updated 11 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆189Updated this week
- Evaluating LLMs with fewer examples☆145Updated 10 months ago
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model☆502Updated last month
- ☆283Updated last year
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆279Updated 2 weeks ago
- Self-Alignment with Principle-Following Reward Models☆154Updated last year
- DSIR large-scale data selection framework for language model training☆241Updated 10 months ago
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"☆464Updated last month
- A simple unified framework for evaluating LLMs☆199Updated last month
- Functional Benchmarks and the Reasoning Gap☆84Updated 5 months ago