eth-sri / language-model-arithmetic
Controlled Text Generation via Language Model Arithmetic
☆216 Updated 6 months ago
Alternatives and similar repositories for language-model-arithmetic:
Users who are interested in language-model-arithmetic are comparing it to the libraries listed below.
- ☆307 Updated 9 months ago
- This is a repository for "PMET: Precise Model Editing in a Transformer" ☆50 Updated last year
- [ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM ☆185 Updated 2 months ago
- visualizing attention for LLM users ☆201 Updated 3 months ago
- Mass-editing thousands of facts into a transformer memory (ICLR 2023) ☆472 Updated last year
- ☆288 Updated last year
- Self-Alignment with Principle-Following Reward Models ☆156 Updated last year
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model ☆516 Updated 2 months ago
- Evaluating LLMs with fewer examples ☆148 Updated 11 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering ☆167 Updated last month
- Benchmarking LLMs with Challenging Tasks from Real Users ☆219 Updated 5 months ago
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance… ☆148 Updated 2 months ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning ☆236 Updated last year
- ☆131 Updated last year
- Improving Alignment and Robustness with Circuit Breakers ☆192 Updated 6 months ago
- Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024 ☆124 Updated last month
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation ☆298 Updated last month
- Codebase for reproducing the experiments of the semantic uncertainty paper (paragraph-length experiments). ☆54 Updated 11 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs" ☆463 Updated last year
- ☆163 Updated 3 weeks ago
- Code accompanying "How I learned to start worrying about prompt formatting". ☆104 Updated 6 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ☆151 Updated last year
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467 ☆283 Updated last month
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts ☆298 Updated 6 months ago
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets ☆215 Updated 4 months ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning". ☆157 Updated 10 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆105 Updated 6 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆230 Updated last month
- PASTA: Post-hoc Attention Steering for LLMs ☆113 Updated 4 months ago
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji… ☆218 Updated last year