Controlled Text Generation via Language Model Arithmetic
☆224Sep 15, 2024Updated last year
Alternatives and similar repositories for language-model-arithmetic
Users that are interested in language-model-arithmetic are comparing it to the libraries listed below
Sorting:
- ☆11Mar 13, 2023Updated 2 years ago
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- Official Code for the papers: "Controlled Text Generation as Continuous Optimization with Multiple Constraints" and "Gradient-based Const…☆67Mar 21, 2024Updated last year
- The source code used for paper "Effective Seed-Guided Topic Discovery by Integrating Multiple Types of Contexts", in WSDM 2023.☆15May 27, 2023Updated 2 years ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆127Mar 30, 2024Updated last year
- [AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning☆15Apr 29, 2024Updated last year
- Restore safety in fine-tuned language models through task arithmetic☆32Mar 28, 2024Updated last year
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs.☆57Oct 30, 2025Updated 4 months ago
- code associated with ACL 2021 DExperts paper☆118May 24, 2023Updated 2 years ago
- ☆105Jan 6, 2025Updated last year
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model☆572Jan 28, 2025Updated last year
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆33Mar 5, 2024Updated 2 years ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆33May 9, 2024Updated last year
- Group-conditional DRO to alleviate spurious correlations☆15Jul 15, 2021Updated 4 years ago
- Uncertainty quantification for in-context learning of large language models☆15Apr 1, 2024Updated last year
- ☆98Jun 27, 2024Updated last year
- A library for making RepE control vectors☆691Sep 24, 2025Updated 5 months ago
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆42Mar 23, 2023Updated 2 years ago
- Code for the COLING 2022 paper "DoubleMix: Simple Interpolation-Based Data Augmentation for Text Classification"☆19Oct 19, 2022Updated 3 years ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆85Mar 7, 2025Updated 11 months ago
- Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts…☆199Jul 23, 2024Updated last year
- SAIL: Search Augmented Instruction Learning☆159Jul 22, 2025Updated 7 months ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Jan 27, 2023Updated 3 years ago
- This repository contains the implementation of the paper -- KNOT: Knowledge Distillation using Optimal Transport for Solving NLP Tasks☆15Sep 15, 2022Updated 3 years ago
- GUI for selecting text files for concatenation and submission to LLMs☆181Nov 19, 2025Updated 3 months ago
- Serving multiple LoRA finetuned LLM as one☆1,144May 8, 2024Updated last year
- ☆43Sep 3, 2024Updated last year
- Algebraic value editing in pretrained language models☆69Nov 1, 2023Updated 2 years ago
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning☆665Jun 1, 2024Updated last year
- ☆42Nov 7, 2023Updated 2 years ago
- ☆41Jun 19, 2024Updated last year
- Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript☆614Jul 2, 2024Updated last year
- Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".☆22Sep 7, 2023Updated 2 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆137Aug 2, 2023Updated 2 years ago
- Code for Zero-Shot Tokenizer Transfer☆143Jan 14, 2025Updated last year
- Official repository for ORPO☆471May 31, 2024Updated last year
- simple ansible playbook to take clean ubuntu 18.04 to CUDA 10, PyTorch 1.0, fastai, miniconda heaven☆12Dec 16, 2018Updated 7 years ago
- Official code for the paper: Invertible Neural Network for Graph Prediction☆10Mar 27, 2023Updated 2 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year