xpq-tech / PMET
This is a repository for "PMET: Precise Model Editing in a Transformer"
☆50Updated last year
Alternatives and similar repositories for PMET:
Users that are interested in PMET are comparing it to the libraries listed below
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆130Updated 6 months ago
- Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)☆87Updated last year
- Inspecting and Editing Knowledge Representations in Language Models☆115Updated last year
- Self-Alignment with Principle-Following Reward Models☆157Updated last year
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆73Updated 10 months ago
- Replicating O1 inference-time scaling laws☆83Updated 4 months ago
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆76Updated last year
- [NeurIPS'24 Spotlight] Observational Scaling Laws☆54Updated 6 months ago
- ☆60Updated 11 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- Code repository for the c-BTM paper☆106Updated last year
- PASTA: Post-hoc Attention Steering for LLMs☆113Updated 4 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆85Updated last year
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆66Updated 2 years ago
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)☆63Updated last year
- ☆37Updated last year
- ☆73Updated 11 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆80Updated 7 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆82Updated last year
- contrastive decoding☆198Updated 2 years ago
- Code for ACL2023 paper: Pre-Training to Learn in Context☆108Updated 8 months ago
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆63Updated 4 months ago
- ☆82Updated 8 months ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023☆133Updated 11 months ago
- Easy control for Key-Value Constrained Generative LLM Inference(https://arxiv.org/abs/2402.06262)☆61Updated last year
- Sparse Backpropagation for Mixture-of-Expert Training☆29Updated 9 months ago
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"☆30Updated 9 months ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆136Updated last month
- SILO Language Models code repository☆81Updated last year
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆72Updated last year