☆34Aug 5, 2023Updated 2 years ago
Alternatives and similar repositories for Transformer-Patcher
Users that are interested in Transformer-Patcher are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Editing Factual Knowledge in Language Models☆142Jan 28, 2022Updated 4 years ago
- Mass-editing thousands of facts into a transformer memory (ICLR 2023)☆549Jan 31, 2024Updated 2 years ago
- ☆14Feb 12, 2024Updated 2 years ago
- ☆37Jan 26, 2024Updated 2 years ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆72Nov 1, 2022Updated 3 years ago
- [Findings of EMNLP 2022] Code of paper Generative Prompt Tuning for Relation Classification. https://arxiv.org/abs/2210.12435☆20May 7, 2023Updated 3 years ago
- ☆29Jul 16, 2024Updated last year
- ☆18Mar 3, 2025Updated last year
- ☆17Aug 2, 2023Updated 2 years ago
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆27Aug 25, 2024Updated last year
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆85Dec 21, 2024Updated last year
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆58Apr 15, 2024Updated 2 years ago
- ☆17Nov 7, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [AAAI 2024] MELO: Enhancing Model Editing with Neuron-indexed Dynamic LoRA☆28Apr 9, 2024Updated 2 years ago
- ☆14Nov 15, 2022Updated 3 years ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆125Sep 12, 2024Updated last year
- ☆35Jun 13, 2025Updated last year
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆21Jul 9, 2023Updated 2 years ago
- PhyX: Does Your Model Have the "Wits" for Physical Reasoning?☆52Mar 16, 2026Updated 2 months ago
- [NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers☆39Oct 20, 2022Updated 3 years ago
- Interpretable unified language safety checking with large language models☆32Apr 15, 2023Updated 3 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Jun 11, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"☆58Aug 24, 2023Updated 2 years ago
- Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool…☆13Mar 24, 2024Updated 2 years ago
- ☆41Nov 30, 2023Updated 2 years ago
- ☆32Oct 17, 2022Updated 3 years ago
- ☆20May 30, 2024Updated 2 years ago
- [Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models☆20Oct 2, 2024Updated last year
- [AAAI 2025 oral] Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit☆19Apr 19, 2025Updated last year
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Aug 11, 2023Updated 2 years ago
- [ACL 2023] Contextual Distortion Reveals Constituency: Mask Language Models are Implicit Parsers.☆14Jun 3, 2023Updated 3 years ago
- Must-read Papers on Knowledge Editing for Large Language Models.☆1,231Jul 12, 2025Updated 11 months ago
- ☆18Dec 12, 2025Updated 6 months ago
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆52Nov 17, 2024Updated last year
- Unzipped client files☆11Mar 8, 2020Updated 6 years ago
- Third Person Shooter for Unity☆12Jun 26, 2022Updated 3 years ago