☆34Aug 5, 2023Updated 2 years ago
Alternatives and similar repositories for Transformer-Patcher
Users that are interested in Transformer-Patcher are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Editing Factual Knowledge in Language Models☆142Jan 28, 2022Updated 4 years ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆24Sep 4, 2024Updated last year
- Mass-editing thousands of facts into a transformer memory (ICLR 2023)☆544Jan 31, 2024Updated 2 years ago
- ☆37Jan 26, 2024Updated 2 years ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆72Nov 1, 2022Updated 3 years ago
- [Findings of EMNLP 2022] Code of paper Generative Prompt Tuning for Relation Classification. https://arxiv.org/abs/2210.12435☆20May 7, 2023Updated 2 years ago
- ☆29Jul 16, 2024Updated last year
- ☆18Mar 3, 2025Updated last year
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆26Aug 25, 2024Updated last year
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆84Dec 21, 2024Updated last year
- ☆17Nov 7, 2023Updated 2 years ago
- [AAAI 2024] MELO: Enhancing Model Editing with Neuron-indexed Dynamic LoRA☆28Apr 9, 2024Updated 2 years ago
- ☆12Nov 15, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆124Sep 12, 2024Updated last year
- Data creation, training and eval scripts for the IRCoder paper☆21May 31, 2024Updated last year
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆21Jul 9, 2023Updated 2 years ago
- PhyX: Does Your Model Have the "Wits" for Physical Reasoning?☆52Mar 16, 2026Updated last month
- [NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers☆39Oct 20, 2022Updated 3 years ago
- Interpretable unified language safety checking with large language models☆32Apr 15, 2023Updated 3 years ago
- ParetoDrug☆11Sep 3, 2024Updated last year
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Jun 11, 2025Updated 10 months ago
- [NLPCC 2024] Shared Task 10: Regulating Large Language Models☆14Jun 12, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool…☆13Mar 24, 2024Updated 2 years ago
- ☆41Nov 30, 2023Updated 2 years ago
- ☆15Sep 27, 2024Updated last year
- ☆20May 30, 2024Updated last year
- [Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models☆20Oct 2, 2024Updated last year
- [AAAI 2025 oral] Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit☆19Apr 19, 2025Updated last year
- COMA: Efficient Structure-constrained Molecular Generation using Contractive and Margin losses☆18Oct 31, 2023Updated 2 years ago
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Aug 11, 2023Updated 2 years ago
- [ACL 2023] Contextual Distortion Reveals Constituency: Mask Language Models are Implicit Parsers.☆14Jun 3, 2023Updated 2 years ago
- Must-read Papers on Knowledge Editing for Large Language Models.☆1,230Jul 12, 2025Updated 9 months ago
- This is a repository for "PMET: Precise Model Editing in a Transformer"☆56Sep 28, 2023Updated 2 years ago
- ☆17Jan 10, 2024Updated 2 years ago
- ☆18Dec 12, 2025Updated 4 months ago
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆52Nov 17, 2024Updated last year