☆34 Aug 5, 2023 Updated 2 years ago
Alternatives and similar repositories for Transformer-Patcher
Users that are interested in Transformer-Patcher are comparing it to the libraries listed below.
- Code for Editing Factual Knowledge in Language Models ☆142 Jan 28, 2022 Updated 4 years ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases. ☆24 Sep 4, 2024 Updated last year
- Mass-editing thousands of facts into a transformer memory (ICLR 2023) ☆548 Jan 31, 2024 Updated 2 years ago
- MEND: Fast Model Editing at Scale ☆259 Aug 30, 2023 Updated 2 years ago
- ☆14 Feb 12, 2024 Updated 2 years ago
- ☆37 Jan 26, 2024 Updated 2 years ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning. ☆25 Oct 7, 2025 Updated 7 months ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model ☆72 Nov 1, 2022 Updated 3 years ago
- [Findings of EMNLP 2022] Code of paper Generative Prompt Tuning for Relation Classification. https://arxiv.org/abs/2210.12435 ☆20 May 7, 2023 Updated 3 years ago
- ☆18 Mar 3, 2025 Updated last year
- ☆17 Aug 2, 2023 Updated 2 years ago
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023). ☆27 Aug 25, 2024 Updated last year
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors ☆85 Dec 21, 2024 Updated last year
- Evaluating the Ripple Effects of Knowledge Editing in Language Models ☆57 Apr 15, 2024 Updated 2 years ago
- ☆17 Nov 7, 2023 Updated 2 years ago
- [AAAI 2024] MELO: Enhancing Model Editing with Neuron-indexed Dynamic LoRA ☆28 Apr 9, 2024 Updated 2 years ago
- ☆13 Nov 15, 2022 Updated 3 years ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions ☆124 Sep 12, 2024 Updated last year
- Data creation, training and eval scripts for the IRCoder paper ☆21 May 31, 2024 Updated last year
- ☆35 Jun 13, 2025 Updated 11 months ago
- PhyX: Does Your Model Have the "Wits" for Physical Reasoning? ☆52 Mar 16, 2026 Updated 2 months ago
- [NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers ☆39 Oct 20, 2022 Updated 3 years ago
- Interpretable unified language safety checking with large language models ☆32 Apr 15, 2023 Updated 3 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆16 Jun 11, 2025 Updated 11 months ago
- [NLPCC 2024] Shared Task 10: Regulating Large Language Models ☆14 Jun 12, 2024 Updated last year
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey" ☆59 Aug 24, 2023 Updated 2 years ago
- Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool… ☆13 Mar 24, 2024 Updated 2 years ago
- ☆41 Nov 30, 2023 Updated 2 years ago
- ☆32 Oct 17, 2022 Updated 3 years ago
- ☆20 May 30, 2024 Updated last year
- [Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models ☆20 Oct 2, 2024 Updated last year
- [AAAI 2025 oral] Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit ☆19 Apr 19, 2025 Updated last year
- PathPiece tokenizer ☆14 Nov 10, 2024 Updated last year
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation" ☆10 Aug 11, 2023 Updated 2 years ago
- [ACL 2023] Contextual Distortion Reveals Constituency: Mask Language Models are Implicit Parsers. ☆14 Jun 3, 2023 Updated 2 years ago
- Must-read Papers on Knowledge Editing for Large Language Models. ☆1,231 Jul 12, 2025 Updated 10 months ago
- This is a repository for "PMET: Precise Model Editing in a Transformer" ☆57 Sep 28, 2023 Updated 2 years ago
- ☆18 Dec 12, 2025 Updated 5 months ago
- Unzipped client files ☆11 Mar 8, 2020 Updated 6 years ago