[ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models
☆23Jun 13, 2024Updated 2 years ago
Alternatives and similar repositories for PitfallsKnowledgeEditing
Users that are interested in PitfallsKnowledgeEditing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Exploring Model Kinship for Merging Large Language Models☆29Apr 16, 2025Updated last year
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆43Jan 18, 2026Updated 5 months ago
- How do transformer LMs encode relations?☆57Feb 24, 2024Updated 2 years ago
- [JMLR] Gradual Domain Adaptation: Theory and Algorithms☆11Jan 14, 2025Updated last year
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆32Feb 18, 2026Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Reproduction Code for Paper "Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models"☆14Jun 1, 2024Updated 2 years ago
- ☆11Feb 3, 2025Updated last year
- Collection of Reverse Engineering in Large Model☆35Jan 8, 2025Updated last year
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆172Nov 14, 2025Updated 7 months ago
- [SIGIR 2023] Schema-aware Reference as Prompt Improves Data-Efficient Knowledge Graph Construction☆43Apr 5, 2023Updated 3 years ago
- [EMNLP 2025] Circuit-Aware Editing Enables Generalizable Knowledge Learners☆20Nov 17, 2025Updated 7 months ago
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆56Dec 7, 2025Updated 6 months ago
- EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.☆135Dec 12, 2023Updated 2 years ago
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate provides☆16Apr 15, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Pytorch Implementation of "Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling", AAA…☆25Updated this week
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [ICLR 2025]☆30Feb 20, 2026Updated 4 months ago
- The code for paper Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models.☆13Apr 10, 2024Updated 2 years ago
- Globally Consistent Probabilistic Human Motion Estimation☆23Feb 28, 2023Updated 3 years ago
- Repository for our paper "DeepEdit: Knowledge Editing as Decoding with Constraints". https://arxiv.org/abs/2401.10471☆21Jun 19, 2024Updated 2 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated 2 years ago
- [TMLR'26] UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models☆54May 17, 2026Updated last month
- [ ICLR 2025 ] Making LLMs More Effective with Hierarchical Mixture of LoRA Experts☆32Oct 9, 2025Updated 8 months ago
- Must-read Papers on Knowledge Editing for Large Language Models.☆1,235Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [AAAI 2024] MELO: Enhancing Model Editing with Neuron-indexed Dynamic LoRA☆28Apr 9, 2024Updated 2 years ago
- ☆13Sep 8, 2024Updated last year
- A web application for playing 20 Questions to crowdsource common sense. 🤖☆17Sep 29, 2022Updated 3 years ago
- PyTorch implementation of "Towards Impartial Multi-Task Learning"☆13Apr 12, 2024Updated 2 years ago
- ☆25Sep 19, 2023Updated 2 years ago
- [EMNLP2022] Released code for paper "Distilling Causal Effect from Miscellaneous Other-Class for Continual Named Entity Recognition"☆22Feb 9, 2023Updated 3 years ago
- A Student-Course-Manage-Info-System. 一个学生选课管理信息系统。☆11Feb 7, 2021Updated 5 years ago
- ☆11Nov 19, 2024Updated last year
- ☆14Feb 12, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆20Sep 24, 2022Updated 3 years ago
- ☆23Apr 12, 2022Updated 4 years ago
- Code for CVPR2018 "Iterative Learning with Open-set Noisy Labels"☆12Mar 12, 2021Updated 5 years ago
- [ICML 2022 Spotlight] Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks☆11May 21, 2023Updated 3 years ago
- ☆37Feb 11, 2025Updated last year
- 通过实验对比LLM推理中Prefill和Decoding阶段的吞吐量差异,揭示性能瓶颈,解释PD分离优化技术的原理。包含CUDA和Apple MPS (M系列芯片) 的测试脚本。☆22May 22, 2025Updated last year
- ☆52Jan 1, 2024Updated 2 years ago