[NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers
☆38Oct 20, 2022Updated 3 years ago
Alternatives and similar repositories for Kformer
Users that are interested in Kformer are comparing it to the libraries listed below
Sorting:
- Complexity Based Prompting for Multi-Step Reasoning☆17Mar 10, 2023Updated 2 years ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆25Jul 21, 2025Updated 7 months ago
- The 4th rank system of the SemEval 2021 Task4.☆10May 7, 2022Updated 3 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 9 months ago
- ☆12Nov 15, 2022Updated 3 years ago
- ☆32Oct 17, 2022Updated 3 years ago
- Code for the ACL-2022 paper "Knowledge Neurons in Pretrained Transformers"☆173May 4, 2024Updated last year
- NaturalProver: Grounded Mathematical Proof Generation with Language Models☆39Mar 24, 2023Updated 2 years ago
- MEND: Fast Model Editing at Scale☆257Aug 30, 2023Updated 2 years ago
- Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training"☆20Nov 12, 2021Updated 4 years ago
- ☆20Sep 28, 2021Updated 4 years ago
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…☆61May 9, 2023Updated 2 years ago
- BERT-based Biomedical Text Summarizer☆23Oct 2, 2019Updated 6 years ago
- [NLPCC 2023] Reasoning Through Memorization: Nearest Neighbor Knowledge Graph Embeddings with Language Models☆55Jul 31, 2023Updated 2 years ago
- cpp write language detect model☆11Sep 22, 2021Updated 4 years ago
- Repository for "Propagating Knowledge Updates to LMs Through Distillation" (NeurIPS 2023).☆26Aug 25, 2024Updated last year
- ☆68May 18, 2023Updated 2 years ago
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆31Dec 6, 2023Updated 2 years ago
- ☆32Apr 14, 2022Updated 3 years ago
- ☆28Feb 27, 2025Updated last year
- ☆33Aug 5, 2023Updated 2 years ago
- ☆143Jul 21, 2024Updated last year
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆34Nov 21, 2021Updated 4 years ago
- [ACL 2021] MLBiNet: A Cross-Sentence Collective Event Detection Network☆35Jan 10, 2022Updated 4 years ago
- [SIGIR 2023] Schema-aware Reference as Prompt Improves Data-Efficient Knowledge Graph Construction☆42Apr 5, 2023Updated 2 years ago
- Official repository of "SeGA: Preference-Aware Self-Contrastive Learning with Prompts for Anomalous User Detection on Twitter" @ AAAI 202…☆11Nov 30, 2024Updated last year
- Collection of Reverse Engineering in Large Model☆36Jan 8, 2025Updated last year
- ☆39Apr 29, 2023Updated 2 years ago
- Code for Editing Factual Knowledge in Language Models☆142Jan 28, 2022Updated 4 years ago
- Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Tra…☆33Sep 15, 2021Updated 4 years ago
- A library for finding knowledge neurons in pretrained transformer models.☆159Feb 13, 2022Updated 4 years ago
- ☆10Jul 6, 2023Updated 2 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- This is the official implementation for MA-LoT.☆19Aug 4, 2025Updated 6 months ago
- ☆14Mar 7, 2025Updated 11 months ago
- ☆12Jul 4, 2024Updated last year
- Machine learning for molecules workshop 2022☆13Nov 30, 2022Updated 3 years ago
- ☆26Updated this week
- ☆20Aug 8, 2025Updated 6 months ago