[ICLR 2022] Towards Continual Knowledge Learning of Language Models
☆92Oct 11, 2022Updated 3 years ago
Alternatives and similar repositories for continual-knowledge-learning
Users who are interested in continual-knowledge-learning are comparing it to the repositories listed below
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models☆74May 15, 2024Updated last year
- ☆13Apr 24, 2022Updated 3 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 2 years ago
- Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)☆10Oct 16, 2024Updated last year
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence☆22Nov 19, 2025Updated 3 months ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Sep 13, 2024Updated last year
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- ☆12Apr 24, 2024Updated last year
- [TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis☆11Nov 14, 2024Updated last year
- ☆100Apr 11, 2025Updated 10 months ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Sep 12, 2024Updated last year
- Code for evaluating uncertainty estimation methods for Transformer-based architectures in natural language understanding tasks.☆43Aug 16, 2021Updated 4 years ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆116Jun 28, 2025Updated 8 months ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- ☆16Oct 6, 2022Updated 3 years ago
- Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints☆38Mar 21, 2021Updated 4 years ago
- Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.☆31Jun 12, 2023Updated 2 years ago
- The official implementation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆44Dec 25, 2022Updated 3 years ago
- [EMNLP 2022] Continual Training of Language Models for Few-Shot Learning☆45Feb 13, 2023Updated 3 years ago
- Character-level Korean ELECTRA Model (음절 단위 한국어 ELECTRA)☆54Jun 12, 2023Updated 2 years ago
- ☆32Apr 14, 2022Updated 3 years ago
- ☆32Mar 31, 2023Updated 2 years ago
- Knowledge Infused Decoding☆71Dec 31, 2023Updated 2 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Oct 20, 2022Updated 3 years ago
- [EMNLP 2021] Efficient Contrastive Learning via Novel Data Augmentation and Curriculum Learning☆17Jun 28, 2025Updated 8 months ago
- Code to reproduce the experiments of "Rethinking Experience Replay: a Bag of Tricks for Continual Learning"☆52Feb 16, 2023Updated 3 years ago
- [EMNLP 2022] Adapting a Language Model While Preserving its General Knowledge☆21Feb 12, 2023Updated 3 years ago
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models☆86Sep 12, 2024Updated last year
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ACL-2022☆18May 19, 2022Updated 3 years ago
- 🦅 Pretrained BigBird Model for Korean (up to 4096 tokens)☆201Dec 28, 2023Updated 2 years ago
- Code for Editing Factual Knowledge in Language Models☆142Jan 28, 2022Updated 4 years ago
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆83Dec 21, 2024Updated last year
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification☆42Apr 29, 2023Updated 2 years ago
- [COLING 2022] Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization☆25Mar 28, 2024Updated last year
- Code for: "Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models"☆20Feb 2, 2022Updated 4 years ago
- [NeurIPS 2022 Workshop] A Case Study with Negated Prompts using T0 (3B, 11B), InstructGPT (350M-175B), GPT-3 (350M - 175B) & OPT (125M - …☆24Sep 27, 2022Updated 3 years ago
- ☆24Dec 2, 2023Updated 2 years ago
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"☆23Oct 11, 2024Updated last year