Adding new tasks to T0 without catastrophic forgetting
☆33Oct 20, 2022Updated 3 years ago
Alternatives and similar repositories for T0_continual_learning
Users that are interested in T0_continual_learning are comparing it to the libraries listed below
Sorting:
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- ☆11Sep 19, 2025Updated 5 months ago
- Source code for a LoRA-based continual relation extraction method.☆14Sep 25, 2023Updated 2 years ago
- Code for paper 'Data-Efficient FineTuning'☆28May 24, 2023Updated 2 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- Entailment self-training☆27May 30, 2023Updated 2 years ago
- Korean Abstract Meaning Representation (AMR) Corpus☆10Feb 27, 2022Updated 4 years ago
- [Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fi…☆12Sep 17, 2024Updated last year
- Code for the paper: Rehearsal-free Continual Language Learning via Efficient Parameter Isolation☆12May 16, 2023Updated 2 years ago
- This is the implementation of Online Task-Free Continual Generative and Discriminative Learning via Dynamic Cluster Memory☆14Jul 28, 2024Updated last year
- code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D…☆12Sep 16, 2022Updated 3 years ago
- ☆15Dec 3, 2024Updated last year
- Convert BART models to ONNX with quantization. 3X reduction in size, and upto 3X boost in inference speed☆33Dec 11, 2024Updated last year
- Unifew: Unified Fewshot Learning Model☆18Sep 10, 2021Updated 4 years ago
- An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi☆273Apr 15, 2023Updated 2 years ago
- Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive Compositional Modules"☆39Apr 4, 2022Updated 3 years ago
- A repository for organizing our submission to the MEDIQA-Chat Tasks @ ACL-ClinicalNLP 2023☆22Jul 21, 2023Updated 2 years ago
- ☆49Oct 10, 2023Updated 2 years ago
- Source code for "Revisiting Unsupervised Relation Extraction" in ACL 2020☆36Jun 20, 2023Updated 2 years ago
- Code for evaluating uncertainty estimation methods for Transformer-based architectures in natural language understanding tasks.☆44Aug 16, 2021Updated 4 years ago
- Apply Iprompt on GLM with innovative new methods. Currently support Chinese QA, English QA and Chinese poem generation.☆20Jun 16, 2022Updated 3 years ago
- ☆16Apr 9, 2021Updated 4 years ago
- ☆18Feb 28, 2022Updated 4 years ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables☆21May 18, 2025Updated 9 months ago
- some common Huggingface transformers in maximal update parametrization (µP)☆87Mar 14, 2022Updated 3 years ago
- Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints☆38Mar 21, 2021Updated 4 years ago
- Official code and dataset repository of KoBBQ (TACL 2024)☆19May 13, 2024Updated last year
- Code for the paper "Mehta, S. V., Patil, D., Chandar, S., & Strubell, E. (2023). An Empirical Investigation of the Role of Pre-training i…☆17Mar 18, 2024Updated last year
- Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE☆19Sep 22, 2021Updated 4 years ago
- [ICML 2023] Parameter-Level Soft-Masking for Continual Learning☆19Jul 13, 2023Updated 2 years ago
- Multi-Hop Logical Reasoning in Knowledge Graphs☆21Mar 27, 2022Updated 3 years ago
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆44Dec 25, 2022Updated 3 years ago
- ☆177Jul 24, 2024Updated last year
- Code repository for the paper "MrT5: Dynamic Token Merging for Efficient Byte-level Language Models."☆54Sep 25, 2025Updated 5 months ago
- Official Code Repository for the paper "Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation …☆20Jun 19, 2023Updated 2 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE) Pytorch Model☆21Sep 2, 2020Updated 5 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆189Aug 17, 2021Updated 4 years ago
- official repository for ListT5☆48Nov 27, 2025Updated 3 months ago