ThomasScialom / T0_continual_learningView external linksLinks
Adding new tasks to T0 without catastrophic forgetting
☆33Oct 20, 2022Updated 3 years ago
Alternatives and similar repositories for T0_continual_learning
Users that are interested in T0_continual_learning are comparing it to the libraries listed below
Sorting:
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- ☆11Sep 19, 2025Updated 4 months ago
- Source code for a LoRA-based continual relation extraction method.☆14Sep 25, 2023Updated 2 years ago
- Code for paper 'Data-Efficient FineTuning'☆28May 24, 2023Updated 2 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Sep 12, 2024Updated last year
- Korean Abstract Meaning Representation (AMR) Corpus☆10Feb 27, 2022Updated 3 years ago
- [Konvens21] This repository contains the DFKI MobIE Corpus, a dataset of 3,232 German-language documents that have been annotated with fi…☆12Sep 17, 2024Updated last year
- ☆10Feb 6, 2025Updated last year
- This is the implementation of Online Task-Free Continual Generative and Discriminative Learning via Dynamic Cluster Memory☆14Jul 28, 2024Updated last year
- An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi☆273Apr 15, 2023Updated 2 years ago
- Code for the ACL 2022 paper "Continual Sequence Generation with Adaptive Compositional Modules"☆39Apr 4, 2022Updated 3 years ago
- Code for evaluating uncertainty estimation methods for Transformer-based architectures in natural language understanding tasks.☆43Aug 16, 2021Updated 4 years ago
- ☆18Feb 28, 2022Updated 3 years ago
- ☆16Apr 9, 2021Updated 4 years ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables☆21May 18, 2025Updated 8 months ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- some common Huggingface transformers in maximal update parametrization (µP)☆87Mar 14, 2022Updated 3 years ago
- Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints☆38Mar 21, 2021Updated 4 years ago
- Multi-Hop Logical Reasoning in Knowledge Graphs☆21Mar 27, 2022Updated 3 years ago
- Apply Iprompt on GLM with innovative new methods. Currently support Chinese QA, English QA and Chinese poem generation.☆20Jun 16, 2022Updated 3 years ago
- Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE☆19Sep 22, 2021Updated 4 years ago
- [ICML 2023] Parameter-Level Soft-Masking for Continual Learning☆19Jul 13, 2023Updated 2 years ago
- Official code and dataset repository of KoBBQ (TACL 2024)☆19May 13, 2024Updated last year
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆44Dec 25, 2022Updated 3 years ago
- ☆177Jul 24, 2024Updated last year
- Language-agnostic BERT Sentence Embedding (LaBSE) Pytorch Model☆21Sep 2, 2020Updated 5 years ago
- Official Code Repository for the paper "Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation …☆20Jun 19, 2023Updated 2 years ago
- official repository for ListT5☆48Nov 27, 2025Updated 2 months ago
- Code for the paper "True Few-Shot Learning in Language Models" (https://arxiv.org/abs/2105.11447)☆143Oct 25, 2021Updated 4 years ago
- ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “E…☆269Mar 5, 2023Updated 2 years ago
- A lightweight, user-friendly data-plane for LLM training.☆38Sep 10, 2025Updated 5 months ago
- Code for: "Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models"☆20Feb 2, 2022Updated 4 years ago
- PyTorch implementation of NMT models along with custom tokenizers, models, and datasets☆21Aug 1, 2022Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Feb 9, 2023Updated 3 years ago
- Multi-task modelling extensions for huggingface transformers☆21Mar 3, 2023Updated 2 years ago
- 🌏 Modular retrievers for zero-shot multilingual IR.☆30Mar 6, 2024Updated last year
- ☆24Dec 2, 2023Updated 2 years ago
- [NAACL 2022] Contrastive Learning for Prompt-based Few-shot Language Learners☆22Jan 26, 2023Updated 3 years ago