[EMNLP 2022] Adapting a Language Model While Preserving its General Knowledge
☆21Feb 12, 2023Updated 3 years ago
Alternatives and similar repositories for DGA
Users that are interested in DGA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [EMNLP 2022] Continual Training of Language Models for Few-Shot Learning☆44Feb 13, 2023Updated 3 years ago
- [ICML 2023] Parameter-Level Soft-Masking for Continual Learning☆19Jul 13, 2023Updated 2 years ago
- An Extensible Continual Learning Framework Focused on Language Models (LMs)☆290Jan 28, 2024Updated 2 years ago
- decontamination☆36Mar 4, 2026Updated 3 months ago
- PyContinual (An Easy and Extendible Framework for Continual Learning)☆324Jan 29, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ✌[ICLR 2024] Class Incremental Learning via Likelihood Ratio Based Task Prediction☆31Oct 29, 2024Updated last year
- Layerwise Relevance Visualization in Convolutional Text Graph Classifiers☆11Jun 2, 2021Updated 5 years ago
- Code for ACL2018 paper "Learn How to Actively Learn: An Imitation Learning Approach"☆10Mar 8, 2019Updated 7 years ago
- ☆13Apr 16, 2021Updated 5 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- This repository includes the masking vocabulary used in the ICLR 2021 spotlight PMI-Masking paper☆14Aug 9, 2021Updated 4 years ago
- [NeurIPS 2022]MorphTE: Injecting Morphology in Tensorized Embeddings☆17Oct 29, 2022Updated 3 years ago
- ☆10Oct 17, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021☆36May 8, 2021Updated 5 years ago
- Evaluate language models using multiple choice items☆13Mar 6, 2026Updated 3 months ago
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…☆27Nov 13, 2023Updated 2 years ago
- CNN and Contrastive Autoencoder (CAE) on EMNIST using Tensorflow☆10Oct 7, 2018Updated 7 years ago
- https://datahack.analyticsvidhya.com/contest/american-express-amexpert-2018/☆10Nov 29, 2018Updated 7 years ago
- Serial Contrastive Knowledge Distillation for Continual Few-shot Relation Extraction, Findings of ACL 2023☆14May 12, 2023Updated 3 years ago
- ☆10Sep 7, 2020Updated 5 years ago
- ☆16Mar 14, 2020Updated 6 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Jan 12, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14May 8, 2023Updated 3 years ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- 😜Constrative Learning of Sentence Embedding using LoRA (EECS487 final project)☆13Apr 19, 2023Updated 3 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆99Apr 26, 2023Updated 3 years ago
- ☆23Aug 7, 2020Updated 5 years ago
- RelEx - A simple framework for Relation Extraction built on AllenNLP☆15Jun 17, 2020Updated 6 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- 📖 UI/UX context detection engine☆12Jan 3, 2021Updated 5 years ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- WebRED is a large and diverse manually annotated dataset for extracting relationships from a variety of text found on the World Wide Web.☆22Mar 11, 2021Updated 5 years ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆24Sep 4, 2024Updated last year
- Winners' solution approach and code for WNS Analytics Wizard 2019☆11Jul 6, 2023Updated 2 years ago
- Implementation of Nested Named Entity Recognition using Flair☆24Oct 29, 2021Updated 4 years ago
- Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)☆12Oct 11, 2023Updated 2 years ago
- Code that accompanies the PyData New York (2022) talk: Addressing the sensitivity of Large language models☆13Nov 7, 2022Updated 3 years ago
- A unified versatile interface for dialogue datasets☆19Dec 9, 2023Updated 2 years ago