☆24Feb 7, 2021Updated 5 years ago
Alternatives and similar repositories for Switch-Transformers-in-Seq2Seq
Users that are interested in Switch-Transformers-in-Seq2Seq are comparing it to the libraries listed below
Sorting:
- A utility that downloads your Stable Diffusion images from discord and lets you preview them with Streamlit☆16Aug 19, 2022Updated 3 years ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆34Aug 14, 2024Updated last year
- Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆13Sep 18, 2020Updated 5 years ago
- ☆14Oct 28, 2023Updated 2 years ago
- Codes for our paper "Enhancing Continual Relation Extraction via Classifier Decomposition" (Findings of ACL2023)☆10Nov 29, 2023Updated 2 years ago
- Official codebase for “In-Context Learning with Many Demonstration Examples”☆16Feb 13, 2023Updated 3 years ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 3 years ago
- MATCH-TUNING☆15Aug 6, 2022Updated 3 years ago
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.☆16Feb 18, 2022Updated 4 years ago
- EMNLP2022: Learning Robust Representations for Continual Relation Extraction via Adversarial Class Augmentation☆15Oct 19, 2022Updated 3 years ago
- ☆13Sep 8, 2020Updated 5 years ago
- Implementation of the paper by Google, Transformer Memory As A Differentiable Search Index☆16May 27, 2022Updated 3 years ago
- Official Repository for Can Language Models be Instructed to Protect Personal Information?☆13Oct 8, 2023Updated 2 years ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆18Mar 7, 2025Updated last year
- StyleGAN2 - Official TensorFlow Implementation with practical improvements☆11Apr 17, 2020Updated 5 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Mar 1, 2017Updated 9 years ago
- ☆10Aug 28, 2020Updated 5 years ago
- DDRel: A new dataset for interpersonal relation classification in dyadic dialogues☆23Sep 12, 2021Updated 4 years ago
- A Python implementation of Toolformer using Huggingface Transformers☆14Mar 20, 2023Updated 3 years ago
- The projects lets you extract glossary words and their definitions from a given piece of text automatically using NLP techniques☆30Jul 23, 2020Updated 5 years ago
- Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"☆15Jan 15, 2023Updated 3 years ago
- Code for NAACL2022 Long Paper "An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling"☆28Nov 9, 2022Updated 3 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Sep 13, 2024Updated last year
- ☆26Apr 15, 2021Updated 4 years ago
- ☆13Aug 26, 2024Updated last year
- StyleGAN2 - Official TensorFlow Implementation☆12Jul 15, 2020Updated 5 years ago
- ☆17Apr 23, 2025Updated 10 months ago
- Benchmark API for Multidomain Language Modeling☆25Aug 26, 2022Updated 3 years ago
- ☆17Aug 2, 2023Updated 2 years ago
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- Memory footprint reduction for transformer models☆11Jan 24, 2023Updated 3 years ago
- Method for evaluating system summaries manually, via crowdsourcing, using a summarization dataset that includes reference summaries.☆12May 5, 2019Updated 6 years ago
- Group Meeting Record for Baobao Chang Group in Peking University☆26May 17, 2021Updated 4 years ago
- Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》☆25Dec 15, 2021Updated 4 years ago
- Implementation of AAAI 2022 Paper: Go wider instead of deeper☆32Oct 27, 2022Updated 3 years ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆14Jun 21, 2024Updated last year
- ☆19Feb 1, 2021Updated 5 years ago
- ☆58Updated this week
- Source code for paper "A Two-Stage Method for Chinese AMR Parsing" @ CAMRP-2022 & CCL-2022☆24Aug 29, 2024Updated last year