llyx97 / RositaLinks
[AAAI 2021] "ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques", Yuanxin Liu, Zheng Lin, Fengcheng Yuan
☆14Updated 2 years ago
Alternatives and similar repositories for Rosita
Users that are interested in Rosita are comparing it to the libraries listed below
Sorting:
- [NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…☆15Updated 2 years ago
- Code for Editing Factual Knowledge in Language Models☆139Updated 3 years ago
- Pytorch implementation of paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021)☆73Updated 3 years ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆22Updated 11 months ago
- DEMix Layers for Modular Language Modeling☆53Updated 4 years ago
- An Interpretable Neuro-Symbolic Framework for Task-Oriented Dialogue Generation☆24Updated 3 years ago
- TBC☆27Updated 2 years ago
- ☆45Updated 3 years ago
- This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆102Updated 2 years ago
- ☆54Updated 2 years ago
- Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃☆114Updated 2 years ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Updated 2 years ago
- ☆21Updated last year
- ☆45Updated 4 years ago
- ☆87Updated 2 years ago
- Code for the ACL-2022 paper "StableMoE: Stable Routing Strategy for Mixture of Experts"☆48Updated 3 years ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Updated 2 years ago
- ☆53Updated last year
- ☆117Updated 3 years ago
- [NeurIPS 2022] Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings☆22Updated 2 years ago
- This project maintains a reading list for general text generation tasks☆66Updated 3 years ago
- An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification"☆131Updated 3 years ago
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…☆26Updated last year
- ☆33Updated 2 years ago
- Code base of In-Context Learning for Dialogue State tracking☆45Updated last year
- [EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning"☆35Updated 2 years ago
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆197Updated 2 years ago
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"☆77Updated 2 years ago
- ☆51Updated 2 years ago
- Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning☆100Updated 2 years ago