takase / share_layer_paramsLinks
☆28Updated 3 years ago
Alternatives and similar repositories for share_layer_params
Users that are interested in share_layer_params are comparing it to the libraries listed below
Sorting:
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."☆140Updated 2 years ago
- The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach …☆63Updated 4 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆117Updated last year
- [EMNLP'21] Mirror-BERT: Converting Pretrained Language Models to universal text encoders without labels.☆78Updated 2 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆133Updated 2 years ago
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models☆24Updated 3 years ago
- This repo supports various cross-lingual transfer learning & multilingual NLP models.☆92Updated last year
- Improved version of GECToR☆59Updated last year
- Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021☆96Updated 4 years ago
- ☆37Updated 4 years ago
- Official code for LEWIS, from: "LEWIS: Levenshtein Editing for Unsupervised Text Style Transfer", ACL-IJCNLP 2021 Findings by Machel Rei…☆31Updated 2 years ago
- [EACL'21] Non-Autoregressive with Pretrained Language Model☆62Updated 2 years ago
- Source codes for the paper "Multi-View Sequence-to-Sequence Models with Conversational Structure for Abstractive Dialogue Summarization"☆90Updated last year
- Code for the ACL 2022 paper "Contextual Representation Learning beyond Masked Language Modeling"☆33Updated 2 years ago
- Zero -- A neural machine translation system☆152Updated 2 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning☆93Updated 3 years ago
- ☆92Updated 3 years ago
- EMNLP 2021 - CTC: A Unified Framework for Evaluating Natural Language Generation☆97Updated 2 years ago
- ☆177Updated 3 years ago
- source code for paper: WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach.☆55Updated 4 years ago
- Code for NeurIPS2020 "Incorporating BERT into Parallel Sequence Decoding with Adapters"☆32Updated 2 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆120Updated 4 years ago
- Pre-training BART in Flax on The Pile dataset☆21Updated 3 years ago
- Source codes of Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction☆43Updated 4 years ago
- DisCo Transformer for Non-autoregressive MT☆77Updated 2 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆208Updated last year
- ☆66Updated 3 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆91Updated 3 years ago
- This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.☆48Updated 3 years ago
- Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer (ACL 2021)☆30Updated 2 years ago