takase / share_layer_params
☆28Updated 3 years ago
Alternatives and similar repositories for share_layer_params
Users that are interested in share_layer_params are comparing it to the libraries listed below
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."☆141Updated 2 years ago
- [EMNLP'21] Mirror-BERT: Converting Pretrained Language Models to universal text encoders without labels.☆78Updated 2 years ago
- The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach …☆63Updated 4 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆117Updated 2 years ago
- Source codes of Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction☆43Updated 4 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆121Updated 4 years ago
- [EACL'21] Non-Autoregressive with Pretrained Language Model☆62Updated 2 years ago
- Zero -- A neural machine translation system☆153Updated 2 years ago
- ☆37Updated 4 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning☆93Updated 3 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- source code for paper: WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach.☆56Updated 4 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆134Updated 2 years ago
- Official code for LEWIS, from: "LEWIS: Levenshtein Editing for Unsupervised Text Style Transfer", ACL-IJCNLP 2021 Findings by Machel Rei…☆31Updated 2 years ago
- ☆57Updated 3 years ago
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆208Updated last year
- Stronger Baselines for Grammatical Error Correction Using a Pretrained Encoder-Decoder Model.☆37Updated 2 years ago
- This repo supports various cross-lingual transfer learning & multilingual NLP models.☆92Updated last year
- PyTorch implementation for Seq2Seq model with attention and Greedy Search / Beam Search for neural machine translation☆58Updated 4 years ago
- Code for our EACL-2021 paper "Generating Syntactically Controlled Paraphrases without Using Annotated Parallel Pairs".☆38Updated last year
- A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a…☆244Updated 3 years ago
- Rethinking Perturbations in Encoder-Decoders for Fast Training☆18Updated 3 years ago
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models☆24Updated 3 years ago
- Thank you BART! Rewarding Pre-Trained Models Improves Formality Style Transfer (ACL 2021)☆30Updated 2 years ago
- Pre-training BART in Flax on The Pile dataset☆22Updated 4 years ago
- The official code of the "Frustratingly Easy System Combination for Grammatical Error Correction" paper☆56Updated last year
- This repository contains the code for the paper "Prompting ELECTRA: Few-Shot Learning with Discriminative Pre-Trained Models".☆48Updated 3 years ago
- Beam search decoding with Pytorch☆54Updated last year
- Code base for "G-Transformer for Document-level Machine Translation"☆45Updated 2 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation☆119Updated 2 years ago