bytedance / ParaGen
ParaGen is a PyTorch deep learning framework for parallel sequence generation.
☆186Updated 2 years ago
Alternatives and similar repositories for ParaGen
Users who are interested in ParaGen are comparing it to the libraries listed below.
- ☆167Updated 3 years ago
- Implementation of "Glancing Transformer for Non-Autoregressive Neural Machine Translation"☆137Updated 2 years ago
- ☆120Updated 3 years ago
- Introduction to CPM☆165Updated 3 years ago
- MD5 links for a Chinese book corpus☆218Updated last year
- A unified tokenization tool for Images, Chinese and English.☆152Updated 2 years ago
- Upgraded version of RoFormer☆152Updated 2 years ago
- Code, Data and Demo for Paper: Controllable Generation from Pre-trained Language Models via Inverse Prompting☆123Updated 2 years ago
- Code for CPM-2 Pre-Train☆158Updated 2 years ago
- Large-scale Chinese corpus☆41Updated 5 years ago
- ☆53Updated 3 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Updated last year
- A PyTorch-based model pruning toolkit for pre-trained language models☆385Updated last year
- Pretrain CPM-1☆51Updated 4 years ago
- This is a code repository for the ACL 2022 paper "Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Tra…☆52Updated 3 years ago
- Zero-shot learning evaluation benchmark, Chinese version☆56Updated 3 years ago
- NTK scaled version of ALiBi position encoding in Transformer.☆68Updated last year
- ☆172Updated 2 years ago
- Finetune CPM-2☆82Updated 2 years ago
- NLU & NLG (zero-shot) based on the mengzi-t5-base-mt pretrained model☆74Updated 2 years ago
- Transformer model based on the Gated Attention Unit (preview version)☆97Updated 2 years ago
- Simple experiments with the P-tuning method on Chinese☆139Updated 4 years ago
- Finetune CPM-1☆74Updated 2 years ago
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408☆195Updated 2 years ago
- ☆76Updated last year
- A Multi-modal Model Chinese Spell Checker Released on ACL2021.☆159Updated last year
- Simple implementation of using LoRA from the peft library to fine-tune chatglm-6b☆83Updated 2 years ago
- ☆218Updated 2 years ago
- reStructured Pre-training☆98Updated 2 years ago
- Tracking the progress in NLG for task-oriented dialogue systems (resources, code, new frontiers, etc.)☆134Updated 3 years ago