bytedance / ParaGen
ParaGen is a PyTorch deep learning framework for parallel sequence generation.
☆186Updated last year
Related projects: ⓘ
- Implementation of "Glancing Transformer for Non-Autoregressive Neural Machine Translation"☆135Updated last year
- ☆119Updated 2 years ago
- ☆165Updated 2 years ago
- Introduction to CPM☆164Updated 2 years ago
- ☆57Updated this week
- RoFormer升级版☆151Updated 2 years ago
- Code for CPM-2 Pre-Train☆159Updated last year
- A unified tokenization tool for Images, Chinese and English.☆149Updated last year
- NTK scaled version of ALiBi position encoding in Transformer.☆64Updated last year
- Finetune CPM-2☆83Updated last year
- Finetune CPM-1☆75Updated last year
- NLU & NLG (zero-shot) depend on mengzi-t5-base-mt pretrained model☆75Updated last year
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆95Updated last year
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408☆188Updated last year
- A paper list of pre-trained language models (PLMs).☆137Updated 2 years ago
- ☆73Updated last year
- Must-read papers on improving efficiency for pre-trained language models.☆100Updated last year
- Code, Data and Demo for Paper: Controllable Generation from Pre-trained Language Models via Inverse Prompting☆120Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆67Updated last year
- 中文图书语料MD5链接☆209Updated 7 months ago
- EVA: Large-scale Pre-trained Chit-Chat Models☆304Updated last year
- Pretrain CPM-1☆50Updated 3 years ago
- 大规模中文语料☆34Updated 4 years ago
- ☆245Updated last year
- a Fast, Flexible, Extensible and Easy-to-use NLP Large-scale Pretraining and Multi-task Learning Framework.☆176Updated 3 years ago
- This is a code repository for the ACL 2022 paper "Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine Tra…☆52Updated 2 years ago
- P-tuning方法在中文上的简单实验☆138Updated 3 years ago
- ☆155Updated last month
- TencentLLMEval is a comprehensive and extensive benchmark for artificial evaluation of large models that includes task trees, standards, …☆38Updated 3 weeks ago
- FLASHQuad_pytorch☆66Updated 2 years ago