Finetune CPM-2
☆81Mar 18, 2023Updated 2 years ago
Alternatives and similar repositories for CPM-2-Finetune
Users who are interested in CPM-2-Finetune are comparing it to the libraries listed below.
- Code for CPM-2 Pre-Train☆158Mar 18, 2023Updated 2 years ago
- Introduction to CPM☆165Sep 26, 2021Updated 4 years ago
- Efficient Inference for Big Models☆587Jan 24, 2023Updated 3 years ago
- ☆37Jan 5, 2021Updated 5 years ago
- A Microsoft DeepSpeed plug-in that fixes a bug in the DeepSpeed pipeline☆25Apr 16, 2021Updated 4 years ago
- Finetune CPM-1☆75Mar 18, 2023Updated 2 years ago
- Finetune CPM-1☆24Jun 20, 2021Updated 4 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41May 31, 2021Updated 4 years ago
- Easy-to-use CPM for Chinese text generation (Chinese text generation based on CPM)☆531Apr 10, 2023Updated 2 years ago
- [Findings of ACL 2023] Communication Efficient Federated Learning for Multilingual Machine Translation with Adapter☆12Sep 4, 2023Updated 2 years ago
- Algorithmic and AI MIDI Drums Generator Implementation☆13Apr 1, 2022Updated 3 years ago
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models☆12Jul 1, 2023Updated 2 years ago
- Pretrain CPM-1☆53Apr 20, 2021Updated 4 years ago
- DeepTrace: A lightweight, scalable real-time diagnostic and analysis tool for distributed training tasks.☆18Nov 4, 2025Updated 4 months ago
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Aug 16, 2023Updated 2 years ago
- ☆220Dec 8, 2022Updated 3 years ago
- BMInf demos.☆16Oct 14, 2021Updated 4 years ago
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"☆19Jan 25, 2025Updated last year
- Repository for the ACL'22 paper "So Different Yet So Alike! Constrained Unsupervised Text Style Transfer"☆16Jan 19, 2024Updated 2 years ago
- ☆21Mar 19, 2021Updated 4 years ago
- ☆17Nov 14, 2022Updated 3 years ago
- Pun-GAN: Generative Adversarial Network for Pun Generation (EMNLP 2019)☆42Aug 19, 2019Updated 6 years ago
- Finetune CPM-1 For Text Generation☆18Jul 9, 2021Updated 4 years ago
- Chinese translation of the official documentation for the brat text annotation system☆16Apr 22, 2019Updated 6 years ago
- The code for "STYLEDGPT: Stylized Response Generation with Pre-trained Language Models" (Findings of EMNLP 2020)☆21Nov 16, 2020Updated 5 years ago
- Ultra-Fine Entity Typing with Weak Supervision from a Masked Language Model☆18Aug 2, 2021Updated 4 years ago
- Investigating the use of variational autoencoders with multinomial latent variables for unsupervised data☆25Jun 12, 2017Updated 8 years ago
- Implementation of the research paper Consistent Representation Learning for Continual Relation Extraction (Findings of ACL 2022)☆26May 16, 2022Updated 3 years ago
- Code and data for COLING 2022 paper titled "Structural Bias For Aspect Sentiment Triplet Extraction"☆26May 28, 2023Updated 2 years ago
- Global-to-Local Neural Networks for Document-Level Relation Extraction, EMNLP 2020☆53Oct 3, 2020Updated 5 years ago
- EVA: Large-scale Pre-trained Chit-Chat Models☆306Mar 11, 2023Updated 2 years ago
- MFIN7036 NLP Course Project☆10Jul 25, 2024Updated last year
- ☆30May 20, 2022Updated 3 years ago
- The repo of "Improving Seq2Seq Grammatical Error Correction via Decoding Interventions"☆32Jan 22, 2024Updated 2 years ago
- https://pypi.org/project/intent-suggestions/☆10Sep 6, 2022Updated 3 years ago
- Using LEAR for NER extraction☆29Mar 13, 2022Updated 3 years ago
- Code for CAET5☆23Jun 12, 2023Updated 2 years ago
- Zhijiang Cup: e-commerce review opinion mining, rank 30☆15Nov 3, 2019Updated 6 years ago
- Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle☆673Mar 6, 2024Updated 2 years ago