Finetune CPM-1
☆74Mar 18, 2023Updated 3 years ago
Alternatives and similar repositories for CPM-1-Finetune
Users that are interested in CPM-1-Finetune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆37Jan 5, 2021Updated 5 years ago
- ☆49Dec 24, 2020Updated 5 years ago
- ☆25Sep 29, 2021Updated 4 years ago
- Chinese Pre-Trained Language Models (CPM-LM) Version-I☆1,580Mar 18, 2023Updated 3 years ago
- A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline☆25Apr 16, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code for CPM-2 Pre-Train☆157Mar 18, 2023Updated 3 years ago
- Distill CPM-1☆18May 6, 2021Updated 4 years ago
- Pretrain CPM-1☆53Apr 20, 2021Updated 4 years ago
- ☆34Jul 29, 2021Updated 4 years ago
- Finetune CPM-2☆82Mar 18, 2023Updated 3 years ago
- The pytorch implementation of the SAFE model presented in NAACL-Findings-2022☆17Mar 10, 2023Updated 3 years ago
- 中国法研杯 CAIL 2019☆13Jun 17, 2019Updated 6 years ago
- ☆247Oct 21, 2022Updated 3 years ago
- Official Code for NAACL 2022 paper: "Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation"☆16Sep 1, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A framework for cleaning Chinese dialog data☆273May 14, 2021Updated 4 years ago
- Code for our AAAI2021 paper: Token-Aware Virtual Adversarial Training For Language Understanding.☆25Dec 3, 2020Updated 5 years ago
- ☆13May 23, 2021Updated 4 years ago
- Easy-to-use CPM for Chinese text generation(基于CPM的中文文本生成)☆531Apr 10, 2023Updated 2 years ago
- Findings of ACL 2021☆24May 8, 2021Updated 4 years ago
- Tools for training pytorch language models☆27Nov 14, 2020Updated 5 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- EVA: Large-scale Pre-trained Chit-Chat Models☆305Mar 11, 2023Updated 3 years ago
- A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models☆1,938Jun 12, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Finetune CPM-1☆24Jun 20, 2021Updated 4 years ago
- Dataset and Baseline for SMP-MCC2020☆23Jul 6, 2023Updated 2 years ago
- ☆20Sep 17, 2021Updated 4 years ago
- Datasets for the paper "Improving the Robustness of Question Answering Systems to Question Paraphrasing" (ACL 2019)☆27Aug 7, 2019Updated 6 years ago
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models☆38Apr 24, 2022Updated 3 years ago
- Introduction to CPM☆165Sep 26, 2021Updated 4 years ago
- The source code of Text Style Transfer via Learning Style Instance Supported Latent Space (IJCAI 2020).☆38Dec 21, 2020Updated 5 years ago
- Code accompanying ICML 2021 paper "Few-shot Language Coordination by Modeling Theory of Mind"☆18May 18, 2022Updated 3 years ago
- ChID: A Large-scale Chinese IDiom Dataset for Cloze Test☆150May 8, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆12Nov 11, 2019Updated 6 years ago
- AIR retriever for Multi-Hop QA (ACL 2020 paper)☆30Jul 18, 2020Updated 5 years ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆25Sep 26, 2024Updated last year
- [ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts?☆16Aug 8, 2023Updated 2 years ago
- Code and dataset for paper "End-to-end Emotion-Cause Pair Extraction via Learning to Link"☆16Jan 12, 2022Updated 4 years ago
- Repo containing the Twitter preprocessor module, developed by the AUTH OSWinds team☆26Dec 10, 2020Updated 5 years ago
- Pytorch model for https://github.com/imcaspar/gpt2-ml☆77Nov 21, 2021Updated 4 years ago