Pretrain CPM-1
☆53Apr 20, 2021Updated 5 years ago
Alternatives and similar repositories for CPM-1-Pretrain
Users that are interested in CPM-1-Pretrain are comparing it to the libraries listed below.
- A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline☆25Apr 16, 2021Updated 5 years ago
- Distill CPM-1☆18May 6, 2021Updated 5 years ago
- Finetune CPM-1☆73Mar 18, 2023Updated 3 years ago
- Introduction to CPM☆164Sep 26, 2021Updated 4 years ago
- Code for CPM-2 Pre-Train☆157Mar 18, 2023Updated 3 years ago
- ☆37Jan 5, 2021Updated 5 years ago
- China AI and Law Challenge (CAIL) 2019☆13Jun 17, 2019Updated 6 years ago
- A concise implementation of SimCSE☆16Aug 2, 2021Updated 4 years ago
- EVA: Large-scale Pre-trained Chit-Chat Models☆305Mar 11, 2023Updated 3 years ago
- The baseline method for CCIR 22 https://www.datafountain.cn/competitions/573☆13Aug 2, 2022Updated 3 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated last year
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …☆18Dec 30, 2021Updated 4 years ago
- Code for our SIGIR'2017 paper "Neural Rating Regression with Abstractive Tips Generation for Recommendation"☆14Jul 24, 2020Updated 5 years ago
- [ACL'21] Dialogue Response Selection with Hierarchical Curriculum Learning☆21Nov 15, 2022Updated 3 years ago
- An implementation of Maximum Entropy model☆14Apr 28, 2012Updated 14 years ago
- karthikbmk's independent study☆10Sep 2, 2017Updated 8 years ago
- ☆54Apr 15, 2022Updated 4 years ago
- Dynamic Entity Summarization (DynES)☆20May 10, 2019Updated 7 years ago
- NTK scaled version of ALiBi position encoding in Transformer.☆69Aug 16, 2023Updated 2 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆19Aug 17, 2022Updated 3 years ago
- ☆15Dec 10, 2021Updated 4 years ago
- ☆17Jul 5, 2022Updated 3 years ago
- TREC Real-Time Summarization Tools☆15Jul 19, 2017Updated 8 years ago
- ☆15Jul 18, 2022Updated 3 years ago
- ☆52Jan 1, 2024Updated 2 years ago
- Example of distributed learning in Julia☆21Jun 28, 2017Updated 8 years ago
- Efficient Inference for Big Models☆584Jan 24, 2023Updated 3 years ago
- ☆15Nov 19, 2021Updated 4 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Jul 20, 2023Updated 2 years ago
- The dataset and PyTorch Implementation for ACL 2020 paper "MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Ans…☆43Sep 7, 2020Updated 5 years ago
- The codebase for "Group-wise Contrastive Learning for Neural Dialogue Generation" (Cai et al., Findings of EMNLP 2020)☆55Feb 24, 2021Updated 5 years ago
- An algorithm that intelligently executes a crypto order over time via Coinbase☆13Oct 26, 2021Updated 4 years ago
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.☆14Oct 12, 2024Updated last year
- A Transformer-based single-model, multi-scale VAE model☆57Jun 29, 2021Updated 4 years ago
- Repository for ACL2021 paper: <Zero-shot Event Extraction via Transfer Learning: Challenges and Insights>.☆30Jan 5, 2023Updated 3 years ago
- ☆16Nov 25, 2022Updated 3 years ago
- ArterialNet reconstructs arterial blood pressure (ABP) waveform☆14Feb 24, 2025Updated last year
- DSTC9 Multi-Domain Task-Oriented Dialog Challenge II☆34Nov 26, 2020Updated 5 years ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆13Dec 12, 2024Updated last year