Pretrain CPM-1
☆53Apr 20, 2021Updated 5 years ago
Alternatives and similar repositories for CPM-1-Pretrain
Users that are interested in CPM-1-Pretrain are comparing it to the libraries listed below.
- Finetune CPM-1☆73Mar 18, 2023Updated 3 years ago
- Introduction to CPM☆165Sep 26, 2021Updated 4 years ago
- benchmarks for evaluating MT models☆11Jun 26, 2024Updated last year
- Code for CPM-2 Pre-Train☆157Mar 18, 2023Updated 3 years ago
- This repository mainly records study notes on top-conference papers relevant to NLP algorithm engineers [Text Matching]☆13Jul 9, 2022Updated 3 years ago
- China AI and Law Challenge (CAIL) 2019☆13Jun 17, 2019Updated 6 years ago
- A concise implementation of SimCSE☆16Aug 2, 2021Updated 4 years ago
- EVA: Large-scale Pre-trained Chit-Chat Models☆305Mar 11, 2023Updated 3 years ago
- The baseline method for CCIR 22 https://www.datafountain.cn/competitions/573☆13Aug 2, 2022Updated 3 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 11 months ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41May 31, 2021Updated 4 years ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …☆18Dec 30, 2021Updated 4 years ago
- Chinese Pre-Trained Language Models (CPM-LM) Version-I☆1,580Mar 18, 2023Updated 3 years ago
- Stochastic Gradient Markov Chain Monte Carlo and Optimisation☆17Mar 21, 2017Updated 9 years ago
- [ACL'21] Dialogue Response Selection with Hierarchical Curriculum Learning☆21Nov 15, 2022Updated 3 years ago
- An implementation of Maximum Entropy model☆14Apr 28, 2012Updated 14 years ago
- karthikbmk's independent study☆10Sep 2, 2017Updated 8 years ago
- ☆54Apr 15, 2022Updated 4 years ago
- NTK scaled version of ALiBi position encoding in Transformer.☆69Aug 16, 2023Updated 2 years ago
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆19Aug 17, 2022Updated 3 years ago
- ☆15Dec 10, 2021Updated 4 years ago
- ☆17Jul 5, 2022Updated 3 years ago
- TREC Real-Time Summarization Tools☆15Jul 19, 2017Updated 8 years ago
- [EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning☆17Oct 31, 2023Updated 2 years ago
- Example of distributed learning in Julia☆20Jun 28, 2017Updated 8 years ago
- ☆15Nov 19, 2021Updated 4 years ago
- The codebase for "Group-wise Contrastive Learning for Neural Dialogue Generation" (Cai et al., Findings of EMNLP 2020)☆55Feb 24, 2021Updated 5 years ago
- A Transformer-based single-model, multi-scale VAE model☆57Jun 29, 2021Updated 4 years ago
- Repository for ACL2021 paper: <Zero-shot Event Extraction via Transfer Learning: Challenges and Insights>.☆30Jan 5, 2023Updated 3 years ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆13Dec 12, 2024Updated last year
- Conversational Toolkit. An Open-Source Toolkit for Fast Development and Fair Evaluation of Text Generation☆129Aug 31, 2020Updated 5 years ago
- a baseline to practice☆45Jul 6, 2021Updated 4 years ago
- ☆13Aug 23, 2017Updated 8 years ago
- ☆38Aug 5, 2024Updated last year
- Repository for DISRPT2019 shared task☆12Sep 5, 2022Updated 3 years ago
- [ACL 2021 Findings] HySPA: Hybrid Span Generation for Scalable Text-to-Graph Extraction☆10Sep 16, 2021Updated 4 years ago
- Parses a document (scanned or phone captured) and returns the underlying question - answer layout structured capture by LayoutXLM model☆10Jun 14, 2021Updated 4 years ago
- ☆58Dec 18, 2022Updated 3 years ago
- code for "Fine-grained Entity Typing via Label Reasoning" EMNLP2021☆13May 27, 2022Updated 3 years ago