TsinghuaAI/CPM-1-Pretrain

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TsinghuaAI/CPM-1-Pretrain)

TsinghuaAI / CPM-1-Pretrain

Pretrain CPM-1

☆53

Alternatives and similar repositories for CPM-1-Pretrain

Users that are interested in CPM-1-Pretrain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TsinghuaAI / CPM
View on GitHub
Introduction to CPM
☆164Sep 26, 2021Updated 4 years ago
TsinghuaAI / CPM-2-Finetune
View on GitHub
Finetune CPM-2
☆80Mar 18, 2023Updated 3 years ago
km1994 / nlp_paper_study_text_match
View on GitHub
仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记【文本匹配篇】
☆13Jul 9, 2022Updated 4 years ago
bqw18744018044 / Concise_SimCSE
View on GitHub
A concise implementation of SimCSE
☆16Aug 2, 2021Updated 4 years ago
Tianxiaomo / cail2019_rc
View on GitHub
中国法研杯 CAIL 2019
☆13Jun 17, 2019Updated 7 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
thu-coai / EVA
View on GitHub
EVA: Large-scale Pre-trained Chit-Chat Models
☆304Mar 11, 2023Updated 3 years ago
xhuang28 / NewBioNer
View on GitHub
☆11Nov 16, 2019Updated 6 years ago
VITA-Group / EarlyBERT
View on GitHub
[ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …
☆18Dec 30, 2021Updated 4 years ago
nicyun / easyME
View on GitHub
An implementation of Maximum Entropy model
☆14Apr 28, 2012Updated 14 years ago
yxuansu / HCL
View on GitHub
[ACL'21] Dialogue Response Selection with Hierarchical Curriculum Learning
☆21Nov 15, 2022Updated 3 years ago
shinchen / cx-extractor
View on GitHub
Automatically exported from code.google.com/p/cx-extractor
☆14Mar 8, 2016Updated 10 years ago
tapilab / is-karthikbmk
View on GitHub
karthikbmk's independent study
☆10Sep 2, 2017Updated 8 years ago
mgormley / agiga
View on GitHub
Annotated Gigaword Java API and Command Line Tools
☆15Mar 30, 2016Updated 10 years ago
TsinghuaAI / CUGE
View on GitHub
☆54Apr 15, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
trecrts / trecrts-eval
View on GitHub
TREC Real-Time Summarization Tools
☆15Jul 19, 2017Updated 9 years ago
keezen / ntk_alibi
View on GitHub
NTK scaled version of ALiBi position encoding in Transformer.
☆69Aug 16, 2023Updated 2 years ago
DAMO-NLP-SG / RemeMo
View on GitHub
[EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning
☆17Oct 31, 2023Updated 2 years ago
pluskid / DistLearn.jl
View on GitHub
Example of distributed learning in Julia
☆21Jun 28, 2017Updated 9 years ago
OpenBMB / BMInf
View on GitHub
Efficient Inference for Big Models
☆583Jul 7, 2026Updated 3 weeks ago
stmilab / ArterialNet
View on GitHub
ArterialNet reconstructs arterial blood pressure (ABP) waveform
☆14Feb 24, 2025Updated last year
jszheng21 / RACE
View on GitHub
RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.
☆14Oct 12, 2024Updated last year
XinshuangL / SELF-PARAM
View on GitHub
The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"
☆15May 18, 2025Updated last year
ProjectD-AI / LLaMA-Megatron-DeepSpeed
View on GitHub
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆69Jul 20, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
WHUIR / MATINF
View on GitHub
The dataset and PyTorch Implementation for ACL 2020 paper "MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Ans…
☆42Sep 7, 2020Updated 5 years ago
hengyicai / ContrastiveLearning4Dialogue
View on GitHub
The codebase for "Group-wise Contrastive Learning for Neural Dialogue Generation" (Cai et al., Findings of EMNLP 2020)
☆55Feb 24, 2021Updated 5 years ago
bojone / univae
View on GitHub
基于Transformer的单模型、多尺度的VAE模型
☆57Jun 29, 2021Updated 5 years ago
veronica320 / Zeroshot-Event-Extraction
View on GitHub
Repository for ACL2021 paper: <Zero-shot Event Extraction via Transfer Learning: Challenges and Insights>.
☆30Jan 5, 2023Updated 3 years ago
domaineval / DomainEval
View on GitHub
DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …
☆13Dec 12, 2024Updated last year
leezythu / Awesome-Harness-Self-Improvement
View on GitHub
A curated reading list on harness engineering for recursive self-improvement of LLM agents (EN/ZH).
☆20Jul 9, 2026Updated 3 weeks ago
tabpfn-anonym / TabPFNAnonym
View on GitHub
☆16Nov 25, 2022Updated 3 years ago
franticnerd / triovecevent
View on GitHub
☆13Aug 23, 2017Updated 8 years ago
gilfernandes / complex_chain_playground
View on GitHub
Playground project acting as an example for a complex LangChain workflow
☆11Jun 20, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
fuxuelinwudi / ccks2021_track3_baseline
View on GitHub
a baseline to practice
☆45Jul 6, 2021Updated 5 years ago
disrpt / sharedtask2019
View on GitHub
Repository for DISRPT2019 shared task
☆12Sep 5, 2022Updated 3 years ago
thu-coai / cotk
View on GitHub
Conversational Toolkit. An Open-Source Toolkit for Fast Development and Fair Evaluation of Text Generation
☆129Aug 31, 2020Updated 5 years ago
staymylove / COT_Compresstion_via_Step_entropy
View on GitHub
☆29Aug 8, 2025Updated 11 months ago
genggui001 / Megatron-DeepSpeed-Llama
View on GitHub
☆84Sep 9, 2023Updated 2 years ago
althayr / Document-Layout-Parser
View on GitHub
Parses a document (scanned or phone captured) and returns the underlying question - answer layout structured capture by LayoutXLM model
☆10Jun 14, 2021Updated 5 years ago
NikhilSehgal123 / coinbase-execution-algorithm
View on GitHub
An algorithm that intelligently executes a crypto order over time via Coinbase
☆13Oct 26, 2021Updated 4 years ago