TsinghuaAI / CPM-1-Pretrain
Pretrain CPM-1
☆52 · Apr 20, 2021 · Updated 4 years ago
Alternatives and similar repositories for CPM-1-Pretrain
Users who are interested in CPM-1-Pretrain compare it to the libraries listed below.
- A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline ☆25 · Apr 16, 2021 · Updated 4 years ago
- Distill CPM-1 ☆18 · May 6, 2021 · Updated 4 years ago
- karthikbmk's independent study ☆10 · Sep 2, 2017 · Updated 8 years ago
- Introduction to CPM ☆165 · Sep 26, 2021 · Updated 4 years ago
- Code for our SIGIR'2017 paper "Neural Rating Regression with Abstractive Tips Generation for Recommendation" ☆14 · Jul 24, 2020 · Updated 5 years ago
- Finetune CPM-1 ☆75 · Mar 18, 2023 · Updated 2 years ago
- This repository mainly records reading notes on top-conference papers relevant to NLP algorithm engineers [Text Matching] ☆13 · Jul 9, 2022 · Updated 3 years ago
- An implementation of Maximum Entropy model ☆14 · Apr 28, 2012 · Updated 13 years ago
- The baseline method for CCIR 22 https://www.datafountain.cn/competitions/573 ☆13 · Aug 2, 2022 · Updated 3 years ago
- TREC Real-Time Summarization Tools ☆15 · Jul 19, 2017 · Updated 8 years ago
- ☆17 · Jul 5, 2022 · Updated 3 years ago
- Dynamic Entity Summarization (DynES) ☆20 · May 10, 2019 · Updated 6 years ago
- Automatically exported from code.google.com/p/cx-extractor ☆14 · Mar 8, 2016 · Updated 9 years ago
- A concise implementation of SimCSE ☆16 · Aug 2, 2021 · Updated 4 years ago
- [ACL'21] Dialogue Response Selection with Hierarchical Curriculum Learning ☆21 · Nov 15, 2022 · Updated 3 years ago
- EVA: Large-scale Pre-trained Chit-Chat Models ☆306 · Mar 11, 2023 · Updated 2 years ago
- Finetune CPM-2 ☆81 · Mar 18, 2023 · Updated 2 years ago
- Named Entity Recognition (NER) models (neural and sparse) implemented based on package LibN3L ☆19 · Jan 2, 2017 · Updated 9 years ago
- Scripts and library for the "Dictionary Learning Algorithms and Applications" book. ☆25 · May 29, 2018 · Updated 7 years ago
- A Transformer-based single-model, multi-scale VAE model ☆58 · Jun 29, 2021 · Updated 4 years ago
- ☆54 · Apr 15, 2022 · Updated 3 years ago
- TensorFlow implementation of RankGan (Adversarial Ranking for Language Generation) ☆22 · Jun 15, 2018 · Updated 7 years ago
- The codebase for "Group-wise Contrastive Learning for Neural Dialogue Generation" (Cai et al., Findings of EMNLP 2020) ☆55 · Feb 24, 2021 · Updated 4 years ago
- Code for CPM-2 Pre-Train ☆158 · Mar 18, 2023 · Updated 2 years ago
- [Findings of ACL 2022] Meta-Path Guided Contrastive Learning for Logical Reasoning of Text ☆28 · Mar 21, 2022 · Updated 3 years ago
- Tigon: A Distributed Database for a CXL Pod [OSDI '25] ☆45 · Nov 25, 2025 · Updated 2 months ago
- Chinese Pre-Trained Language Models (CPM-LM) Version-I ☆1,582 · Mar 18, 2023 · Updated 2 years ago
- NTK scaled version of ALiBi position encoding in Transformer. ☆69 · Aug 16, 2023 · Updated 2 years ago
- Conversational Toolkit. An Open-Source Toolkit for Fast Development and Fair Evaluation of Text Generation ☆129 · Aug 31, 2020 · Updated 5 years ago
- Group workspace for improvements to the Columbia Newsblaster system. ☆31 · May 12, 2016 · Updated 9 years ago
- This is a repository for machine translation with open license. ☆24 · Dec 1, 2015 · Updated 10 years ago
- The Argument Reasoning Comprehension Task: Source codes & Datasets ☆77 · Jan 29, 2022 · Updated 4 years ago
- An experimental implementation of the retrieval-enhanced language model ☆75 · Dec 29, 2022 · Updated 3 years ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training ☆405 · Jul 31, 2025 · Updated 6 months ago
- I use various Data Science and machine learning techniques to analyze customer data using STP framework. I preprocessed the data, perform… ☆12 · Apr 26, 2020 · Updated 5 years ago
- End-to-end integration of HuggingFace's models for sequence labeling. ☆11 · Oct 4, 2020 · Updated 5 years ago
- CSE201 Object-Oriented Programming in C++: Teach an AI to produce pieces of music ☆12 · Jan 23, 2019 · Updated 7 years ago
- [EMNLP 2020] Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine Reading ☆38 · Nov 22, 2022 · Updated 3 years ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations ☆38 · Apr 20, 2021 · Updated 4 years ago