JaniceZhao / Douban-Dushu-Dataset
A dataset containing 37 million Douban Dushu comments
☆59Updated 6 years ago
Alternatives and similar repositories for Douban-Dushu-Dataset:
Users who are interested in Douban-Dushu-Dataset are comparing it to the repositories listed below
- ☆31Updated 6 years ago
- Code for Synchronous Bidirectional Neural Machine Translation (SB-NMT)☆66Updated 5 years ago
- High-performance small model evaluation: Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆57Updated 4 years ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)☆167Updated 5 years ago
- MAsked Sequence to Sequence (MASS) pre-training for language generation☆21Updated 5 years ago
- Source codes for paper "Neural Networks Incorporating Dictionaries for Chinese Word Segmentation", AAAI 2018☆90Updated 7 years ago
- QANet + DuReader Chinese machine reading comprehension☆221Updated 6 years ago
- A curated list of Chinese corpora resources for NLP (Natural Language Processing)☆74Updated 5 years ago
- An open-source classical Chinese information processing toolkit developed by Tsinghua Natural Language Processing Group☆48Updated 6 years ago
- Offline on-device reading comprehension app: QA for mobile, Android & iPhone☆60Updated 2 years ago
- a simple yet complete implementation of the popular BERT model☆127Updated 4 years ago
- Chinese pre-trained ELECTRA model: pretrain Chinese Model based on adversarial learning☆139Updated 4 years ago
- Finetune CPM-1☆75Updated last year
- Python toolkit for the Chinese Language Understanding Evaluation (CLUE) benchmark☆128Updated last year
- DistilBERT for Chinese: a distilled BERT model pre-trained on large-scale Chinese data☆91Updated 5 years ago
- Implementation of paper: Deng K, Bol P K, Li K J, et al. On the unsupervised analysis of domain-specific Chinese texts[J]. Proceedings of…☆77Updated 8 years ago
- Subword Encoding in Lattice LSTM for Chinese Word Segmentation☆53Updated 5 years ago
- Bert finetune for CMRC2018, CJRC, DRCD, CHID, C3☆182Updated 4 years ago
- The source codes of Working Memory model for Chinese poetry generation (IJCAI 2018).☆53Updated 4 years ago
- Automatic Chinese text error correction☆81Updated 6 years ago
- cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information☆273Updated last year
- An Implementation of 'Attention is all you need' with Chinese Corpus☆130Updated 9 months ago
- Collections of Chinese reading comprehension datasets☆215Updated 5 years ago
- Source codes and corpora of paper "Iterated Dilated Convolutions for Chinese Word Segmentation"☆135Updated 3 years ago
- Chinese "spelling" error correction☆259Updated 7 years ago
- NLP NER datasets video/music/book bio☆84Updated 4 years ago
- ☆36Updated 5 years ago
- Deep contextualized word representations for Chinese☆151Updated 5 years ago
- Language model in Chinese, implemented based on the official PyTorch documentation☆68Updated 6 years ago
- A toolkit for abstractive summarization that makes it easy to implement the baseline and our proposed models, which can achieve the SOTA perf…☆49Updated 6 years ago