UniversalDependencies / UD_Chinese-GSDSimpLinks
Conversion of UD_Chinese-GSD to simplified Chinese characters.
☆37Updated last month
Alternatives and similar repositories for UD_Chinese-GSDSimp
Users that are interested in UD_Chinese-GSDSimp are comparing it to the libraries listed below
Sorting:
- SemEval-2016 Task 9: Chinese Semantic Dependency Parsing☆138Updated 7 years ago
- 各大中文分词性能评测☆157Updated 6 years ago
- ☆174Updated 2 years ago
- Corpus creator for Chinese Wikipedia☆41Updated 4 years ago
- ☆96Updated last month
- This directory contains the training, test, and gold-standard data used in the 2nd International Chinese Word Segmentation Bakeoff. Also …☆68Updated 7 years ago
- ☆75Updated 2 years ago
- ChID: A Large-scale Chinese IDiom Dataset for Cloze Test☆151Updated 2 years ago
- 中文分词软件基准测试 | Chinese tokenizer benchmark☆25Updated 7 years ago
- 人民日报语料处理工具集 | Tools for Corpus of People's Daily☆285Updated 2 years ago
- Simple Solution for Multi-Criteria Chinese Word Segmentation☆303Updated 5 years ago
- 人民日报1998年1-4月中文标注语料库☆32Updated 6 years ago
- Must-read Papers on Sememe Computation☆198Updated 2 years ago
- Cognitive Inference,认知推理、常识知识库、常识推理与常识推理评估的系统项目,以现有国内外已有的常识知识库为研究对象,从常识知识库资源建设和常识推理测试评估两个方面出发进行整理,并结合自己近几年来在逻辑性推理知识库的构建、应用以及理论思考进行介绍。具体包括…☆123Updated 5 years ago
- 教育行业新闻 自动文摘 语料库 自动摘要☆203Updated 7 years ago
- Convolutional neural network and word embeddings for Chinese word segmentation☆147Updated 3 years ago
- An open-source classical Chinese information processing toolkit developed by Tsinghua Natural Language Processing Group☆51Updated 6 years ago
- This is a corpus of Chinese abbreviation, including negative full forms.☆197Updated 4 years ago
- 基于 Bi-LSTM 和 CRF 的中文语义角色标注☆87Updated 6 years ago
- BERT-CCPoem is an BERT-based pre-trained model particularly for Chinese classical poetry☆157Updated 3 years ago
- A paper list of automatic poetry generation, analysis, translation, etc.☆186Updated 4 years ago
- cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information☆275Updated 2 years ago
- 中文生成式预训练模型☆99Updated 5 years ago
- Codes for Stylistic Chinese Poetry Generation via Unsupervised Style Disentanglement (EMNLP 2018)☆195Updated 5 years ago
- NLP NER datasets video/music/book bio☆90Updated 4 years ago
- Python scripts preprocessing Penn Treebank and Chinese Treebank☆161Updated 5 years ago
- Subword Encoding in Lattice LSTM for Chinese Word Segmentation☆54Updated 6 years ago
- 一个轻量且功能全面的中文分词器,帮助学生了解分词器的工作原理。MicroTokenizer: A lightweight Chinese tokenizer designed for educational and research purposes. Provides a…☆157Updated last year
- Code for NeurIPS 2019 - Glyce: Glyph-vectors for Chinese Character Representations☆429Updated 2 years ago
- 中文分词工具评估☆63Updated 2 years ago