zeicold / pycnnum
☆19Updated 9 months ago
Alternatives and similar repositories for pycnnum:
Users who are interested in pycnnum compare it to the libraries listed below:
- ☆92Updated 4 months ago
- Hanzi Converter for Traditional and Simplified Chinese☆184Updated 5 years ago
- Constants used in Chinese text processing☆371Updated 3 months ago
- High-performance Trie and Aho-Corasick automaton (AC automaton) keyword match and replace tool for Python. Correct case insensitive implementa…☆94Updated 5 months ago
- Chinese tokenizer benchmark☆23Updated 6 years ago
- Chinese word segmentation module of LTP☆46Updated 9 years ago
- Berserker - BERt chineSE woRd toKenizER☆16Updated 6 years ago
- Re-rank n-best lists using additional features.☆28Updated 6 years ago
- Pipelined quality estimation.☆51Updated 5 years ago
- Subword Encoding in Lattice LSTM for Chinese Word Segmentation☆53Updated 5 years ago
- Python module that identifies Chinese text as being Simplified or Traditional☆91Updated 4 months ago
- SegEval Segmentation Evaluation Package☆56Updated last year
- Corpus of Annotations for Misspellings☆24Updated last year
- People's Daily annotated Chinese corpus, January to April 1998☆30Updated 6 years ago
- NMT for Chinese-English using tensor2tensor☆47Updated 7 years ago
- Code of EMNLP paper: http://aclweb.org/anthology/D18-1531☆62Updated 5 years ago
- ☆75Updated 2 years ago
- Translation Error Rate (TER)☆43Updated 6 years ago
- ICU based universal language tokenizer☆30Updated 3 years ago
- THU Chinese Keyphrase Extraction Toolkit☆125Updated 6 years ago
- Prior Knowledge Integration for Neural Machine Translation using Posterior Regularization☆11Updated 6 years ago
- This directory contains the training, test, and gold-standard data used in the 2nd International Chinese Word Segmentation Bakeoff. Also …☆66Updated 6 years ago
- Clone of "A Good Part-of-Speech Tagger in about 200 Lines of Python" by Matthew Honnibal☆48Updated 8 years ago
- Pure python Aho-Corasick library.☆214Updated 2 years ago
- ☆172Updated 2 years ago
- A fast ELMo implementation (no longer maintained)☆38Updated 2 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆59Updated 3 years ago
- Python version of the evaluation script from CoNLL'00☆92Updated 4 years ago
- ☆125Updated 4 years ago
- MaxMatch (M^2) Scorer - Evaluation program for grammatical error correction systems.☆150Updated 2 years ago