zeicold / pycnnum
☆19Updated 10 months ago
Alternatives and similar repositories for pycnnum:
Users that are interested in pycnnum are comparing it to the libraries listed below
- Hanzi Converter for Traditional and Simplified Chinese☆186Updated 5 years ago
- ☆93Updated 5 months ago
- Chinese word segmentation module of LTP☆46Updated 9 years ago
- Constants used in Chinese text processing☆370Updated 4 months ago
- High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementa…☆94Updated 6 months ago
- ☆129Updated 7 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆62Updated 3 years ago
- Corpus of Annotations for Misspelings☆24Updated last year
- an open solution for collecting n-gram Chinese lexicon and n-gram statistics☆74Updated 9 years ago
- ☆125Updated 4 years ago
- NMT for chinese-english using tensor2tensor☆47Updated 7 years ago
- Re-rank n-best lists using additional features.☆28Updated 6 years ago
- ICU based universal language tokenizer☆31Updated 3 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated 2 years ago
- Translation Error Rate (TER)☆43Updated 6 years ago
- Subword Encoding in Lattice LSTM for Chinese Word Segmentation☆53Updated 6 years ago
- Berserker - BERt chineSE woRd toKenizER☆16Updated 6 years ago
- ☆36Updated 2 years ago
- Textprep is an analyzing tool for both parallel and non-parallel corpus and its down-stream Natural Language Processing and Machine Trans…☆32Updated 6 years ago
- repo for Tibetan corpora☆21Updated 2 years ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)☆170Updated 6 years ago
- 中文分词软件基准测试 | Chinese tokenizer benchmark☆24Updated 6 years ago
- Baseline models, training scripts, and instructions on how to reproduce our results for our state-of-art grammar correction system from M…☆73Updated 5 years ago
- You do not need to modify your model when applied it to Chinese,you can translate chinese chars to wubi ,then you can process chinese cha…☆38Updated 6 years ago
- Codes for Lexical Sememe Prediction via Word Embeddings and Matrix Factorization (IJCAI 2017).☆60Updated 5 years ago
- Spoken Cantonese from Hong Kong.☆29Updated 5 months ago
- Skip-Thought Vectors implement by tensorflow☆10Updated 7 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆73Updated 10 years ago
- Universal dependencies homepage☆39Updated this week
- Pre-processing and training scripts for WMT 2017 ZH-EN translation task☆39Updated 4 years ago