voidism / pywordseg
Open Source State-of-the-art Chinese Word Segmentation System with BiLSTM and ELMo. https://arxiv.org/abs/1901.05816
☆40Updated 3 years ago
Alternatives and similar repositories for pywordseg:
Users that are interested in pywordseg are comparing it to the libraries listed below
- A list of awesome machine question answering dataset - 機器問答數據集☆15Updated 5 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆59Updated 2 years ago
- 🍳 NLPrep - dataset tool for many natural language processing task☆28Updated 3 years ago
- Fine tuning bert for text generation☆37Updated 5 years ago
- Normalize text string☆12Updated 6 years ago
- ☆30Updated 6 years ago
- 🤖📇 handling multiple nlp task in one pipeline☆56Updated last year
- 🏃 hosting nlp models in one line☆20Updated 9 months ago
- A question answering dataset for machine comprehension of spoken content☆78Updated 6 years ago
- Phraseg - 一言:新詞發現工具包☆26Updated 3 years ago
- A 30000+ Chinese MRC dataset - Delta Reading Comprehension Dataset☆308Updated 4 years ago
- Query-based Attention CNN for Text Similarity Map☆31Updated 7 years ago
- ROUGE score calculator with traditional chinese word segmentation☆9Updated 3 years ago
- The enhanced version of ZEN, larger and more powerful.☆28Updated 2 years ago
- 轉換好的 Albert 中文模型 (for pytorch-transformers)☆18Updated 4 years ago
- Why Attention? Analyze BiLSTM Deficiency and Its Remedies in the Case of NER☆9Updated 5 years ago
- ☆31Updated 3 years ago
- Ordered Neurons LSTM☆30Updated 3 years ago
- Upcoming ACL 2020 paper☆25Updated 4 years ago
- This is the code in <Selection Bias Explorations and Debias Methods for Natural Language Sentence Matching Datasets> which has been accep…☆34Updated last year
- explores Chinese language models with sub-character level visual information☆16Updated 6 years ago
- A web crawler specifically for PTT website.☆19Updated 6 years ago
- Tensorflow implementation of Bi-directional RNN Langauge Model☆38Updated 6 years ago
- Python package for understanding the difficulty of text classification datasets. (in CoNNL 2018)☆63Updated 4 years ago
- COS960: A Chinese Word Similarity Dataset of 960 Word Pairs☆35Updated 5 years ago
- Code and data for the NAACL 2019 paper "Improving Cross-Domain Chinese Word Segmentation with Word Embeddings"☆10Updated 5 years ago
- Unofficial implementation of "Non-Autoregressive Machine Translation with Latent Alignments" https://arxiv.org/abs/2004.07437☆24Updated 4 years ago
- The implementation of the papers on dual learning of natural language understanding and generation. (ACL2019,2020; Findings of EMNLP 2020…☆66Updated 4 years ago
- Re-rank n-best lists using additional features.☆28Updated 6 years ago
- ⚙️Tool for NLP - handle file and text☆15Updated 7 months ago