voidism / pywordsegLinks
Open Source State-of-the-art Chinese Word Segmentation System with BiLSTM and ELMo. https://arxiv.org/abs/1901.05816
☆45Updated 4 years ago
Alternatives and similar repositories for pywordseg
Users that are interested in pywordseg are comparing it to the libraries listed below
Sorting:
- A question answering dataset for machine comprehension of spoken content☆78Updated 7 years ago
- A list of awesome machine question answering dataset - 機器問答數據集☆15Updated 5 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆63Updated 3 years ago
- 🍳 NLPrep - dataset tool for many natural language processing task☆28Updated 4 years ago
- 🤖📇 handling multiple nlp task in one pipeline☆56Updated last month
- Normalize text string☆12Updated 6 years ago
- Fine tuning bert for text generation☆37Updated 5 years ago
- ☆31Updated 4 years ago
- 🏃 hosting nlp models in one line☆20Updated last year
- ☆31Updated 7 years ago
- Sub-Character Representation Learning☆25Updated 7 years ago
- explores Chinese language models with sub-character level visual information☆16Updated 7 years ago
- Upcoming ACL 2020 paper☆25Updated 5 years ago
- ⚙️Tool for NLP - handle file and text☆15Updated 8 months ago
- Phraseg - 一言:新詞發現工具包☆26Updated 3 years ago
- Unofficial implementation of "Non-Autoregressive Machine Translation with Latent Alignments" https://arxiv.org/abs/2004.07437☆23Updated 5 years ago
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆17Updated 2 years ago
- ☆12Updated 10 years ago
- Code for "A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies."☆27Updated 3 years ago
- Spoken Cantonese from Hong Kong.☆30Updated last month
- Query-based Attention CNN for Text Similarity Map☆31Updated 7 years ago
- ☆15Updated 4 years ago
- Unsupervised spoken sentence embeddings☆14Updated 2 years ago
- A fast LSTM Language Model for large vocabulary language like Japanese and Chinese☆111Updated 6 years ago
- The implementation of the papers on dual learning of natural language understanding and generation. (ACL2019,2020; Findings of EMNLP 2020…☆67Updated 5 years ago
- ☆10Updated 3 years ago
- A 30000+ Chinese MRC dataset - Delta Reading Comprehension Dataset☆313Updated 5 years ago
- ☆50Updated 3 years ago
- Neural quality estimation toolkit for grammatical error correction and other language generation applications.☆49Updated 6 years ago
- 轉換好的 Albert 中文模型 (for pytorch-transformers)☆19Updated 5 years ago