結巴中文斷詞台灣繁體版本
☆109Nov 3, 2017Updated 8 years ago
Alternatives and similar repositories for jieba-tw
Users that are interested in jieba-tw are comparing it to the libraries listed below
Sorting:
- MONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型☆247Feb 20, 2025Updated last year
- CKIP Neural Chinese Word Segmentation, POS Tagging, and NER☆1,681Jul 9, 2025Updated 8 months ago
- CKIP Transformers☆765Apr 21, 2023Updated 2 years ago
- CKIP CoreNLP Toolkits☆128Apr 9, 2023Updated 2 years ago
- API of Articut 中文斷詞 (兼具語意詞性標記):「斷詞」又稱「分詞」,是中文資訊處理的基礎。Articut 不用機器學習,不需資料模 型,只用現代白話中文語法規則,即能達到 SIGHAN 2005 F1-measure 94% 以上,Recall 96% 以上的…☆414Feb 10, 2026Updated 3 weeks ago
- ☆10Dec 3, 2020Updated 5 years ago
- ☆29Feb 11, 2025Updated last year
- ☆29Jan 17, 2019Updated 7 years ago
- CyberCan is a lexicon of contemporary Cantonese based on more than 100 million pieces of internet texts from discussion forums in Hong Ko…☆12Aug 24, 2021Updated 4 years ago
- 🍳 NLPrep - dataset tool for many natural language processing task☆28Jul 30, 2021Updated 4 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆63Feb 20, 2022Updated 4 years ago
- 正規化台灣地址,並取得縣市、鄉鎮、區碼☆137Apr 10, 2025Updated 10 months ago
- Go 程式設 計語言技術共筆 如要加入這個技術共筆與線上討論, 請參考右邊的網站☆13Jun 6, 2016Updated 9 years ago
- ☆14Feb 9, 2022Updated 4 years ago
- 粵語/廣東話參考資料 Reference Materials for Yue / Cantonese☆14Dec 12, 2025Updated 2 months ago
- 聯發創新基地(MediaTek Research) 致力於研究基礎模型。我們將研究體現在適合繁體中文使用者的模型上,並在使用權許可的情況下,提供模型給學術界研究 或產業界使用。☆266Sep 8, 2025Updated 6 months ago
- 媠聲/鬥拍字 - 唸台語文予你聽,台語語音合成開源服務☆36Jan 20, 2026Updated last month
- 公眾人物新聞的追蹤☆18Aug 24, 2025Updated 6 months ago
- 臺灣閩南語常用詞辭典 資料檔☆82May 2, 2023Updated 2 years ago
- 《香港二十世紀中期粵語語料庫》打包器☆16Apr 12, 2016Updated 9 years ago
- ⚙️Tool for NLP - handle file and text☆15Feb 16, 2025Updated last year
- ☆17Nov 18, 2024Updated last year
- ☆33Jul 15, 2025Updated 7 months ago
- 台語、族語、客語的語料清單、彙整☆46Apr 6, 2020Updated 5 years ago
- Powerful front-end framework for Flux☆21Jan 19, 2015Updated 11 years ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆23Nov 12, 2025Updated 3 months ago
- A frequency lexicon for Hong Kong Cantonese☆23Aug 27, 2020Updated 5 years ago
- JavaScript library based on Web Components.☆21Sep 6, 2022Updated 3 years ago
- 🏃 hosting nlp models in one line☆20May 8, 2024Updated last year
- Using Generative AI to generate FAQ from descriptions☆51Jun 22, 2024Updated last year
- Traditional Mandarin LLMs for Taiwan☆1,392Apr 20, 2025Updated 10 months ago
- 公開的情緒訓練資料☆58Mar 7, 2023Updated 3 years ago
- ☆31Apr 12, 2016Updated 9 years ago
- Pre-trained ELECTRA from Hong Kong data☆29Jul 7, 2020Updated 5 years ago
- Cantonese segmentation tool 粵語分詞工具☆30Aug 22, 2020Updated 5 years ago
- Phraseg - 一言:新詞發現工具包☆26Nov 30, 2021Updated 4 years ago
- PTT 網路版爬蟲☆452Mar 31, 2024Updated last year
- The best PTT library☆720Nov 22, 2025Updated 3 months ago
- Open source traditional chinese handwriting dataset.☆223May 20, 2021Updated 4 years ago