ckiplab / ckip-transformers
CKIP Transformers
☆698Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ckip-transformers
- CKIP Neural Chinese Word Segmentation, POS Tagging, and NER☆1,641Updated 5 months ago
- CKIP CoreNLP Toolkits☆115Updated last year
- PTT 八卦版問答中文語料☆238Updated last month
- API of Articut 中文斷詞 (兼具語意詞性標記):「斷詞」又稱「分詞」,是中文資訊處理的基礎。Articut 不用機器學習,不需資料模型,只用現代白話中文語法規則,即能達到 SIGHAN 2005 F1-measure 94% 以上,Recall 96% 以上的…☆408Updated last week
- A 30000+ Chinese MRC dataset - Delta Reading Comprehension Dataset☆306Updated 4 years ago
- MONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型☆245Updated 2 years ago
- A Traditional-Chinese instruction-following model with datasets based on Alpaca.☆134Updated last year
- 結巴中文斷詞台灣繁體版本☆101Updated 7 years ago
- 結巴中文斷詞台灣繁體版本☆318Updated 8 years ago
- 聯發創新基地(MediaTek Research) 致力於研究基礎模型。我們將研究體現在適合繁體中文使用者的模型上,並在使用權許可的情況下,提供模型給學術界研究或產業界使用。☆155Updated 5 months ago
- Awesome-nlp 繁體中文翻譯計畫。原作者:https://github.com/keon/awesome-nlp☆59Updated 5 years ago
- 公開的情緒訓練資料☆58Updated last year
- OpenCC made with Python☆537Updated 11 months ago
- Traditional Mandarin LLMs for Taiwan☆1,258Updated 3 months ago
- 訓練中文詞向量 Word2vec, Word2vec was created by a team of researchers led by Tomas Mikolov at Google.☆58Updated last year
- 批踢踢推文產生器☆221Updated last month
- 台北QA問答機器人(使用BERT、ALBERT)☆41Updated 4 years ago
- 高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型☆805Updated 4 years ago
- A CWN Python binding with graph structure☆26Updated last year
- Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)☆1,403Updated last year
- KeyExtractor performs keyword extraction for chinese documents with state-of-the-art transformer models without training and labeled data…☆16Updated 3 years ago
- Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)☆645Updated last year
- PERT: Pre-training BERT with Permuted Language Model☆354Updated last year
- [COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset 中文科学文献数据集☆569Updated last year
- A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models☆1,786Updated last year
- A web crawler specifically for PTT website.☆19Updated 6 years ago
- Open source traditional chinese handwriting dataset.☆178Updated 3 years ago
- 中文詞向量訓練教學☆518Updated last year
- Collections of Chinese NLP corpus☆876Updated 3 years ago