Chinese question-answer corpus from the PTT Gossiping board
☆247 · Oct 18, 2024 · Updated last year
Alternatives and similar repositories for Gossiping-Chinese-Corpus
Users who are interested in Gossiping-Chinese-Corpus are comparing it to the repositories listed below.
- PTT push-comment generator ☆224 · Oct 19, 2024 · Updated last year
- A web crawler specifically for the PTT website. ☆19 · Jul 18, 2018 · Updated 7 years ago
- CKIP Neural Chinese Word Segmentation, POS Tagging, and NER ☆1,683 · Jul 9, 2025 · Updated 11 months ago
- MONPA (罔拍) is a multi-task model that provides Traditional Chinese word segmentation, part-of-speech tagging, and named entity recognition ☆247 · Feb 20, 2025 · Updated last year
- A Traditional-Chinese instruction-following model with datasets based on Alpaca. ☆137 · Mar 28, 2023 · Updated 3 years ago
- CKIP Transformers ☆767 · Apr 21, 2023 · Updated 3 years ago
- Public emotion training data ☆58 · Mar 7, 2023 · Updated 3 years ago
- A context-aware chatbot based on vector matching ☆908 · Oct 18, 2024 · Updated last year
- Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus ☆13 · Oct 15, 2022 · Updated 3 years ago
- 🍳 NLPrep - dataset tool for many natural language processing tasks ☆28 · Jul 30, 2021 · Updated 4 years ago
- MediaTek Research (聯發創新基地) is dedicated to research on foundation models. We put that research into models suited to Traditional Chinese users and, where licensing permits, provide the models for academic research or industry use. ☆285 · Apr 23, 2026 · Updated last month
- Jieba Chinese word segmentation, Taiwan Traditional Chinese version ☆323 · Jul 15, 2016 · Updated 9 years ago
- A collection of text corpora that can be used for chatbot training, covering both Chinese and English ☆118 · Jun 7, 2018 · Updated 8 years ago
- An automation webcrawler based on Selenium library for retrieving parliamentary questions on The Website of Taiwan Legislative Yuan (http… ☆11 · Jun 8, 2023 · Updated 3 years ago
- Converted Albert Chinese models (for pytorch-transformers) ☆19 · Mar 6, 2020 · Updated 6 years ago
- CKIP CoreNLP Toolkits ☆129 · Apr 9, 2023 · Updated 3 years ago
- Traditional Mandarin LLMs for Taiwan ☆1,417 · Apr 20, 2025 · Updated last year
- Code for "A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies." ☆27 · Feb 2, 2022 · Updated 4 years ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT) ☆176 · Mar 26, 2019 · Updated 7 years ago
- Training Chinese word vectors with Word2vec. Word2vec was created by a team of researchers led by Tomas Mikolov at Google. ☆60 · Jul 4, 2023 · Updated 2 years ago
- API of Articut Chinese word segmentation (with semantic part-of-speech tagging): word segmentation is the foundation of Chinese information processing. Articut uses no machine learning and needs no data model; using only the grammar rules of modern vernacular Chinese, it reaches an F1-measure above 94% and recall above 96% on SIGHAN 2005… ☆415 · Jun 4, 2026 · Updated 2 weeks ago
- Jieba Chinese word segmentation, Taiwan Traditional Chinese version ☆114 · Nov 3, 2017 · Updated 8 years ago
- Crawler for the web version of PTT ☆450 · Mar 31, 2024 · Updated 2 years ago
- The best PTT library ☆725 · Jun 10, 2026 · Updated last week
- Data files for the Dictionary of Frequently-Used Taiwan Minnan (臺灣閩南語常用詞辭典) ☆82 · May 2, 2023 · Updated 3 years ago
- Meta-analyses of language acquisition phenomena ☆13 · May 28, 2019 · Updated 7 years ago
- AI CUP 2024 RAG ☆13 · Nov 19, 2024 · Updated last year
- A series of tutorials on sequence to sequence learning, implemented with PyTorch. ☆74 · Apr 4, 2020 · Updated 6 years ago
- A 30000+ Chinese MRC dataset - Delta Reading Comprehension Dataset ☆312 · Apr 21, 2020 · Updated 6 years ago
- Datasets for training Chinese and English chatbot systems ☆2,052 · Sep 23, 2020 · Updated 5 years ago
- 台灣新聞拆拆樂 (a Taiwan news breakdown project) ☆178 · Oct 30, 2020 · Updated 5 years ago
- AI CUP 2019 - Abstract Labeling and Classification Competition Tutorial ☆11 · Oct 21, 2019 · Updated 6 years ago
- CamRest676 is an English dataset; I translated it into Chinese for training NLU. ☆12 · Dec 20, 2017 · Updated 8 years ago
- Traditional and Simplified Chinese lexicon dictionary files ☆113 · Apr 8, 2024 · Updated 2 years ago
- A Public Corpus for Machine Learning ☆44 · Jul 3, 2018 · Updated 7 years ago
- ☆19 · Jul 20, 2022 · Updated 3 years ago
- OpenCC made with Python ☆570 · Dec 8, 2023 · Updated 2 years ago
- [Kaggle-2nd] Lightweight yet Effective Chinese LLM. ☆55 · Jun 14, 2025 · Updated last year
- Bayesian Visual Working Memory in Python. ☆13 · Mar 28, 2020 · Updated 6 years ago