PTT 八卦版問答中文語料
☆247Oct 18, 2024Updated last year
Alternatives and similar repositories for Gossiping-Chinese-Corpus
Users that are interested in Gossiping-Chinese-Corpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 批踢踢推文產生器☆224Oct 19, 2024Updated last year
- A web crawler specifically for PTT website.☆19Jul 18, 2018Updated 7 years ago
- CKIP Neural Chinese Word Segmentation, POS Tagging, and NER☆1,682Jul 9, 2025Updated 10 months ago
- MONPA 罔拍是一個提供正體中文斷詞、詞性標註以及命名實體辨識的多任務模型☆247Feb 20, 2025Updated last year
- A Traditional-Chinese instruction-following model with datasets based on Alpaca.☆136Mar 28, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- CKIP Transformers☆767Apr 21, 2023Updated 3 years ago
- 公開的情緒訓練資料☆58Mar 7, 2023Updated 3 years ago
- 基於向量匹配的情境式聊天機器人☆906Oct 18, 2024Updated last year
- Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus☆13Oct 15, 2022Updated 3 years ago
- 🍳 NLPrep - dataset tool for many natural language processing task☆28Jul 30, 2021Updated 4 years ago
- 聯發創新基地(MediaTek Research) 致力於研究基礎模型。我們將研究體現在適合繁體中文使用者的模型上,並在使用權許可的情況下,提供模型給學術界研究或產業界使用。☆274Apr 23, 2026Updated 2 weeks ago
- 結巴中文斷詞台灣繁體版本☆323Jul 15, 2016Updated 9 years ago
- 总结了一些可以用作聊天机器人训练实作的文字语聊,包含中英文不同语言☆118Jun 7, 2018Updated 7 years ago
- An automation webcrawler based on Selenium library for retrieving parliamentary questions on The Website of Taiwan Legislative Yuan (http…☆11Jun 8, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 轉換好的 Albert 中文模型 (for pytorch-transformers)☆19Mar 6, 2020Updated 6 years ago
- Traditional Mandarin LLMs for Taiwan☆1,407Apr 20, 2025Updated last year
- Code for "A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies."☆27Feb 2, 2022Updated 4 years ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)☆176Mar 26, 2019Updated 7 years ago
- 訓練中文詞向量 Word2vec, Word2vec was created by a team of researchers led by Tomas Mikolov at Google.☆60Jul 4, 2023Updated 2 years ago
- API of Articut 中文斷詞 (兼具語意詞性標記):「斷詞」又稱「分詞」,是中文資訊處理的基礎。Articut 不用機器學習,不需資料模型,只用現代白話中文語法規則,即能達到 SIGHAN 2005 F1-measure 94% 以上,Recall 96% 以上的…☆415Feb 10, 2026Updated 2 months ago
- 結巴中文斷詞台灣繁體版本☆111Nov 3, 2017Updated 8 years ago
- PTT 網路版爬蟲☆452Mar 31, 2024Updated 2 years ago
- The best PTT library☆723Apr 27, 2026Updated last week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 臺灣閩南語常用詞辭典 資料檔☆82May 2, 2023Updated 3 years ago
- meta-analyses of language acquisition phenomena☆13May 28, 2019Updated 6 years ago
- AI CUP 2024 RAG☆13Nov 19, 2024Updated last year
- a series of tutorials on sequence to sequence learning, implemented with PyTorch.☆74Apr 4, 2020Updated 6 years ago
- A 30000+ Chinese MRC dataset - Delta Reading Comprehension Dataset☆312Apr 21, 2020Updated 6 years ago
- 用于训练中英文对话系统的语料库 Datasets for Training Chatbot System☆2,052Sep 23, 2020Updated 5 years ago
- 台灣新聞拆拆樂☆177Oct 30, 2020Updated 5 years ago
- AI CUP 2019 - Abstract Labeling and Classification Competition Tutorial☆11Oct 21, 2019Updated 6 years ago
- CamRest676 is an English data set, I translate it into Chinese for training nlu.☆12Dec 20, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 繁體+簡體中文詞庫字典檔☆113Apr 8, 2024Updated 2 years ago
- A Public Corpus for Machine Learning☆44Jul 3, 2018Updated 7 years ago